Fugu-MT 論文翻訳(概要): Towards LLM-centric Affective Visual Customization via Efficient and Precise Emotion Manipulating

論文の概要: Towards LLM-centric Affective Visual Customization via Efficient and Precise Emotion Manipulating

arxiv url: http://arxiv.org/abs/2602.18016v1
Date: Fri, 20 Feb 2026 06:12:48 GMT
ステータス: 翻訳完了
システム内更新日: 2026-02-23 18:01:41.246451
Title: Towards LLM-centric Affective Visual Customization via Efficient and Precise Emotion Manipulating
Title（参考訳）: 効率・高精度感情操作によるLCM中心の視覚カスタマイズに向けて
Authors: Jiamin Luo, Xuqian Gu, Jingjing Wang, Jiahong Lu,
Abstract要約: 本稿では,マルチモーダル LLM による主観的感情の修正の中で画像を生成することに焦点を当てた,感情的視覚カスタマイズ(L-AVC)タスクを提案する。効率の良い感情間変換(EIC)モジュールを、編集前後のセマンティクスにおける感情変換を効率よく整合させるように調整し、その後に、感情に依存しないコンテンツを正確に保持する精密な感情保持(PER)モジュールを設ける。
参考スコア（独自算出の注目度）: 6.478514718464069
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Previous studies on visual customization primarily rely on the objective alignment between various control signals (e.g., language, layout and canny) and the edited images, which largely ignore the subjective emotional contents, and more importantly lack general-purpose foundation models for affective visual customization. With this in mind, this paper proposes an LLM-centric Affective Visual Customization (L-AVC) task, which focuses on generating images within modifying their subjective emotions via Multimodal LLM. Further, this paper contends that how to make the model efficiently align emotion conversion in semantics (named inter-emotion semantic conversion) and how to precisely retain emotion-agnostic contents (named exter-emotion semantic retaining) are rather important and challenging in this L-AVC task. To this end, this paper proposes an Efficient and Precise Emotion Manipulating approach for editing subjective emotions in images. Specifically, an Efficient Inter-emotion Converting (EIC) module is tailored to make the LLM efficiently align emotion conversion in semantics before and after editing, followed by a Precise Exter-emotion Retaining (PER) module to precisely retain the emotion-agnostic contents. Comprehensive experimental evaluations on our constructed L-AVC dataset demonstrate the great advantage of the proposed EPEM approach to the L-AVC task over several state-of-the-art baselines. This justifies the importance of emotion information for L-AVC and the effectiveness of EPEM in efficiently and precisely manipulating such information.
Abstract（参考訳）: 視覚的カスタマイズに関するこれまでの研究は、主に、様々な制御信号(例えば、言語、レイアウト、キャニー)と、主観的な感情的内容を無視した編集画像との客観的なアライメントに依存しており、さらに、感情的な視覚的カスタマイズのための汎用的な基礎モデルが欠如している。そこで本研究では,マルチモーダル LLM を用いた主観的感情の修正におけるイメージ生成に焦点を当てた LLM 中心の Affective Visual Customization (L-AVC) タスクを提案する。さらに、このL-AVCタスクにおいて、モデルがセマンティクス(感情間セマンティクス変換)の感情変換を効率的に整合させる方法と、感情に依存しないコンテンツを正確に保持する方法(exter-emotion semantic retaining)がより重要であり、挑戦的であることを主張する。そこで本研究では,画像中の主観的感情を編集するための効率的かつ高精度な感情操作手法を提案する。具体的には、効率の良い感情間変換(EIC)モジュールを、編集前後のセマンティクスにおける感情変換を効率よく整合させるように調整し、続いて、感情非依存の内容を正確に保持する高精度な感情表現保持(PER)モジュールを設ける。構築したL-AVCデータセットに対する総合的な実験的評価は、いくつかの最先端ベースライン上でのL-AVCタスクに対するEPEMアプローチの大きな利点を示している。このことは、L-AVCにおける感情情報の重要性と、これらの情報を効率的に正確に操作するEPEMの有効性を正当化する。

論文の概要: Towards LLM-centric Affective Visual Customization via Efficient and Precise Emotion Manipulating

関連論文リスト