Fugu-MT 論文翻訳(概要): Affective Image Editing: Shaping Emotional Factors via Text Descriptions

論文の概要: Affective Image Editing: Shaping Emotional Factors via Text Descriptions

arxiv url: http://arxiv.org/abs/2505.18699v1
Date: Sat, 24 May 2025 13:46:57 GMT
ステータス: 翻訳完了
システム内更新日: 2025-05-27 16:58:42.6013
Title: Affective Image Editing: Shaping Emotional Factors via Text Descriptions
Title（参考訳）: Affective Image Editing: Forming Emotional Factors via Text Descriptions
Authors: Peixuan Zhang, Shuchen Weng, Chengxuan Zhu, Binghao Tang, Zijian Jia, Si Li, Boxin Shi,
Abstract要約: AIEdiT for Affective Image Editing using Text descriptions。我々は、連続的な感情スペクトルを構築し、ニュアンスな感情的要求を抽出する。 AIEdiTは、ユーザの感情的な要求を効果的に反映して、優れたパフォーマンスを達成する。
参考スコア（独自算出の注目度）: 46.13506671212571
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: In daily life, images as common affective stimuli have widespread applications. Despite significant progress in text-driven image editing, there is limited work focusing on understanding users' emotional requests. In this paper, we introduce AIEdiT for Affective Image Editing using Text descriptions, which evokes specific emotions by adaptively shaping multiple emotional factors across the entire images. To represent universal emotional priors, we build the continuous emotional spectrum and extract nuanced emotional requests. To manipulate emotional factors, we design the emotional mapper to translate visually-abstract emotional requests to visually-concrete semantic representations. To ensure that editing results evoke specific emotions, we introduce an MLLM to supervise the model training. During inference, we strategically distort visual elements and subsequently shape corresponding emotional factors to edit images according to users' instructions. Additionally, we introduce a large-scale dataset that includes the emotion-aligned text and image pair set for training and evaluation. Extensive experiments demonstrate that AIEdiT achieves superior performance, effectively reflecting users' emotional requests.
Abstract（参考訳）: 日常生活において、共通の感情刺激としてのイメージは広く応用されている。テキスト駆動画像編集の大幅な進歩にもかかわらず、ユーザの感情的な要求を理解することに注力する作業は限られている。本稿では,AIEdiT for Affective Image Editing for Affective Image Editing using Text descriptionsを紹介する。普遍的な感情的先行性を表現するために、連続的な感情的スペクトルを構築し、ニュアンスな感情的要求を抽出する。感情的要因を操作するために,感情的マッパーを設計し,視覚的に解釈された感情的要求から視覚的に一致した意味表現へと変換する。編集結果が特定の感情を誘発することを保証するため,モデルトレーニングを監督するMLLMを導入する。推論中、私たちは視覚要素を戦略的に歪め、それに対応する感情要素を形作り、ユーザの指示に従って画像を編集する。さらに,感情の一致したテキストと,トレーニングと評価のための画像ペアセットを含む大規模データセットも導入する。 AIEdiTは、ユーザの感情的な要求を効果的に反映し、優れたパフォーマンスを達成することを実証した。

関連論文リスト

Disentangle Identity, Cooperate Emotion: Correlation-Aware Emotional Talking Portrait Generation [63.94836524433559]
DICE-Talkは、感情と同一性を切り離し、類似した特徴を持つ感情を協調するフレームワークである。我々は、モーダル・アテンションを通して、音声と視覚の感情の手がかりを共同でモデル化するアンタングル型感情埋め込み装置を開発した。次に,学習可能な感情バンクを用いた相関強化感情調和モジュールを提案する。第3に、拡散過程における感情の一貫性を強制する感情識別目標を設計する。
論文参考訳（メタデータ） (2025-04-25T05:28:21Z)
EmoSEM: Segment and Explain Emotion Stimuli in Visual Art [25.539022846134543]
本稿では,視覚芸術理解における重要な課題に焦点をあてる。芸術的イメージを与えられたモデルは,特定の人間の感情を誘発するピクセル領域をピンポイントする。近年の芸術理解の進歩にもかかわらず、ピクセルレベルの感情理解は依然として二重の課題に直面している。本稿では,感情理解能力を持つセグメンテーションモデルSAMを実現するために,感情刺激・説明モデル(EmoSEM)を提案する。
論文参考訳（メタデータ） (2025-04-20T15:40:00Z)
EmotiCrafter: Text-to-Emotional-Image Generation based on Valence-Arousal Model [23.26111054485357]
連続感情画像コンテンツ生成(C-EICG)の新たな課題について紹介する。本稿では,テキストプロンプトとValence-Arousal値に基づいて画像を生成する感情画像生成モデルであるEmotiCrafterを提案する。
論文参考訳（メタデータ） (2025-01-10T04:41:37Z)
EmoEdit: Evoking Emotions through Image Manipulation [62.416345095776656]
Affective Image Manipulation (AIM) は、特定の感情的な反応を誘発するために、ユーザーが提供する画像を修正しようとする。本稿では,感情的影響を高めるためにコンテンツ修正を取り入れてAIMを拡張したEmoEditを紹介する。本手法は定性的かつ定量的に評価され,従来の最先端技術と比較して優れた性能を示す。
論文参考訳（メタデータ） (2024-05-21T10:18:45Z)
Make Me Happier: Evoking Emotions Through Image Diffusion Models [36.40067582639123]
そこで本研究では,感情を刺激するイメージを合成し,本来のシーンのセマンティクスと構造を保ちながら,感情を刺激するイメージを合成することを目的とした,感情誘発画像生成の新たな課題を提案する。感情編集データセットが不足しているため、34万対の画像とその感情アノテーションからなるユニークなデータセットを提供する。
論文参考訳（メタデータ） (2024-03-13T05:13:17Z)
EmoGen: Emotional Image Content Generation with Text-to-Image Diffusion Models [11.901294654242376]
本稿では,感情カテゴリを与えられた意味的明瞭で感情に忠実な画像を生成するための新しいタスクである感情画像コンテンツ生成(EICG)を紹介する。具体的には、感情空間を提案し、それを強力なコントラスト言語-画像事前学習(CLIP)空間と整合させるマッピングネットワークを構築する。本手法は,最先端のテクスト・ツー・イメージ・アプローチを定量的・質的に上回る。
論文参考訳（メタデータ） (2024-01-09T15:23:21Z)
Contextual Emotion Estimation from Image Captions [0.6749750044497732]
大規模言語モデルが文脈的感情推定タスクをサポートできるかを,まずイメージをキャプションし,LLMを用いて推論する。 EMOTICデータセットから331画像のサブセットのキャプションと感情アノテーションを生成する。 GPT-3.5(特にtext-davinci-003モデル)は、人間のアノテーションと一致した驚くほど合理的な感情予測を提供する。
論文参考訳（メタデータ） (2023-09-22T18:44:34Z)
SOLVER: Scene-Object Interrelated Visual Emotion Reasoning Network [83.27291945217424]
画像から感情を予測するために,SOLVER(Scene-Object Interrelated Visual Emotion Reasoning Network)を提案する。異なるオブジェクト間の感情関係を掘り下げるために、まずセマンティックな概念と視覚的特徴に基づいて感情グラフを構築します。また、シーンとオブジェクトを統合するScene-Object Fusion Moduleを設計し、シーンの特徴を利用して、提案したシーンベースのアテンションメカニズムでオブジェクトの特徴の融合プロセスを導出する。
論文参考訳（メタデータ） (2021-10-24T02:41:41Z)
Emotion Recognition from Multiple Modalities: Fundamentals and Methodologies [106.62835060095532]
マルチモーダル感情認識(MER)のいくつかの重要な側面について論じる。まず、広く使われている感情表現モデルと感情モダリティの簡単な紹介から始める。次に、既存の感情アノテーション戦略とそれに対応する計算タスクを要約する。最後に,実世界のアプリケーションについて概説し,今後の方向性について論じる。
論文参考訳（メタデータ） (2021-08-18T21:55:20Z)
Enhancing Cognitive Models of Emotions with Representation Learning [58.2386408470585]
本稿では,きめ細かな感情の埋め込み表現を生成するための,新しいディープラーニングフレームワークを提案する。本フレームワークは,コンテキスト型埋め込みエンコーダとマルチヘッド探索モデルを統合する。本モデルは共感対話データセット上で評価され,32種類の感情を分類する最新結果を示す。
論文参考訳（メタデータ） (2021-04-20T16:55:15Z)

関連論文リストは本サイト内にある論文のタイトル・アブストラクトから自動的に作成しています。

指定された論文の情報です。
本サイトの運営者は本サイト（すべての情報・翻訳含む）の品質を保証せず、本サイト（すべての情報・翻訳含む）を使用して発生したあらゆる結果について一切の責任を負いません。