Fugu-MT 論文翻訳(概要): Exposing Hallucinations To Suppress Them: VLMs Representation Editing With Generative Anchors

論文の概要: Exposing Hallucinations To Suppress Them: VLMs Representation Editing With Generative Anchors

arxiv url: http://arxiv.org/abs/2509.21997v1
Date: Fri, 26 Sep 2025 07:24:28 GMT
ステータス: 翻訳完了
システム内更新日: 2025-09-29 20:57:54.271063
Title: Exposing Hallucinations To Suppress Them: VLMs Representation Editing With Generative Anchors
Title（参考訳）: 幻覚を抑える:VLMの表現をジェネレーションアンカーで編集する
Authors: Youxu Shi, Suorong Yang, Dong Liu,
Abstract要約: マルチモーダル大規模言語モデル (MLLM) は様々な視覚言語タスクにおいて顕著な成功を収めている。 MLLMは幻覚に非常に敏感であり、視覚的証拠とは相容れない内容を生み出す。本研究では,幻覚の緩和のための訓練不要で自己指導的な方法を提案する。
参考スコア（独自算出の注目度）: 8.089908150148554
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Multimodal large language models (MLLMs) have achieved remarkable success across diverse vision-language tasks, yet they remain highly susceptible to hallucinations, producing content that is fluent but inconsistent with visual evidence. Such hallucinations, spanning objects, attributes, and relations, persist even in larger models, while existing mitigation approaches often require additional finetuning, handcrafted priors, or trade-offs that compromise informativeness and scalability. To address this limitation, we propose a training-free, self-supervised method for hallucination mitigation. Our approach introduces a novel hallucination amplification mechanism: a caption is projected into the visual space via a text-to-image model to reveal implicit hallucination signals, serving as a negative anchor, while the original image provides a positive anchor. Leveraging these dual anchors, we edit decoder hidden states by pulling representations toward faithful semantics and pushing them away from hallucination directions. This correction requires no human priors or additional training costs, ensuring both effectiveness and efficiency. Extensive experiments across multiple benchmarks show that our method significantly reduces hallucinations at the object, attribute, and relation levels while largely preserving recall and caption richness, e.g., achieving a hallucination reduction by over 5% using LLaVA-v1.5-7B on CHAIR. Furthermore, results on diverse architectures, including LLaVA-NEXT-7B, Cambrian-8B, and InstructBLIP-7B, validate strong cross-architecture generalization. More importantly, when applied to hallucination-free captions, our method introduces almost no side effects, underscoring its robustness and practical plug-and-play applicability. The implementation will be publicly available.
Abstract（参考訳）: MLLM(Multimodal large language model)は様々な視覚言語タスクにおいて顕著な成功を収めてきたが、幻覚の影響を受けにくいままであり、視覚的エビデンスとは相容れない内容を生み出している。このような幻覚(オブジェクト、属性、関係性)は、より大きなモデルでも持続するが、既存の緩和アプローチでは、追加の微調整、手作りの事前処理、情報性やスケーラビリティを損なうトレードオフが必要になることが多い。この制限に対処するために,幻覚軽減のための訓練不要で自己指導的な方法を提案する。提案手法では,視覚空間にキャプションを投影し,暗黙の幻覚信号を明らかにすることで,負のアンカーとして機能し,元の画像は正のアンカーとして機能する。これらの双対アンカーを利用することで、表現を忠実な意味論へ引き寄せ、幻覚の方向から引き離すことにより、デコーダの隠された状態を編集する。この修正は、人間の事前や追加のトレーニングコストを必要とせず、有効性と効率性の両方を保証します。 LLaVA-v1.5-7B を CHAIR 上で5% 以上の幻覚化を達成し, 記憶とキャプションの豊かさを保ちながら, 対象, 属性, 関係レベルの幻覚を著しく低減できることを示す。さらに、LLaVA-NEXT-7B、Cambrian-8B、InstructBLIP-7Bなどの多様なアーキテクチャの結果は、強いクロスアーキテクチャの一般化を検証する。さらに, 幻覚のない字幕に適用した場合, 副作用はほとんど見られず, その頑健さと実用的プラグアンドプレイの適用性が強調される。実装は一般公開される予定だ。

論文の概要: Exposing Hallucinations To Suppress Them: VLMs Representation Editing With Generative Anchors

関連論文リスト