Fugu-MT 論文翻訳(概要): Vocabulary Hijacking in LVLMs: Unveiling Critical Attention Heads by Excluding Inert Tokens to Mitigate Hallucination

論文の概要: Vocabulary Hijacking in LVLMs: Unveiling Critical Attention Heads by Excluding Inert Tokens to Mitigate Hallucination

arxiv url: http://arxiv.org/abs/2605.10622v1
Date: Mon, 11 May 2026 14:16:25 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-12 23:28:50.884372
Title: Vocabulary Hijacking in LVLMs: Unveiling Critical Attention Heads by Excluding Inert Tokens to Mitigate Hallucination
Title（参考訳）: LVLMにおける語彙ハイジャック: 幻覚を緩和するために不活性なトークンを除外して致命的な注意を喚起する
Authors: Yangneng Chen, Junlin Li, Weijun Yao, Xilai Ma, Guodong Du, Wenya Wang, Jing Li,
Abstract要約: 我々はロジットレンズを用いて注意機構を解析し、Vocabulary Hijacking(語彙ハイジャック)という別個の異常を明らかにする。本研究では,ハイジャック・アウェア・ビジュアル・アテンション・エンハンスメント(HAVAE, Hijacking-Aware Visual Attention Enhancement)を提案する。
参考スコア（独自算出の注目度）: 25.643533134113607
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Large Vision-Language Models (LVLMs) have achieved remarkable progress in multimodal tasks, yet their reliability is persistently undermined by hallucinations-generating text that contradicts visual input. Recent studies often attribute these errors to inadequate visual attention. In this work, we analyze the attention mechanisms via the logit lens, uncovering a distinct anomaly we term Vocabulary Hijacking. We discover that specific visual tokens, defined as Inert Tokens, disproportionately attract attention. Crucially, when their intermediate hidden states are projected into the vocabulary space, they consistently decode to a fixed set of unrelated words (termed Hijacking Anchors) across layers, revealing a rigid semantic collapse. Leveraging this semantic rigidity, we propose Hijacking Anchor-Based Identification (HABI), a robust strategy to accurately localize these Inert Tokens. To quantify the impact of this phenomenon, we introduce the Non-Hijacked Visual Attention Ratio (NHAR), a novel metric designed to identify attention heads that remain resilient to hijacking and are critical for factual accuracy. Building on these insights, we propose Hijacking-Aware Visual Attention Enhancement (HAVAE), a training-free intervention that selectively strengthens the focus of these identified heads on salient visual content. Extensive experiments across multiple benchmarks demonstrate that HAVAE significantly mitigates hallucinations with no additional computational overhead, while preserving the model's general capabilities. Our code is publicly available at https://github.com/lab-klc/HAVAE.
Abstract（参考訳）: LVLM(Large Vision-Language Models)はマルチモーダルタスクにおいて顕著な進歩を遂げているが、その信頼性は視覚入力と矛盾する幻覚生成テキストによって持続的に損なわれている。近年の研究では、これらの誤りは視覚的注意の欠如に起因していることが多い。本研究では,ロジットレンズを用いて注意機構を解析し,Vocabulary Hijacking(語彙ハイジャック)という別個の異常を明らかにする。 Inert Tokensとして定義された特定の視覚トークンが不当に注目を集めていることに気づいた。重要なことに、それらの中間隠れ状態が語彙空間に射影されると、それらは一貫して階層をまたいだ無関係な単語(ヒジャックアンカー(Hijacking Anchors)と呼ばれる)の固定セットにデコードされ、厳密な意味的崩壊が明らかになる。このセマンティックな剛性を活用して、これらの慣性トークンを正確にローカライズするための頑健な戦略であるHijacking Anchor-Based Identification (HABI)を提案する。この現象の影響を定量化するために,非ハイジャック型視覚注意率(NHAR)を導入する。これらの知見に基づいて,ハイジャック・アウェア・ビジュアル・アテンション・エンハンスメント(HAVAE)を提案する。複数のベンチマークにわたる大規模な実験により、HAVAEはモデル全体の能力を保ちながら、追加の計算オーバーヘッドなしで幻覚を著しく緩和することを示した。私たちのコードはhttps://github.com/lab-klc/HAVAEで公開されています。

論文の概要: Vocabulary Hijacking in LVLMs: Unveiling Critical Attention Heads by Excluding Inert Tokens to Mitigate Hallucination

関連論文リスト