Fugu-MT 論文翻訳(概要): Why LLMs Hallucinate on Structured Knowledge: A Mechanistic Analysis of Reasoning over Linearized Representations

論文の概要: Why LLMs Hallucinate on Structured Knowledge: A Mechanistic Analysis of Reasoning over Linearized Representations

arxiv url: http://arxiv.org/abs/2605.26362v1
Date: Mon, 25 May 2026 22:08:59 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-27 17:51:41.488847
Title: Why LLMs Hallucinate on Structured Knowledge: A Mechanistic Analysis of Reasoning over Linearized Representations
Title（参考訳）: LLMが構造化知識に幻覚を与える理由--線形表現による推論の力学解析
Authors: Shanghao Li, Jinda Han, Yibo Wang, Yuanjie Zhu, Zihe Song, Langzhou He, Kenan Kamel A Alghythee, Philip S. Yu,
Abstract要約: 幻覚はランダムノイズではなく、系統的な内部力学から生じる。第一に、注意は、完全なコンテキストに分散するのではなく、ショートカットのような構造的手がかりに焦点を合わせている。第二に、フィードフォワード表現は提供された知識を根拠にせず、モデルがパラメトリックメモリに戻る。
参考スコア（独自算出の注目度）: 36.45654492179688
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: In many reasoning tasks, large language models (LLMs) rely on structured external knowledge, such as graphs and tables, which is typically linearized into sequential token representations. However, even when sufficient knowledge is available, LLMs can still produce hallucinated outputs, and the underlying mechanisms behind such failures remain poorly understood. We investigate these mechanisms and find that hallucinations arise from systematic internal dynamics rather than random noise. First, attention disproportionately concentrates toward shortcut-like structural cues rather than distributing across the full context. Second, feed-forward representations fail to ground the provided knowledge, causing the model to revert to parametric memory. Moreover, our results indicate that hallucination is consistently associated with failures in semantic grounding within feed-forward layers, while attention allocation exhibits greater task-dependent variability. Finally, we show that these mechanistic patterns generalize beyond single-hop graphs to multi-hop and tabular settings, enabling effective hallucination detection across structured knowledge formats.
Abstract（参考訳）: 多くの推論タスクにおいて、大きな言語モデル(LLM)はグラフやテーブルのような構造化された外部知識に依存し、通常はシーケンシャルなトークン表現に線形化される。しかし、十分な知識が得られても、LLMは依然として幻覚出力を生成でき、そのような失敗の背後にあるメカニズムはよく分かっていない。これらのメカニズムを解明し、ランダムノイズではなく系統的な内部力学から幻覚が生じることを確認する。第一に、注意は、完全なコンテキストに分散するのではなく、ショートカットのような構造的手がかりに焦点を合わせている。第二に、フィードフォワード表現は提供された知識を根拠にせず、モデルがパラメトリックメモリに戻る。さらに,本研究の結果から,ハロゲン化はフィードフォワード層内のセマンティックグラウンドリングの失敗と一貫して関連していることが明らかとなった。最後に、これらの力学パターンは、シングルホップグラフを超えて、マルチホップおよび表のセッティングに一般化され、構造化された知識形式間の効果的な幻覚検出を可能にすることを示す。

論文の概要: Why LLMs Hallucinate on Structured Knowledge: A Mechanistic Analysis of Reasoning over Linearized Representations

関連論文リスト