Fugu-MT 論文翻訳(概要): Tracing Relational Knowledge Recall in Large Language Models

論文の概要: Tracing Relational Knowledge Recall in Large Language Models

arxiv url: http://arxiv.org/abs/2604.19934v1
Date: Tue, 21 Apr 2026 19:23:27 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-23 15:36:10.759495
Title: Tracing Relational Knowledge Recall in Large Language Models
Title（参考訳）: 大規模言語モデルにおける関係知識の追跡的リコール
Authors: Nicholas Popovič, Michael Färber,
Abstract要約: 我々は,大規模言語モデルがテキスト生成中にいかに知識を思い出すかを研究する。線形プローブによる関係分類に適した潜在表現の同定に着目する。本研究は, 線形関係分類において, 残差ストリームに対する頭部当たりの注意力の寄与が比較的強い特徴であることを示す。
参考スコア（独自算出の注目度）: 4.990228412613982
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: We study how large language models recall relational knowledge during text generation, with a focus on identifying latent representations suitable for relation classification via linear probes. Prior work shows how attention heads and MLPs interact to resolve subject, predicate, and object, but it remains unclear which representations support faithful linear relation classification and why some relation types are easier to capture linearly than others. We systematically evaluate different latent representations derived from attention head and MLP contributions, showing that per-head attention contributions to the residual stream are comparatively strong features for linear relation classification. Feature attribution analyses of the trained probes, as well as characteristics of the different relation types, reveal clear correlations between probe accuracy and relation specificity, entity connectedness, and how distributed the signal on which the probe relies is across attention heads. Finally, we show how token-level feature attribution of probe predictions can be used to reveal probe behavior in further detail.
Abstract（参考訳）: 本研究では,線形プローブによる関係分類に適した潜在表現の同定に焦点をあてて,テキスト生成中の関係知識を大規模言語モデルがいかにリレーショナルリレーショナルリレーショナルリレーショナルリレーショナルリレーショナルリレーショナルリレーショナルリレーショナルリレーショナルリレーショナルリレーショナルリレーショナルリレーショナルリレーショナルリレーショナルリレーショナルリレーショナルリレーショナルリレーショナルリレーショナルリレーショナルリレーショナルリレーショナルリレーショナルリレーショナルリレーショナルリレーショナルリレーショナルリレーショナルリレーショナルリレーショナルリレーショナルリレーショナルリレーショナルリレーショナルリレーショナルリレーショナルリレーショナルリレーショナルリレーショナルリレーショナルリレーショナルリレーショナルリ先行研究は、主観、述語、対象を解決するために、注意頭とMLPがどのように相互作用するかを示しているが、どの表現が忠実な線形関係分類を支持しているのか、また、なぜいくつかの関係型が他のものよりも線形に捉えやすいのかは定かではない。注意頭とMLPの寄与から導かれる様々な潜在表現を系統的に評価し, 線形関係分類において, 残差ストリームに対する頭当たりの注意貢献が比較的強い特徴であることを示す。訓練されたプローブの特徴属性分析と異なる関係型の特徴は、プローブ精度と関係特異性、エンティティ接続性、およびプローブが依存する信号が注目ヘッドを越えてどのように分散されているかを明確にする。最後に、プローブ予測のトークンレベルの特徴属性を用いて、さらに詳細にプローブの挙動を明らかにする方法を示す。

論文の概要: Tracing Relational Knowledge Recall in Large Language Models

関連論文リスト