Fugu-MT 論文翻訳(概要): A Mechanistic Account of Attention Sinks in GPT-2: One Circuit, Broader Implications for Mitigation

論文の概要: A Mechanistic Account of Attention Sinks in GPT-2: One Circuit, Broader Implications for Mitigation

arxiv url: http://arxiv.org/abs/2604.14722v1
Date: Thu, 16 Apr 2026 07:32:37 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-17 21:29:31.782149
Title: A Mechanistic Account of Attention Sinks in GPT-2: One Circuit, Broader Implications for Mitigation
Title（参考訳）: GPT-2における注意シンクの力学的考察:1つの回路, 緩和に対するより広い意味
Authors: Yuval Ran-Milo, Hila Ofek, Shahar Mendel,
Abstract要約: トランスフォーマーは一般的に注意シンクを呈し、第1位に不当に高い注意を払っている。学習されたクエリバイアスと絶対的な位置埋め込みを持つGPT-2スタイルのモデルにおいて,この挙動について検討する。
参考スコア（独自算出の注目度）: 1.2744523252873352
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Transformers commonly exhibit an attention sink: disproportionately high attention to the first position. We study this behavior in GPT-2-style models with learned query biases and absolute positional embeddings. Combining structural analysis with causal interventions, validated across natural-language, mathematical, and code inputs, we find that the sink arises from the interaction among (i) a learned query bias, (ii) the first-layer MLP transformation of the positional encoding, and (iii) structure in the key projection. Crucially, each component we identify is individually dispensable: architectures omitting each of them robustly exhibit sinks. This indicates that attention sinks may arise through distinct circuits across architectures. These findings inform mitigation of sinks, and motivate broader investigation into why sinks emerge.
Abstract（参考訳）: トランスフォーマーは一般的に注意シンクを示しており、第1位に不当に高い注意を払っている。学習されたクエリバイアスと絶対的な位置埋め込みを持つGPT-2スタイルのモデルにおいて,この挙動について検討する。構造解析と因果介入を組み合わせ、自然言語、数学的、コード入力で検証した結果、シンクは相互の相互作用から生じることがわかった。 i) 学習されたクエリバイアス。 (ii)位置符号化の第1層MLP変換及び (iii)キープロジェクションの構造重要なのは、各コンポーネントは個別に発行可能で、それぞれのコンポーネントを省略するアーキテクチャは、しっかりとシンクを表示することです。これは、アーキテクチャを横断する異なる回路を通して注意シンクが発生する可能性があることを示している。これらの知見は、沈み込みの緩和を示唆し、なぜ沈み込みが発生するのかを広範囲に調査する動機となっている。

論文の概要: A Mechanistic Account of Attention Sinks in GPT-2: One Circuit, Broader Implications for Mitigation

関連論文リスト