Fugu-MT Paper Translation (Abstract): Give it Space! Explicit Disentangling of Positional and Semantic Representations in Encoders

Paper abstract: Give it Space! Explicit Disentangling of Positional and Semantic Representations in Encoders

arxiv url: http://arxiv.org/abs/2605.30022v1
Date: Thu, 28 May 2026 14:42:25 GMT
Status: Translation complete
Last updated in system: 2026-05-30 02:45:56.400918
Title: Give it Space! Explicit Disentangling of Positional and Semantic Representations in Encoders
Title (reference translation): Give It Space! Explicit Disentanglement of Positional and Semantic Representations in Encoders
Authors: Pierre-Antoine Lequeu, Camille Barboule, Benjamin Piwowarski,
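Abstract summary: Positional encoding (PE) underpins how permutation-invariant Transformers represent sequence order. PE methods such as RoPE struggle on tasks such as long-context understanding or retrieval. We build on evidence that positional and semantic signals occupy nearly orthogonal subspaces in trained Transformers.
Reference score (site-computed attention score): 8.72344410197391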
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Positional encoding (PE) underpins how permutation-invariant Transformers represent sequence order, yet how positional information is processed and stored remains poorly understood. Modern PE methods such as RoPE still struggle on tasks such as long-context understanding or retrieval \cite{chen-etal-2025-hope}. Hence, a better understanding of the internal positional mechanism could help design better PE. Building on evidence that positional and semantic signals occupy nearly orthogonal subspaces in trained Transformers, we modify an encoder Transformer to process three explicitly disentangled streams: semantic, absolute positional (AP) and relative positional (RP), and confine the masked-language-modeling (MLM) objective to the semantic stream. This decoupling enables a clean mechanistic study and yields three take-aways. (1) The isolated AP subspace spontaneously collapses into a low-frequency two-dimensional manifold that captures the structure of the document; (2) Attention heads specialize into structure and semantic-oriented groups, with RP exclusively supporting the latter; (3) Standard positional encodings do not robustly retain macroscopic structure: RoPE and RP only weakly encode it, and entangled AP loses it in the final layers under MLM pressure. The disentangled approach preserves positional encoding, which improves linguistic representation on 49 of the 65 linguistic phenomena of the Flash-Holmes probing benchmark.
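Abstract (reference translation): Positional encoding (PE) underpins how permutation-invariant Transformers represent sequence order, but how positional information is processed and stored is not well understood. Modern PE methods such as RoPE struggle on tasks such as long-context understanding or retrieval. A better understanding of the internal positional mechanism could therefore help in designing better PE. Building on evidence that positional and semantic signals occupy nearly orthogonal subspaces in trained Transformers, we modify an encoder Transformer to process three explicitly disentangled streams: semantic, absolute positional (AP), and relative positional (RP), and we confine the masked-language-modeling (MLM) objective to the semantic stream. This decoupling enables a clean mechanistic study and yields three takeaways. (1) The isolated AP subspace spontaneously collapses into a low-frequency two-dimensional manifold that captures the structure of the document; (2) attention heads specialize into structure-oriented and semantic-oriented groups, with RP supporting only the latter; (3) standard positional encodings do not robustly retain macroscopic structure: RoPE and RP encode it only weakly, and entangled AP loses it in the final layers under MLM pressure. The disentangled approach preserves positional encoding, which improves linguistic representation on 49 of the 65 linguistic phenomena of the Flash-Holmes probing benchmark.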

Related papers