Fugu-MT 論文翻訳(概要): Stream separation improves Bregman conditioning in transformers

論文の概要: Stream separation improves Bregman conditioning in transformers

arxiv url: http://arxiv.org/abs/2603.21317v1
Date: Sun, 22 Mar 2026 16:55:57 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-24 19:11:39.351597
Title: Stream separation improves Bregman conditioning in transformers
Title（参考訳）: ストリーム分離による変圧器のブレグマン条件の改善
Authors: James Clayton Kerce,
Abstract要約: 変換器表現を操る線形手法は、表現空間の幾何学がユークリッド的であることを暗黙的に仮定する。 Park et al. は、ソフトマックスは、計量テンソルが対数正規化子のヘシアンである曲線化されたブレグマン幾何学を誘導することを示した。制御された2x2設計ストリーム分離における中間層におけるこのヘシアンを層間監視により測定する。
参考スコア（独自算出の注目度）: 4.7718339202518685
License: http://creativecommons.org/licenses/by-nc-nd/4.0/
Abstract: Linear methods for steering transformer representations, including probing, activation engineering, and concept erasure, implicitly assume the geometry of representation space is Euclidean. Park et al. [Park et al., 2026] showed that softmax induces a curved Bregman geometry whose metric tensor is the Hessian of the log-normalizer, $H(λ) = Cov[γ | λ]$. Ignoring this curvature causes Euclidean steering to leak probability mass to unintended tokens. Their analysis applies at the output layer. We measure this Hessian at intermediate layers in a controlled 2x2 design crossing stream separation with per-layer supervision (vocabulary decoding loss at each layer), all at matched vocabulary and parameter count. In standard single-stream transformers, H is severely degenerate at intermediate layers (effective rank 8 in 516 dimensions). Stream separation improves conditioning by up to 22 in effective rank, even without auxiliary supervision. Per-layer supervision helps, but less. The cosine similarity between primal and dual concept directions predicts per-layer steering effectiveness on downstream tasks, with a threshold near 0.3. These results bear on the reliability of linear safety interventions, which depend on the geometry being well-conditioned at the layer where they are applied.
Abstract（参考訳）: 探索、アクティベーションエンジニアリング、概念消去を含む変換器表現を操る線形手法は、表現空間の幾何学をユークリッドと暗黙的に仮定する。 Park et al [Park et al , 2026] は、ソフトマックスが、計量テンソルが対数正規化子のヘシアンである曲線化されたブレグマン幾何学を誘導することを示した。この曲率を無視すると、ユークリッドの操舵は意図しないトークンに確率質量を漏らす。それらの分析は出力層に適用される。制御された2x2設計ストリーム分離において,このヘシアンを中間層で測定し,各層における語彙復号損失)、すべて一致する語彙とパラメータ数で測定する。標準単流変圧器では、Hは中間層(有効ランク8は516次元)で著しく縮退する。ストリーム分離は、補助的な監督なしでも、効果的なランクで22までの条件付けを改善する。レイヤ単位の監視は役に立つが、少ない。原始方向と双対方向のコサイン類似性は、下流タスクにおける層間ステアリング効果を0.3付近で予測する。これらの結果は、線形安全介入の信頼性にかかっている。

論文の概要: Stream separation improves Bregman conditioning in transformers

関連論文リスト