Fugu-MT 論文翻訳(概要): When Language Representations Interact: Separability and Cross-Lingual Effects in LLMs

論文の概要: When Language Representations Interact: Separability and Cross-Lingual Effects in LLMs

arxiv url: http://arxiv.org/abs/2606.14347v1
Date: Fri, 12 Jun 2026 11:00:59 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-15 16:00:42.869786
Title: When Language Representations Interact: Separability and Cross-Lingual Effects in LLMs
Title（参考訳）: 言語表現の相互作用: LLMにおける分離性と言語間影響
Authors: Boris Marinov, Angira Sharma, Christian Schroeder de Witt, Philip Torr, Anisoara Calinescu, Jialin Yu,
Abstract要約: 大規模言語モデルは強い多言語能力を示すが、その内部表現は解釈が難しい。近年の研究では、因果幾何学構造が、どのようにある概念がほぼ線形かつ分離可能な方向としてエンコードされているかを説明することができることが示されている。因果幾何学的解析を多言語LLMに適用し、3つのモデル間の28の両言語コントラストについて検討した。
参考スコア（独自算出の注目度）: 20.39151103511549
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Large language models exhibit strong multilingual capabilities, however, their internal representations are difficult to interpret. Understanding these interactions is important for ensuring reliable behavior in multilingual systems. Recent work has shown that causal-geometric structure can explain how certain concepts are encoded as approximately linear and separable directions, but whether this framework extends to multilingual models, where language identity is correlated and hierarchical, is underexplored. We apply causal-geometric analysis to multilingual LLMs, studying 28 bilingual contrasts across three models, allowing us to analyze when languages behave as approximately independent factors and when structured dependencies persist. We find evidence that language concepts admit stable linear representations that are largely separable under a covariance-adjusted (causal) inner product, with structured deviations reflecting linguistic similarity. Moreover, languages within the same family (such as Germanic or Romance) exhibit a simplex-like geometric structure, suggesting hierarchical organization. These results extend causal-geometric interpretability to multilingual settings and provide insight into how separability and similarity may exist in multilingual LLM representations, motivating interpretability analyses that diagnose when and how structured dependencies between concepts can be anticipated. This has implications for trustworthy deployment, as residual structure between languages may lead to unintended cross-lingual effects when models are monitored or intervened upon.
Abstract（参考訳）: 大規模言語モデルは強い多言語能力を示すが、内部表現の解釈は困難である。これらの相互作用を理解することは、多言語システムにおける信頼性の高い振る舞いを保証するために重要である。最近の研究で、因果幾何学構造は、ある概念がほぼ線形で分離可能な方向としてエンコードされているかを説明することができることが示されているが、この枠組みが言語同一性と階層性が相関する多言語モデルにまで拡張されているかどうかは未定である。因果幾何学的解析を多言語LLMに適用し、3つのモデルにまたがる28の両言語コントラストを調査し、言語がほぼ独立した要因として振る舞うとき、構造的依存関係が持続するときの分析を可能にする。言語概念は、共分散調整された(因果的)内部積の下で大きく分離可能な安定な線形表現を認め、言語的類似性を反映した構造的偏差を持つことを示す。さらに、同族の言語(ゲルマン語やロマンス語など)は、単純な幾何学的構造を示し、階層的な構造を示唆している。これらの結果は、多言語的設定に対する因果的幾何学的解釈可能性を拡張し、多言語LLM表現における分離性と類似性がどのように存在するかについての洞察を与え、概念間の構造的依存関係をいつ、どのように予測できるかを診断する解釈可能性分析を動機付けている。これは、言語間の残留構造が、モデルを監視したり介入したりするときに意図しない言語間影響をもたらす可能性があるため、信頼できるデプロイメントに影響を及ぼす。

論文の概要: When Language Representations Interact: Separability and Cross-Lingual Effects in LLMs

関連論文リスト