Fugu-MT 論文翻訳(概要): A Universal Vibe? Finding and Controlling Language-Agnostic Informal Register with SAEs

論文の概要: A Universal Vibe? Finding and Controlling Language-Agnostic Informal Register with SAEs

arxiv url: http://arxiv.org/abs/2603.26236v1
Date: Fri, 27 Mar 2026 09:58:31 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-30 21:49:48.433843
Title: A Universal Vibe? Finding and Controlling Language-Agnostic Informal Register with SAEs
Title（参考訳）: ユニバーサルバイブ : SAEを用いた言語非依存型インフォーマルレジスタの検索と制御
Authors: Uri Z. Kialy, Avi Shtarkberg, Ayal Klein,
Abstract要約: 多言語言語モデルは、スラングのような文化固有の実用的なレジスタを、独立した言語固有の記憶として、あるいは統一された抽象概念として処理するかを検討する。目的語はすべて多義語であり、リテラルと非公式の両方の文脈に現れる新しいデータセットを提案する。非公式登録信号の多くは言語固有の特徴に分散しているが、小さなが非常に堅牢な言語間コアは一貫して出現する。
参考スコア（独自算出の注目度）: 0.858070544154173
License: http://creativecommons.org/licenses/by/4.0/
Abstract: While multilingual language models successfully transfer factual and syntactic knowledge across languages, it remains unclear whether they process culture-specific pragmatic registers, such as slang, as isolated language-specific memorizations or as unified, abstract concepts. We study this by probing the internal representations of Gemma-2-9B-IT using Sparse Autoencoders (SAEs) across three typologically diverse source languages: English, Hebrew, and Russian. To definitively isolate pragmatic register processing from trivial lexical sensitivity, we introduce a novel dataset in which every target term is polysemous, appearing in both literal and informal contexts. We find that while much of the informal-register signal is distributed across language-specific features, a small but highly robust cross-linguistic core consistently emerges. This shared core forms a geometrically coherent ``informal register subspace'' that sharpens in the model's deeper layers. Crucially, these shared representations are not merely correlational: activation steering with these features causally shifts output formality across all source languages and transfers zero-shot to six unseen languages spanning diverse language families and scripts. Together, these results provide the first mechanistic evidence that multilingual LLMs internalize informal register not just as surface-level heuristics, but as a portable, language-agnostic pragmatic abstraction.
Abstract（参考訳）: 多言語言語モデルは、事実的および構文的知識を言語間で伝達することに成功しているが、スラングのような文化固有の実用的レジスタを独立した言語固有の記憶として処理するか、あるいは統一された抽象概念として処理するかは定かではない。 Sparse Autoencoders (SAEs) を用いたGemma-2-9B-ITの内部表現を, 英語, ヘブライ語, ロシア語の3言語で検討した。現実的なレジスタ処理を自明な語彙感から断定的に分離するために,各目的語が多義語であり,リテラルと非公式の両方の文脈に現れる新しいデータセットを提案する。非公式登録信号の多くは言語固有の特徴に分散しているが、小さなが非常に堅牢な言語間コアは一貫して出現する。この共有コアは、幾何学的にコヒーレントな ` `informal register subspace''' を形成し、モデルのより深い層を鋭くする。アクティベーションのステアリングは、すべてのソース言語で出力の形式を因果的にシフトさせ、さまざまな言語ファミリーとスクリプトにまたがる6つの目に見えない言語にゼロショットを転送する。これらの結果は、多言語LLMが、表面レベルのヒューリスティックとしてだけでなく、ポータブルで言語に依存しない実用的な抽象概念として、非公式なレジスタを内部化する最初の機械的証拠となる。

論文の概要: A Universal Vibe? Finding and Controlling Language-Agnostic Informal Register with SAEs

関連論文リスト