Fugu-MT 論文翻訳(概要): Task Vector Geometry Underlies Dual Modes of Task Inference in Transformers

論文の概要: Task Vector Geometry Underlies Dual Modes of Task Inference in Transformers

arxiv url: http://arxiv.org/abs/2605.03780v1
Date: Tue, 05 May 2026 14:07:55 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-06 19:35:43.961791
Title: Task Vector Geometry Underlies Dual Modes of Task Inference in Transformers
Title（参考訳）: タスクベクトル幾何は変圧器のタスク推論の2モードを下方へ
Authors: Hao Yan, Haolin Yang, Yiqiao Zhong,
Abstract要約: トランスフォーマーは2つの推論モードを通してコンテキストから潜在タスクを推論するのに効果的である。近年の解釈可能性研究は中間層表現からタスク固有の方向を同定している。 2つの推論モードが1つのモデル内で共存可能であることを示す。
参考スコア（独自算出の注目度）: 6.89278796238822
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Transformers are effective at inferring the latent task from context via two inference modes: recognizing a task seen during training, and adapting to a novel one. Recent interpretability studies have identified from middle-layer representations task-specific directions, or task vectors, that steer model behavior. However, a lack of rigorous foundations hinders connecting internal representations to external model behavior: existing work fails to explain how task-vector geometry is shaped by the training distribution, and what geometry enables out-of-distribution (OOD) generalization. In this paper, we study these questions in a controlled synthetic setting by training small transformers from scratch on latent-task sequence distributions, which allows a principled mathematical characterization. We show that two inference modes can coexist within a single model. In-distribution behavior is governed by Bayesian task retrieval, implemented internally through convex combinations of learned task vectors. OOD behavior, by contrast, arises through extrapolative task learning, whose representations occupy a subspace nearly orthogonal to the task-vector subspace. Taken together, our results suggest that task-vector geometry, training distributions, and generalization behaviors are closely related.
Abstract（参考訳）: トランスフォーマーは、トレーニング中に見られるタスクを認識し、新しいタスクに適応する2つの推論モードを通じて、潜在タスクをコンテキストから推論するのに効果的である。近年の解釈可能性研究は、ステアモデル行動を示す中間層表現のタスク固有方向(タスクベクトル)から特定されている。しかし、厳密な基礎の欠如は、内部表現と外部モデル行動の接続を妨げる: 既存の作業は、トレーニング分布によってタスクベクトル幾何学がどのように形成されているか、また、幾何がアウト・オブ・ディストリビューション(OOD)の一般化を可能にするのかを説明するのに失敗する。本稿では,これらの質問を,スクラッチからラテント・タスク列の分布を学習することで,制御された合成条件下で研究し,数学的特徴付けを可能にする。 2つの推論モードが1つのモデル内で共存可能であることを示す。分布内挙動は、学習されたタスクベクトルの凸結合を通して内部的に実装されたベイズタスク検索によって制御される。対照的に、OODの振る舞いは外挿的タスク学習によって生じ、その表現はタスクベクトル部分空間とほぼ直交する部分空間を占有する。その結果,タスクベクトル幾何学,トレーニング分布,一般化挙動が密接に関連していることが示唆された。

論文の概要: Task Vector Geometry Underlies Dual Modes of Task Inference in Transformers

関連論文リスト