Fugu-MT 論文翻訳(概要): How Few-Shot Examples Add Up: A Causal Decomposition of Function Vectors in In-Context Learning

論文の概要: How Few-Shot Examples Add Up: A Causal Decomposition of Function Vectors in In-Context Learning

arxiv url: http://arxiv.org/abs/2605.16591v1
Date: Fri, 15 May 2026 19:49:56 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-19 17:57:46.717958
Title: How Few-Shot Examples Add Up: A Causal Decomposition of Function Vectors in In-Context Learning
Title（参考訳）: In-Context学習における関数ベクトルの因果分解
Authors: Entang Wang, Yiwei Wang, Aleksandra Bakalova, Michael Hahn,
Abstract要約: In-context Learning (ICL)は、最小限の例から新しいタスクを抽出する。モデル関数ベクトル (FV) のショットプロンプトがいかに少ないかを示す。実験がFVを支配している適応的再重み付けの先行例に基づいて,各例の表現を文脈的に表現することを示す。
参考スコア（独自算出の注目度）: 48.80008450455555
License: http://creativecommons.org/licenses/by/4.0/
Abstract: In-context learning (ICL) excels at new tasks from minimal examples, yet we still lack a mechanistic explanation of how few-shot prompts shape a model's function vector (FV)--a causal activation direction that drives task behavior on the ICL query. Across tasks and models, an $n$-shot FV is well-approximated by a linear combination of example-level sub-FVs, suggesting additive and composable contributions from individual demonstrations. Beyond additivity, we show that models contextualize individual examples' representations based on prior examples to adaptively reweight which demonstrations dominate the FV: attention shifts toward examples that are more informative and less ambiguous under the context. Finally, a causal decomposition separates Query-Key routing from Value updates, finding that contextualization's most consistent contributions to FV quality arise from Query-Key alignment--particularly in ambiguous settings--while Value-mediated effects are more heterogeneous. Together, these results unify additive superposition with context-dependent attention reweighting into a mechanistic, testable account of how few-shot prompts implement tasks.
Abstract（参考訳）: In-context Learning (ICL) は最小限の例から新しいタスクを抽出するが、モデル関数ベクトル(FV)をどう形作るかという機械的な説明はいまだにない。タスクやモデル全体で、$n$-shot FVは例レベルのサブFVの線形結合によってよく近似され、個々のデモから追加的かつ構成可能なコントリビューションが提案される。付加性以外のモデルでは、先行例に基づいて個々の例の表現を文脈化して適応的に重み付けし、実演がFVを支配していることを示す。最後に、因果分解は、クエリキールーティングとバリュー更新を分離し、コンテキスト化のFV品質に対する最も一貫性のあるコントリビューションは、クエリキーアライメント(特にあいまいな設定で)から生じる。これらの結果は、文脈依存の注意を重み付けした付加的重ね合わせを、少数ショットプロンプトがいかにタスクを実装するかという、機械的かつ検証可能な説明に統一する。

論文の概要: How Few-Shot Examples Add Up: A Causal Decomposition of Function Vectors in In-Context Learning

関連論文リスト