Fugu-MT 論文翻訳(概要): Distributional Alignment as a Criterion for Designing Task Vectors in In-Context Learning

論文の概要: Distributional Alignment as a Criterion for Designing Task Vectors in In-Context Learning

arxiv url: http://arxiv.org/abs/2605.20730v1
Date: Wed, 20 May 2026 05:26:38 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-21 19:19:56.495695
Title: Distributional Alignment as a Criterion for Designing Task Vectors in In-Context Learning
Title（参考訳）: 文脈学習におけるタスクベクトル設計基準としての分布アライメント
Authors: Jihoon Kwon, Jiwon Choi, Jy-yong Sohn,
Abstract要約: 本稿では,タスクベクトルを用いた推論は,その予測分布とICLの予測分布を一致させるべきであると論じる。本稿では,タスクベクトルベースとICLベースの推論の次点確率の差を測定する指標である$d_textNTP$を紹介する。閉形式線形写像により$d_textNTP$を最小化するLTV(Linear Task Vector)を開発した。
参考スコア（独自算出の注目度）: 6.840854574584369
License: http://creativecommons.org/licenses/by/4.0/
Abstract: In-context learning (ICL) allows large language models (LLMs) to adapt to new tasks through demonstrations, yet it suffers from escalating inference costs as context length increases. While task vectors offer a promising alternative by compressing demonstrations into compact hidden-state representations, their quality has been evaluated only through downstream task accuracy. This indirect criterion provides limited insight into how to design more effective task vector extraction methods. In this paper, we posit that inference using task vectors should align their predictive distribution with that of ICL. To quantify this, we introduce $d_{\text{NTP}}$, a metric that measures the discrepancy in next-token probabilities between task vector-based and ICL-based inference. Our empirical analysis reveals that $d_{\text{NTP}}$ serves as a performance proxy, exhibiting a strong negative correlation with downstream accuracy. Motivated by this, we develop Linear Task Vector (LTV), a method designed to minimize $d_{\text{NTP}}$ via a closed-form linear mapping that estimates demonstration effects through regression. Across eight classification benchmarks and five LLMs, LTV consistently outperforms existing task vector baselines, improving average accuracy by 9.2\% while reducing inference latency. We further show that LTV outperforms the baselines on regression tasks. Moreover, we investigate the transferability of LTV across different model scales; an aspect that has remained nascent in task vector research. Specifically, we empirically show that task vectors from a larger model can enhance a smaller model's performance by 6.4\%, suggesting a new utility for extracted task representations.
Abstract（参考訳）: インコンテキスト学習(ICL)は、大規模言語モデル(LLM)がデモを通じて新しいタスクに適応することを可能にするが、コンテキスト長が増加するにつれて推論コストの増大に悩まされる。タスクベクトルはデモをコンパクトな隠れ状態表現に圧縮することで有望な代替手段を提供するが、その品質は下流のタスク精度によって評価されている。この間接的基準は、より効率的なタスクベクトル抽出方法の設計方法に関する限られた洞察を与える。本稿では,タスクベクトルを用いた推論は,その予測分布をICLと一致させるべきであると仮定する。これの定量化には、$d_{\text{NTP}}$を導入します。我々の実証分析によると、$d_{\text{NTP}}$はパフォーマンスプロキシとして機能し、下流の精度と強い負の相関を示す。そこで我々はLTV(Linear Task Vector, LTV)を開発し, 回帰による実演効果を推定する閉形式線形写像を用いて$d_{\text{NTP}}$を最小化する手法を提案する。 8つの分類ベンチマークと5つのLLMで、LTVは既存のタスクベクトルベースラインを一貫して上回り、平均精度を9.2\%改善し、推論遅延を低減している。さらに,LTVは回帰タスクのベースラインよりも優れていることを示す。さらに,様々なモデルスケールにおけるLTVの転送可能性について検討した。具体的には、より大きなモデルからのタスクベクトルがより小さなモデルの性能を6.4 %向上させることができることを実証的に示し、抽出されたタスク表現のための新しいユーティリティを提案する。

論文の概要: Distributional Alignment as a Criterion for Designing Task Vectors in In-Context Learning

関連論文リスト