Fugu-MT 論文翻訳(概要): IA2: Alignment with ICL Activations Improves Supervised Fine-Tuning

論文の概要: IA2: Alignment with ICL Activations Improves Supervised Fine-Tuning

arxiv url: http://arxiv.org/abs/2509.22621v1
Date: Fri, 26 Sep 2025 17:46:32 GMT
ステータス: 翻訳完了
システム内更新日: 2025-09-29 20:57:54.622117
Title: IA2: Alignment with ICL Activations Improves Supervised Fine-Tuning
Title（参考訳）: IA2: ICLアクティベーションとのアライメント改善
Authors: Aayush Mishra, Daniel Khashabi, Anqi Liu,
Abstract要約: In-Context Learning (ICL) は、インプロンプト内のインストラクションやデモによってモデルに適応する。 ICLとSFTは異なるアクティベーションパターンを生成し,異なる機能機構によって2つの手法が適応可能であることを示す。 ICL Activation Alignment (IA2) は、ICCの活性化パターンをSFTモデルで再現し、ICCのような内部推論をインセンティブ化する自己蒸留技術である。
参考スコア（独自算出の注目度）: 42.543865253955666
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Supervised Fine-Tuning (SFT) is used to specialize model behavior by training weights to produce intended target responses for queries. In contrast, In-Context Learning (ICL) adapts models during inference with instructions or demonstrations in the prompt. ICL can offer better generalizability and more calibrated responses compared to SFT in data scarce settings, at the cost of more inference compute. In this work, we ask the question: Can ICL's internal computations be used to improve the qualities of SFT? We first show that ICL and SFT produce distinct activation patterns, indicating that the two methods achieve adaptation through different functional mechanisms. Motivated by this observation and to use ICL's rich functionality, we introduce ICL Activation Alignment (IA2), a self-distillation technique which aims to replicate ICL's activation patterns in SFT models and incentivizes ICL-like internal reasoning. Performing IA2 as a priming step before SFT significantly improves the accuracy and calibration of model outputs, as shown by our extensive empirical results on 12 popular benchmarks and 2 model families. This finding is not only practically useful, but also offers a conceptual window into the inner mechanics of model adaptation.
Abstract（参考訳）: Supervised Fine-Tuning (SFT) は、クエリに対して対象とする応答を生成するために重みをトレーニングすることによって、モデル行動の専門化に使用される。対照的に、In-Context Learning (ICL) はプロンプトのインストラクションやデモによる推論の間にモデルを適応させる。 ICLは、より推論計算のコストで、データ不足設定におけるSFTと比較して、より一般化性とよりキャリブレーションされた応答を提供することができる。 In this work, we asked the question: ICL's internal calculations can be used to improve the quality of SFT? まず、ICLとSFTが異なるアクティベーションパターンを生成することを示し、その2つの手法が異なる機能機構によって適応できることを示す。 ICL Activation Alignment (IA2)は、ICLの活性化パターンをSFTモデルで再現し、ICLのような内部推論をインセンティブ化する自己蒸留技術である。 SFT以前のプライミングステップとしてIA2を実行することで、12のベンチマークと2つのモデルファミリーでの広範な実験結果から、モデル出力の精度とキャリブレーションが大幅に向上する。この発見は実用的に有用であるだけでなく、モデル適応の内的力学に関する概念的な窓を提供する。

論文の概要: IA2: Alignment with ICL Activations Improves Supervised Fine-Tuning

関連論文リスト