Fugu-MT 論文翻訳(概要): Many-Shot CoT-ICL: Making In-Context Learning Truly Learn

論文の概要: Many-Shot CoT-ICL: Making In-Context Learning Truly Learn

arxiv url: http://arxiv.org/abs/2605.13511v1
Date: Wed, 13 May 2026 13:30:12 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-14 23:30:28.07292
Title: Many-Shot CoT-ICL: Making In-Context Learning Truly Learn
Title（参考訳）: Many-Shot CoT-ICL: インテクスト学習を真に学習する
Authors: Tsz Ting Chung, Lemao Liu, Mo Yu, Dit-Yan Yeung,
Abstract要約: In-context Learning (ICL)は、パラメータを更新せずにプロンプト内のデモを条件にすることで、大きな言語モデルを新しいタスクに適応させる。提案手法は,標準のマルチショット・ルールが転送されないことを示すために,マルチショット・チェーン・オブ・コンテクスト・ラーニング(CoT-ICL)について検討する。
参考スコア（独自算出の注目度）: 58.439517684779936
License: http://creativecommons.org/licenses/by/4.0/
Abstract: In-context learning (ICL) adapts large language models (LLMs) to new tasks by conditioning on demonstrations in the prompt without parameter updates. With long-context models, many-shot ICL can use dozens to hundreds of examples and achieve performance comparable to fine-tuning, yet current understanding of its scaling behavior is largely derived from non-reasoning tasks. We study many-shot chain-of-thought in-context learning (CoT-ICL) for reasoning and show that standard many-shot rules do not transfer. Across non-reasoning and reasoning-oriented LLMs and across non-reasoning and reasoning tasks, we find: (i) a setting-dependent scaling effect, where increasing the number of CoT demonstrations is unstable for non-reasoning LLMs and benefits mainly reasoning-oriented LLMs; (ii) similarity-based retrieval helps on non-reasoning tasks but fails on reasoning, since semantic similarity poorly predicts procedural (i.e., CoT) compatibility; and (iii) an order-scaling effect, where performance variance grows with more CoT demonstrations. We interpret these behaviors by viewing many-shot CoT-ICL as in-context test-time learning rather than scaled pattern matching, and suggests two principles: (i) demonstrations should be easy for the target model to understand, and (ii) they should be ordered to support a smooth conceptual progression. Guided by the principle, we propose Curvilinear Demonstration Selection (CDS), a simple ordering method that yields up to a 5.42 percentage-point gain on geometry with 64 demonstrations. Overall, our results reframe the long context window from a retrieval buffer into a structured curriculum for in-context test-time learning.
Abstract（参考訳）: インコンテキスト学習(ICL)は、パラメータ更新なしでプロンプト内のデモを条件にすることで、大きな言語モデル(LLM)を新しいタスクに適応させる。長いコンテキストモデルでは、多数のショットICLは数十から数百のサンプルを使用し、微調整に匹敵するパフォーマンスを達成することができるが、現在のスケーリング動作の理解は、主に非合理的なタスクから派生している。提案手法は,標準のマルチショット・ルールが転送されないことを示すために,マルチショット・チェーン・オブ・コンテクスト・ラーニング(CoT-ICL)について検討する。非理性、理性指向のLLM、非理性、理性指向のタスクにまたがって、以下のことが分かる。 (i)非推論型LCMにはCoTデモの増加が不安定であり、主に推論型LCMのメリットが期待できる設定依存スケーリング効果。 (ii)類似性に基づく検索は、非推論タスクに役立つが、意味的類似性は手続き的(CoT)互換性を予測できないため、推論に失敗する。 (iii)CoTのデモを多く行うと、パフォーマンスのばらつきが増大する秩序スケーリング効果。パターンマッチングをスケールするのではなく,コンテキスト内テストタイム学習として多発的なCoT-ICLを解釈することで,これらの振る舞いを解釈し,2つの原則を提案する。 (i)デモは、ターゲットモデルが理解しやすく、かつ、 (二)円滑な概念的進歩を支援するように命じるべきである。この原理で導かれたCurvilinear Demonstration Selection (CDS) は、64個の実演で最大5.42ポイントのゲインを得られる単純な順序付け法である。その結果,検索バッファの長いコンテキストウィンドウを,コンテキスト内テスト時間学習のための構造化カリキュラムに再構成した。

論文の概要: Many-Shot CoT-ICL: Making In-Context Learning Truly Learn

関連論文リスト