Fugu-MT 論文翻訳(概要): Rethinking the Chain-of-Thought: The Roles of In-Context Learning and Pre-trained Priors

論文の概要: Rethinking the Chain-of-Thought: The Roles of In-Context Learning and Pre-trained Priors

arxiv url: http://arxiv.org/abs/2509.01236v1
Date: Mon, 01 Sep 2025 08:24:28 GMT
ステータス: 翻訳完了
システム内更新日: 2025-09-04 15:17:03.596807
Title: Rethinking the Chain-of-Thought: The Roles of In-Context Learning and Pre-trained Priors
Title（参考訳）: 思考の連鎖を再考する: 文脈学習と事前学習の役割
Authors: Hao Yang, Zhiyu Yang, Yunjie Zhang, Shanyi Zhu, Lin Yang,
Abstract要約: Chain-of-Thought推論は、モデル推論能力を向上するための重要な方法論として登場した。本稿では、文脈内学習と事前学習との二重関係の観点から、思考の連鎖的推論の動作メカニズムを考察する。
参考スコア（独自算出の注目度）: 11.658759174514222
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Chain-of-Thought reasoning has emerged as a pivotal methodology for enhancing model inference capabilities. Despite growing interest in Chain-of-Thought reasoning, its underlying mechanisms remain unclear. This paper explores the working mechanisms of Chain-of-Thought reasoning from the perspective of the dual relationship between in-context learning and pretrained priors. We first conduct a fine-grained lexical-level analysis of rationales to examine the model's reasoning behavior. Then, by incrementally introducing noisy exemplars, we examine how the model balances pretrained priors against erroneous in-context information. Finally, we investigate whether prompt engineering can induce slow thinking in large language models. Our extensive experiments reveal three key findings: (1) The model not only quickly learns the reasoning structure at the lexical level but also grasps deeper logical reasoning patterns, yet it heavily relies on pretrained priors. (2) Providing sufficient exemplars shifts the model's decision-making from pretrained priors to in-context signals, while misleading prompts introduce instability. (3) Long Chain-of-Thought prompting can induce the model to generate longer reasoning chains, thereby improving its performance on downstream tasks.
Abstract（参考訳）: Chain-of-Thought推論は、モデル推論能力を向上するための重要な方法論として登場した。因果関係の推論への関心が高まっているにもかかわらず、その根底にあるメカニズムはいまだ不明である。本稿では、文脈内学習と事前学習との二重関係の観点から、思考の連鎖的推論の動作メカニズムを考察する。まず、モデルの推論の振る舞いを調べるために、論理学の詳細な語彙レベル解析を行う。そして, 雑音を漸進的に導入することにより, 事前学習と誤コンテキスト情報とのバランスについて検討する。最後に、大規模言語モデルにおいて、迅速なエンジニアリングが遅い思考を誘発するかどうかを検討する。 1) モデルは語彙レベルで推論構造を素早く学習するだけでなく, より深い論理的推論パターンも把握するが, 事前学習に大きく依存する。 2)事前訓練された事前の判断を文脈内信号にシフトさせるのに対して,誤解を招くと不安定が生じる。 (3)ロング・チェーン・オブ・ソート(Long Chain-of-Thought)のプロンプトにより、より長い推論チェーンが生成され、下流タスクのパフォーマンスが向上する。

論文の概要: Rethinking the Chain-of-Thought: The Roles of In-Context Learning and Pre-trained Priors

関連論文リスト