Fugu-MT 論文翻訳(概要): Can Aha Moments Be Fake? Identifying True and Decorative Thinking Steps in Chain-of-Thought

論文の概要: Can Aha Moments Be Fake? Identifying True and Decorative Thinking Steps in Chain-of-Thought

arxiv url: http://arxiv.org/abs/2510.24941v1
Date: Tue, 28 Oct 2025 20:14:02 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-30 15:50:44.780802
Title: Can Aha Moments Be Fake? Identifying True and Decorative Thinking Steps in Chain-of-Thought
Title（参考訳）: Aha Momentsは偽物になれるか? 真理と決定的思考のステップを解明する
Authors: Jiachen Zhao, Yiyou Sun, Weiyan Shi, Dawn Song,
Abstract要約: 大きな言語モデル(LLM)は、テスト時に長いチェーン・オブ・ソート(CoT)を生成することができ、複雑なタスクを解決できる。提案したTrue Thinking Score (TTS) を用いて、各推論ステップの段階的因果関係がモデルの最終予測に与える影響を測定する。我々は、LLMの潜在空間におけるTrueThinking方向を同定し、モデルに特定のCoTステップの実行や無視を強制することができる。
参考スコア（独自算出の注目度）: 72.45900226435289
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Recent large language models (LLMs) can generate long Chain-of-Thought (CoT) at test time, enabling them to solve complex tasks. These reasoning steps in CoT are often assumed as a faithful reflection of the model's internal thinking process, and used to monitor unsafe intentions. However, we find many reasoning steps don't truly contribute to LLMs' prediction. We measure the step-wise causal influence of each reasoning step on the model's final prediction with a proposed True Thinking Score (TTS). We reveal that LLMs often interleave between true-thinking steps (which are genuinely used to produce the final output) and decorative-thinking steps (which only give the appearance of reasoning but have minimal causal impact). Notably, only a small subset of the total reasoning steps have a high TTS that causally drive the model's prediction: e.g., for the AIME dataset, only an average of 2.3% of reasoning steps in CoT have a TTS >= 0.7 (range: 0-1) under the Qwen-2.5 model. Furthermore, we identify a TrueThinking direction in the latent space of LLMs. By steering along or against this direction, we can force the model to perform or disregard certain CoT steps when computing the final result. Finally, we highlight that self-verification steps in CoT (i.e., aha moments) can also be decorative, where LLMs do not truly verify their solution. Steering along the TrueThinking direction can force internal reasoning over these steps, resulting in a change in the final results. Overall, our work reveals that LLMs often verbalize reasoning steps without actually performing them internally, which undermines both the efficiency of LLM reasoning and the trustworthiness of CoT.
Abstract（参考訳）: 最近の大規模言語モデル(LLM)は、テスト時に長時間のChain-of-Thought(CoT)を生成することができ、複雑なタスクを解決できる。これらのCoTの推論ステップは、しばしばモデルの内部思考プロセスの忠実な反映と見なされ、安全でない意図を監視するために使用される。しかし、多くの理由付けステップがLLMの予測に真に寄与していないことが分かっています。提案したTrue Thinking Score (TTS) を用いて, 各推論ステップの段階的因果関係がモデルの最終予測に与える影響を測定する。 LLMは、しばしば真の思考ステップ(最終的なアウトプットを生成するために実際に使用される)と装飾的な思考ステップ(推論の外観しか与えないが、因果的影響を最小限に抑える)の間に介在する。例えば、AIMEデータセットでは、CoTにおける推論ステップの平均2.3%は、Qwen-2.5モデルの下でTS>=0.7(範囲:0-1)である。さらに,LLMの潜在空間におけるTrueThinking方向を同定する。この方向に沿ってあるいは反対に進むことで、最終的な結果を計算する際に、モデルに特定のCoTステップの実行や無視を強制することができる。最後に, CoT の自己検証ステップ (すなわち, モーメント) も装飾的であり, LLM がその解を真に検証しない点を強調した。 TrueThinking方向のステアリングは、これらのステップに対して内部の推論を強制し、最終的な結果を変更する。全体として、我々の研究は、LLMが実際に内部で行うことなく推論のステップを口頭で表すことがしばしばであり、LLM推論の効率とCoTの信頼性の両方を損なうことを明らかにしている。

関連論文リスト

Stop Spinning Wheels: Mitigating LLM Overthinking via Mining Patterns for Early Reasoning Exit [114.83867400179354]
オーバーライドは、大きな言語モデル全体のパフォーマンスを低下させる可能性がある。推論は, 探索段階の不足, 補償推論段階, 推論収束段階の3段階に分類される。我々は,ルールに基づく軽量なしきい値設定戦略を開発し,推論精度を向上させる。
論文参考訳（メタデータ） (2025-08-25T03:17:17Z)
Beyond Prompt-Induced Lies: Investigating LLM Deception on Benign Prompts [79.1081247754018]
大規模言語モデル(LLM)は、推論、計画、意思決定のタスクに広くデプロイされている。そこで我々は, 接触探索質問(CSQ)に基づく枠組みを提案し, 騙しの可能性を定量化する。
論文参考訳（メタデータ） (2025-08-08T14:46:35Z)
Overclocking LLM Reasoning: Monitoring and Controlling Thinking Path Lengths in LLMs [52.663816303997194]
回答の質に影響を与える重要な要因は思考段階の長さである。本稿では, LLM が推論の長さを理解し, 制御するメカニズムを探求し, 活用する。以上の結果から,この「オーバークロック」手法は過度な思考を軽減し,解答精度を向上し,推論遅延を低減することが示唆された。
論文参考訳（メタデータ） (2025-06-08T17:54:33Z)
The Price of a Second Thought: On the Evaluation of Reasoning Efficiency in Large Language Models [54.88805865447848]
モデルが全体の効率を向上し,問題の難しさが効率に影響を及ぼすことを示す。インストラクションモデルが簡単なアウトラインをドラフトし,思考モデルがそれを拡張する,シンプルな2段階パイプラインであるCOTHINKを提案する。 GSM8K、MATH500、AIME24では、COTHINKはトークンの使用量を21.1%削減し、4つの思考モデルの精度を維持し、強力な効率のベースラインと競争し続ける。
論文参考訳（メタデータ） (2025-05-28T06:24:45Z)
Have Large Language Models Learned to Reason? A Characterization via 3-SAT Phase Transition [11.422434149376478]
大規模言語モデル(LLM)は高度な推論能力を持つAIモデルとして評価されている。理論上は、Chain-of-Thought (CoT) を用いた自己回帰 LLM は複雑な推論タスクを解くためによりシリアルな計算を行うことができる。近年の研究では、LSMは、この能力にもかかわらず、理性を学ぶのではなく、統計的特徴に適合することが示唆されている。
論文参考訳（メタデータ） (2025-04-04T20:57:36Z)
SoftCoT: Soft Chain-of-Thought for Efficient Reasoning with LLMs [48.28847964704554]
CoT(Chain-of-Thought)推論により、LLM(Large Language Models)は複雑な推論タスクを解くことができる。 LLMの変更を必要としない連続空間推論のための新しい手法を提案する。
論文参考訳（メタデータ） (2025-02-17T18:52:29Z)
When More is Less: Understanding Chain-of-Thought Length in LLMs [51.631483479081645]
大規模言語モデル(LLM)は複雑な問題を分解するためにChain-of-Thought(CoT)推論を用いる。本稿は、長いCoTがより優れていると仮定されることがしばしばあり、長いCoTが常に優れているとは限らない、と論じる。
論文参考訳（メタデータ） (2025-02-11T05:28:59Z)
Premise-Augmented Reasoning Chains Improve Error Identification in Math reasoning with LLMs [10.179253284788796]
大型言語モデル(LLM)の数学的推論を促進させるチェーン・オブ・ソート(CoT) 本稿では,各ステップの前提を識別し,推論の評価を改善するためのフレームワークを提案する。本研究は,複雑な問題解決課題に対処する前提中心表現の有用性を強調した。
論文参考訳（メタデータ） (2025-02-04T14:44:58Z)
Assessing Step-by-Step Reasoning against Lexical Negation: A Case Study on Syllogism [19.590120229602103]
大規模言語モデル(LLM)は、ステップバイステップの推論命令、例えばチェーン・オブ・シント(CoT)プロンプトを利用する。本研究では, 否定に着目したLCMのステップバイステップ推論能力について検討する。
論文参考訳（メタデータ） (2023-10-23T12:40:41Z)

関連論文リストは本サイト内にある論文のタイトル・アブストラクトから自動的に作成しています。

指定された論文の情報です。
本サイトの運営者は本サイト（すべての情報・翻訳含む）の品質を保証せず、本サイト（すべての情報・翻訳含む）を使用して発生したあらゆる結果について一切の責任を負いません。