Fugu-MT 論文翻訳(概要): When Is Thinking Enough? Early Exit via Sufficiency Assessment for Efficient Reasoning

論文の概要: When Is Thinking Enough? Early Exit via Sufficiency Assessment for Efficient Reasoning

arxiv url: http://arxiv.org/abs/2604.06787v1
Date: Wed, 08 Apr 2026 07:56:28 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-09 17:30:51.411426
Title: When Is Thinking Enough? Early Exit via Sufficiency Assessment for Efficient Reasoning
Title（参考訳）: 思考が十分であるのはいつ頃か : 効率的な推論のための十分性評価による早期退院
Authors: Yang Xiang, Yixin Ji, Ruotao Xu, Dan Qiao, Zheming Yang, Juntao Li, Min Zhang,
Abstract要約: 本稿では、効率的な推論のための新しいフレームワークDTSR(Dynamic Thought Sufficiency in Reasoning)を紹介する。人間のメタ認知にインスパイアされたDTSRは、リフレクションシグナルモニタリングとThought Sufficiency Checkという2つの段階で動作する。 DTSRは推論長を28.9%から34.9%削減し、性能損失を最小限に抑えている。
参考スコア（独自算出の注目度）: 52.21239821135325
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Large reasoning models (LRMs) have achieved remarkable performance in complex reasoning tasks, driven by their powerful inference-time scaling capability. However, LRMs often suffer from overthinking, which results in substantial computational redundancy and significantly reduces efficiency. Early-exit methods aim to mitigate this issue by terminating reasoning once sufficient evidence has been generated, yet existing approaches mostly rely on handcrafted or empirical indicators that are unreliable and impractical. In this work, we introduce Dynamic Thought Sufficiency in Reasoning (DTSR), a novel framework for efficient reasoning that enables the model to dynamically assess the sufficiency of its chain-of-thought (CoT) and determine the optimal point for early exit. Inspired by human metacognition, DTSR operates in two stages: (1) Reflection Signal Monitoring, which identifies reflection signals as potential cues for early exit, and (2) Thought Sufficiency Check, which evaluates whether the current CoT is sufficient to derive the final answer. Experimental results on the Qwen3 models show that DTSR reduces reasoning length by 28.9%-34.9% with minimal performance loss, effectively mitigating overthinking. We further discuss overconfidence in LRMs and self-evaluation paradigms, providing valuable insights for early-exit reasoning.
Abstract（参考訳）: 大規模推論モデル(LRM)は、その強力な推論時間スケーリング能力によって駆動される複雑な推論タスクにおいて、顕著なパフォーマンスを実現している。しかし、LRMは過度な思考に悩まされ、計算の冗長性が著しく低下し、効率が著しく低下する。早期退行法は、十分な証拠が生成されると推論を終了させることでこの問題を軽減することを目的としているが、既存のアプローチは主に信頼性が低く実用的でない手工芸的または経験的な指標に依存している。本研究では、効率的な推論のための新しいフレームワークである動的思考補充(DTSR)を紹介し、このモデルにより、そのチェーン・オブ・シント(CoT)の充足度を動的に評価し、早期退避の最適点を決定することができる。ヒトのメタ認知に触発されたDTSRは,(1)リフレクション信号モニタリング,(2)リフレクション信号を早期退避のための潜在的手がかりとして認識する,2)現在のCoTが最終回答を導き出すのに十分かどうかを判断する,2つの段階で機能する。 Qwen3モデルに対する実験結果から、DTSRは推論長を28.9%から34.9%削減し、最小性能の損失を減らし、事実上過度な考えを和らげることを示した。さらに、LEMと自己評価パラダイムの過信を議論し、早期退行推論に有用な洞察を提供する。

論文の概要: When Is Thinking Enough? Early Exit via Sufficiency Assessment for Efficient Reasoning

関連論文リスト