Fugu-MT 論文翻訳(概要): Think Anywhere in Code Generation

論文の概要: Think Anywhere in Code Generation

arxiv url: http://arxiv.org/abs/2603.29957v1
Date: Tue, 31 Mar 2026 16:24:03 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-01 15:25:03.854417
Title: Think Anywhere in Code Generation
Title（参考訳）: コード生成のどこにでも考える
Authors: Xue Jiang, Tianyu Zhang, Ge Li, Mengyang Liu, Taozhi Chen, Zhenhua Xu, Binhua Li, Wenpin Jiao, Zhi Jin, Yongbin Li, Yihong Dong,
Abstract要約: Think-Anywhereは、LLMがコード生成中にオンデマンドで思考を実行することを可能にする、新しい推論メカニズムである。我々は,Think-Anywhereが既存の推論手法と最近のポストトレーニング手法の両方に対して最先端の性能を達成することを示す。分析の結果,Think-Anywhereは高エントロピー位置での推論を適応的に実行し,解釈可能性を高めることができることがわかった。
参考スコア（独自算出の注目度）: 89.9273351013764
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Recent advances in reasoning Large Language Models (LLMs) have primarily relied on upfront thinking, where reasoning occurs before final answer. However, this approach suffers from critical limitations in code generation, where upfront thinking is often insufficient as problems' full complexity only reveals itself during code implementation. Moreover, it cannot adaptively allocate reasoning effort throughout the code generation process where difficulty varies significantly. In this paper, we propose Think-Anywhere, a novel reasoning mechanism that enables LLMs to invoke thinking on-demand at any token position during code generation. We achieve Think-Anywhere by first teaching LLMs to imitate the reasoning patterns through cold-start training, then leveraging outcome-based RL rewards to drive the model's autonomous exploration of when and where to invoke reasoning. Extensive experiments on four mainstream code generation benchmarks (i.e., LeetCode, LiveCodeBench, HumanEval, and MBPP) show that Think-Anywhere achieves state-of-the-art performance over both existing reasoning methods and recent post-training approaches, while demonstrating consistent generalization across diverse LLMs. Our analysis further reveals that Think-Anywhere enables the model to adaptively invoke reasoning at high-entropy positions, providing enhanced interpretability.
Abstract（参考訳）: 大規模言語モデル(LLM)の推論の最近の進歩は、最終回答の前に推論が行われる事前思考に大きく依存している。しかし、このアプローチはコード生成の限界に悩まされており、前もって考えることはしばしば不十分である。さらに、難易度が著しく異なるコード生成プロセス全体で推論の労力を適応的に割り当てることができない。本稿では,LLMがコード生成中の任意のトークン位置において,オンデマンドで思考を実行することができる新しい推論機構であるThink-Anywhereを提案する。我々はまず、コールドスタートトレーニングを通じて推論パターンを模倣するようにLSMに教え、次に結果に基づくRL報酬を活用して、モデルがいつどこで推論を実行すべきかを自律的に探究することで、Think-Anywhereを実現する。 4つの主要なコード生成ベンチマーク(LeetCode、LiveCodeBench、HumanEval、MBPP)の広範な実験は、Think-Anywhereが既存の推論方法と最近のトレーニング後のアプローチの両方に対して最先端のパフォーマンスを実現し、多様なLCMをまたいだ一貫した一般化を実証していることを示している。分析の結果,Think-Anywhereは高エントロピー位置での推論を適応的に実行し,解釈可能性を高めることができることがわかった。

論文の概要: Think Anywhere in Code Generation

関連論文リスト