Fugu-MT Paper Translation (Abstract): SEER: Enhancing Chain-of-Thought Code Generation through Self-Exploring Deep Reasoning

Paper summary: SEER: Enhancing Chain-of-Thought Code Generation through Self-Exploring Deep Reasoning

arxiv url: http://arxiv.org/abs/2510.17130v1
Date: Mon, 20 Oct 2025 03:51:17 GMT
Status: Translation complete
Last updated in system: 2025-10-25 00:56:39.296414
Title: SEER: Enhancing Chain-of-Thought Code Generation through Self-Exploring Deep Reasoning
Authors: Shuzheng Gao, Chaozheng Wang, Cuiyun Gao, Michael R. Lyu
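Abstract summary: Chain-of-Thought (CoT) reasoning enables Large Language Models (LLMs) to develop high-level reasoning plans before writing code. We propose SEER, a SElf-Exploring deep Reasoning framework that enables accurate and adaptive reasoning for code generation.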
Reference score (independently computed attention score): 41.76790935791852
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Code generation, the task of creating executable programs from natural language requirements, has recently seen tremendous advances through Chain-of-Thought (CoT) reasoning, which enables Large Language Models (LLMs) to develop high-level reasoning plans before writing code. Recent research has proposed various methods to enhance models' CoT reasoning for code generation such as prompt engineering and supervised fine-tuning. However, existing approaches still face three critical limitations: (1) limited exploration of diverse reasoning paths, which constrains generalization across various programming scenarios, (2) lack of quality assessment for intermediate reasoning steps, which hampers the reliability of the generated plans and code, and (3) the potential negative impact of "overthinking", potentially leading to unnecessarily complex and incorrect solutions. To address these limitations, we frame CoT code generation as a decision making problem and present SEER, a SElf-Exploring deep Reasoning framework that enables accurate and adaptive reasoning for code generation. SEER introduces three key components: (1) Diverse reasoning path exploration, which aims at exploring diverse reasoning paths and annotating intermediate steps without relying on manual experts or closed-source proprietary models; (2) Reasoning quality-aware model training, which trains a policy model for generating candidate reasoning steps and a value model for assessing their quality; and (3) Adaptive CoT reasoning, which dynamically switches between direct generation and step-by-step reasoning for different problems.
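The abstract describes a policy model that proposes candidate reasoning steps, a value model that scores them, and an adaptive switch between direct generation and step-by-step reasoning. The sketch below only illustrates that general idea and is not the authors' implementation: the greedy step selection, the `direct_threshold` parameter, the rule of scoring the empty path to decide on direct generation, and the toy `toy_policy` / `toy_value` stand-ins are all assumptions made for illustration.

```python
from typing import Callable, List, Sequence, Tuple

# Hypothetical interfaces (assumptions, not taken from the paper):
# a policy proposes candidate next steps, a value model scores a partial path.
Policy = Callable[[str, Sequence[str]], List[str]]
Value = Callable[[str, Sequence[str]], float]


def guided_reasoning(problem: str, policy: Policy, value: Value,
                     max_steps: int = 4) -> List[str]:
    """Greedily extend a reasoning path, keeping the best-scored candidate."""
    path: List[str] = []
    for _ in range(max_steps):
        candidates = policy(problem, path)
        if not candidates:
            break
        best = max(candidates, key=lambda step: value(problem, path + [step]))
        path.append(best)
    return path


def adaptive_solve(problem: str, policy: Policy, value: Value,
                   direct_threshold: float = 0.8) -> Tuple[str, List[str]]:
    """Assumed switching rule: skip step-by-step reasoning when the value
    model already rates the problem highly with no reasoning steps."""
    if value(problem, []) >= direct_threshold:
        return ("direct", [])
    return ("step_by_step", guided_reasoning(problem, policy, value))


# Toy stand-ins so the sketch runs without any model.
def toy_policy(problem: str, path: Sequence[str]) -> List[str]:
    steps = [["parse input", "guess"], ["loop over items", "give up"]]
    return steps[len(path)] if len(path) < len(steps) else []


def toy_value(problem: str, path: Sequence[str]) -> float:
    if not path:
        return 0.9 if "easy" in problem else 0.3
    good = sum(1 for s in path if s not in ("guess", "give up"))
    return good / len(path)


easy_result = adaptive_solve("easy: add two numbers", toy_policy, toy_value)
hard_result = adaptive_solve("hard: schedule jobs", toy_policy, toy_value)
```

With these toy stand-ins, the first problem takes the direct route and the second is solved through the value-guided path. A real system would replace both stand-ins with trained models and would likely explore more than one path at a time.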

Related papers