Fugu-MT 論文翻訳(概要): TreeCoder: Systematic Exploration and Optimisation of Decoding and Constraints for LLM Code Generation

論文の概要: TreeCoder: Systematic Exploration and Optimisation of Decoding and Constraints for LLM Code Generation

arxiv url: http://arxiv.org/abs/2511.22277v1
Date: Thu, 27 Nov 2025 09:59:39 GMT
ステータス: 翻訳完了
システム内更新日: 2025-12-01 19:47:55.496063
Title: TreeCoder: Systematic Exploration and Optimisation of Decoding and Constraints for LLM Code Generation
Title（参考訳）: TreeCoder: LLMコード生成のためのデコードと制約の体系的探索と最適化
Authors: Henrijs Princis, Arindam Sharma, Cristina David,
Abstract要約: TreeCoderは、デコーディングを候補プログラムのツリー検索として表現する。 TreeCoderは、CodeLlama、Mistral、DeepSeekなどのオープンソースモデルの精度を一貫して改善する。
参考スコア（独自算出の注目度）: 2.2940141855172036
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Large language models (LLMs) have shown remarkable ability to generate code, yet their outputs often violate syntactic or semantic constraints when guided only through natural language prompts. We introduce TreeCoder, the most general and flexible framework to date for exploring decoding strategies, constraints, and hyperparameters in LLMs, and use it in code generation to enforce correctness and structure during decoding rather than relying on prompt engineering. TreeCoder represents decoding as a tree search over candidate programs, where both decoding strategies and constraint functions - such as style, syntax, execution - are treated as first-class, optimisable components. This design enables systematic exploration and automatic tuning of decoding configurations using standard optimisation techniques. Experiments on the MBPP (Python) and SQL-Spider benchmarks show that TreeCoder consistently improves accuracy across open-source models such as CodeLlama, Mistral and DeepSeek, often outperforming their unconstrained baselines by considerable margins.
Abstract（参考訳）: 大規模言語モデル(LLM)は、コードを生成する素晴らしい能力を示しているが、自然言語プロンプトを通してのみガイドされる場合、その出力は構文的または意味的な制約に反することが多い。私たちは、LLMのデコード戦略、制約、ハイパーパラメータを探索する上で、これまでで最も汎用的で柔軟なフレームワークであるTreeCoderを紹介します。 TreeCoderは、デコード戦略と制約関数(スタイル、構文、実行など)の両方を第一級の最適化可能なコンポーネントとして扱う、候補プログラムに対するツリー検索としてデコードを表す。この設計は、標準的な最適化手法を用いて、デコード構成の体系的な探索と自動チューニングを可能にする。 MBPP(Python)とSQL-Spiderベンチマークの実験は、TreeCoderがCodeLlama、Mistral、DeepSeekといったオープンソースモデルで一貫して精度を向上し、制約のないベースラインをかなりのマージンで上回っていることを示している。

論文の概要: TreeCoder: Systematic Exploration and Optimisation of Decoding and Constraints for LLM Code Generation

関連論文リスト