Fugu-MT 論文翻訳(概要): AdapTrack: Constrained Decoding without Distorting LLM's Output Intent

論文の概要: AdapTrack: Constrained Decoding without Distorting LLM's Output Intent

arxiv url: http://arxiv.org/abs/2510.17376v1
Date: Mon, 20 Oct 2025 10:11:34 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-25 00:56:39.399165
Title: AdapTrack: Constrained Decoding without Distorting LLM's Output Intent
Title（参考訳）: AdapTrack: LLMの出力インテントを歪ませることなく、制約付きデコード
Authors: Yongmin Li, Jia Li, Ge Li, Zhi Jin,
Abstract要約: 言語モデルに基づくコード生成と補完ツールは、時には必要な制約を満たさないコードを生成することがある。制約に固執するコードを生成するために、制約付き復号法が開発された。 AdapTrackは、モデルの出力インテントを歪めるのを避けるため、制約に準拠するだけでなく、モデルの出力インテントと意味的に一致した結果を生成する。
参考スコア（独自算出の注目度）: 53.75852524070165
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Language model-based code generation and completion tools have been widely adopted, but they may sometimes produce code that does not meet necessary constraints, such as syntactic correctness or API existence. Constrained decoding techniques are developed to help the model generate code adhering to the constraints by greedily eliminating generation options that violate constraints at each step of the generation process. However, there is a severe limitation of constrained decoding, that it distorts the model's output intent, forcing it to produce code that may satisfy the constraint but does not match the development intent and is therefore incorrect. In response to this challenge, we propose AdapTrack. By incorporating backtracking into the generation process, AdapTrack avoids distorting the output intent of the model, thereby producing results that are not only constraint-compliant but also more semantically aligned with model's output intent. On our synthetic API completion dataset, AdapTrack can achieve up to 360.87% improvement compared to constrained decoding; on the real-world API completion dataset we collect that exhibits similar issues, AdapTrack can achieve up to 38.93% improvement over constrained decoding; in general code genration benchmarks, compared to constrained decoding, AdapTrack can achieve up to 7.84% improvement on HumanEval, and up to 6.42% improvement on MBPP. This indicates that, simply by better adhering to the model's output intent, AdapTrack can achieve significant improvements. We provide a theoretical proof that the distribution produced by AdapTrack aligns with the model's distribution given the generated tokens, thereby ensuring that the model's output intent is not distorted. Experiments on DSL problems show that, compared to existing methods, our approach can provide generation results that are more consistent with the language model's distribution.
Abstract（参考訳）: 言語モデルに基づくコード生成と補完ツールは広く採用されているが、構文的正確性やAPIの存在など、必要な制約を満たしていないコードを生成することもある。生成プロセスの各ステップで制約に反する生成オプションを鮮やかに排除することで、モデルが制約に忠実なコードを生成するのに役立つように、制約付き復号技術が開発されている。しかし、制約付きデコードには厳格な制限があり、それはモデルの出力意図を歪め、制約を満たすが開発意図と一致せず、従って正しくないコードを生成するように強制する。この課題に対して,我々はAdapTrackを提案する。生成プロセスにバックトラックを組み込むことで、AdapTrackはモデルの出力インテントの歪みを回避し、制約に準拠しただけでなく、モデルの出力インテントにセマンティックに適合した結果を生成する。当社の合成API補完データセットでは、制約付きデコーディングと比較して最大360.87%の改善を達成できます。同じような問題を示す実世界のAPI補完データセットでは、AdapTrackは制約付きデコーディングよりも最大38.93%の改善を達成できます。これは、単にモデルの出力インテントをより忠実にすることで、AdapTrackが大幅な改善を達成できることを示している。本稿では,AdapTrackが生成した分布が生成したトークンをモデル分布と一致させることで,モデルの出力意図が歪まないことを保証する。 DSL問題に関する実験は、既存の手法と比較して、我々の手法が言語モデルの分布とより整合した生成結果を提供できることを示している。

論文の概要: AdapTrack: Constrained Decoding without Distorting LLM's Output Intent

関連論文リスト