Fugu-MT 論文翻訳(概要): Correctness-Guaranteed Code Generation via Constrained Decoding

論文の概要: Correctness-Guaranteed Code Generation via Constrained Decoding

arxiv url: http://arxiv.org/abs/2508.15866v1
Date: Wed, 20 Aug 2025 20:48:18 GMT
ステータス: 翻訳完了
システム内更新日: 2025-08-25 16:42:36.132529
Title: Correctness-Guaranteed Code Generation via Constrained Decoding
Title（参考訳）: 制約付きデコードによる正確性保証コード生成
Authors: Lingxiao Li, Salar Rahili, Yiwei Zhao,
Abstract要約: 本稿では,意味論的に正しいプログラムを生成するための制約付き実行時復号アルゴリズムを提案する。提案手法は,任意の所定のスクリプティングAPIに従って,意味的に正しいプログラムを生成することができることを示す。さらに、慎重に設計することで、我々のセマンティック保証が正当性にまで拡張され、ローグライクなビデオゲームにゲームメカニクスを発生させることで検証されることを示す。
参考スコア（独自算出の注目度）: 11.531496728670746
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Language Models (LMs) are increasingly being used for code generation, but ensuring the correctness of generated programs remains a significant challenge. Although imperfect code may be acceptable during software development with human oversight, domains such as video games and robotics require one-shot correctness for runtime-critical components. We present a constrained decoding algorithm for generating semantically correct programs that incorporates a context-sensitive parser, which, at each step, outputs a regular expression that satisfies a critical non-extensible property to guide the generation of the next token sequence that can continue to a correct program. To build such a context-sensitive parser, we propose a framework of a dynamic tree of parsers (ToP) during parsing, where each parser corresponds to a modular context-free grammar enriched with contextual information such as variable scopes and type constraints, with tree branches representing ambiguity in the future code segment. We demonstrate our approach through sLua, a strongly typed variant of Lua, showing that our method can generate semantically correct programs conforming to any prescribed scripting API. We further show that, with careful design, our semantic guarantees extend to runtime correctness, as validated in the application of generating game mechanics for a roguelike video game.
Abstract（参考訳）: 言語モデル(LM)はコード生成にますます使われていますが、生成されたプログラムの正確性を保証することは大きな課題です。人間の監視によるソフトウェア開発において不完全なコードは受け入れられるかもしれないが、ビデオゲームやロボティクスのようなドメインは実行時クリティカルなコンポーネントに対してワンショットの正当性を必要とする。本稿では、文脈に敏感なパーサを組み込んだ意味論的に正しいプログラムを生成するための制約付き復号アルゴリズムを提案し、各ステップにおいて、重要な非拡張性を満たす正規表現を出力し、正しいプログラムを継続可能な次のトークンシーケンスの生成を導く。このような文脈に敏感なパーサを構築するために、解析中のパーサの動的ツリー(ToP)のフレームワークを提案し、各パーサは、変数スコープや型制約といったコンテキスト情報に富んだモジュラーな文脈自由文法に対応し、将来のコードセグメントにおけるあいまいさを表すツリーブランチを持つ。我々は,Luaの強く型付けされた変種であるsLuaによるアプローチを実証し,任意の所定のスクリプティングAPIに準拠した意味的に正しいプログラムを生成可能であることを示す。さらに、慎重に設計することで、我々のセマンティック保証が実行時正当性にまで拡張され、ローグライクなビデオゲームにゲームメカニクスを発生させることで検証されることを示す。

論文の概要: Correctness-Guaranteed Code Generation via Constrained Decoding

関連論文リスト