Fugu-MT 論文翻訳(概要): Draft-Conditioned Constrained Decoding for Structured Generation in LLMs

論文の概要: Draft-Conditioned Constrained Decoding for Structured Generation in LLMs

arxiv url: http://arxiv.org/abs/2603.03305v1
Date: Sun, 08 Feb 2026 03:52:24 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-09 01:20:08.131409
Title: Draft-Conditioned Constrained Decoding for Structured Generation in LLMs
Title（参考訳）: LLMにおける構造生成のためのドラフトコンディション制約デコード
Authors: Avinash Reddy, Thayne T. Walker, James S. Ide, Amrit Singh Bedi,
Abstract要約: 制約デコーディングは、モデルが有効な継続に低確率質量を割り当てたときに生成を歪めることができる。本稿では,構造的強制からセマンティックプランニングを分離する訓練自由推論手法であるemphDraft-Conditioned Constrained Decoding (DCCD)を提案する。我々は,KLプロジェクションビューを用いてDCCDを解析し,ドラフト条件が実現可能な質量を増大させ,ハード制約による累積的な「投射税」を減少させることを示した。
参考スコア（独自算出の注目度）: 11.309525632171217
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Large language models (LLMs) are increasingly used to generate executable outputs, JSON objects, and API calls, where a single syntax error can make the output unusable. Constrained decoding enforces validity token-by-token via masking and renormalization, but it can distort generation when the model assigns low probability mass to valid continuations, pushing decoding toward locally valid yet semantically incorrect trajectories. We propose \emph{Draft-Conditioned Constrained Decoding (DCCD)}, a simple two-step, training-free inference procedure that decouples semantic planning from structural enforcement: an unconstrained draft is generated first, and constrained decoding is then applied, conditioned on this draft, to guarantee validity. We analyze DCCD through a KL-projection view, showing that draft conditioning increases feasible mass and reduces the cumulative "projection tax" induced by hard constraints, with an optional best-of-$K$ draft selection. Across structured reasoning benchmarks, DCCD improves strict structured accuracy by up to +24 percentage points over standard constrained decoding (e.g., 15.2\% to 39.0\% on GSM8K with a 1B model), and enables smaller model pairs to match or exceed much larger constrained baselines, yielding substantial gains in parameter efficiency.
Abstract（参考訳）: 大きな言語モデル(LLM)は、実行可能な出力、JSONオブジェクト、API呼び出しを生成するために、ますます使われています。制約付き復号法はマスキングと再正規化によって正当性トークン・バイ・トーケンを強制するが、モデルが低確率質量を有効継続に割り当てた場合には生成を歪め、復号法は局所的に有効だが意味的に正しくない軌道へと押し下げる。本稿では,制約のないドラフトを最初に生成し,制約付きデコーディングをこのドラフトに適用し,その妥当性を保証するための,単純な2ステップのトレーニング不要推論手順である 'emph{Draft-Conditioned Constrained Decoding (DCCD) を提案する。我々は,KLプロジェクションビューを用いてDCCDを解析し,ドラフト条件が実現可能な質量を増大させ,厳密な制約によって引き起こされる累積的な「投射税」を減らすことを示し,オプションとして$K$のドラフト選択を行う。構造化推論ベンチマーク全体では、DCCDは標準制約付き復号法(例えば、GSM8Kでは1Bモデルで15.2\%から39.0\%)よりも最大で+24パーセンテージの厳密な構造化精度を向上し、より小さなモデルペアがより大きな制約付きベースラインに適合または超えることを可能にし、パラメータ効率の大幅な向上をもたらす。

論文の概要: Draft-Conditioned Constrained Decoding for Structured Generation in LLMs

関連論文リスト