Fugu-MT 論文翻訳(概要): Future Validity is the Missing Statistic: From Impossibility to $Φ$-Estimation for Grammar-Faithful Speculative Decoding

論文の概要: Future Validity is the Missing Statistic: From Impossibility to $Φ$-Estimation for Grammar-Faithful Speculative Decoding

arxiv url: http://arxiv.org/abs/2605.07698v1
Date: Fri, 08 May 2026 13:08:18 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-11 19:43:39.058656
Title: Future Validity is the Missing Statistic: From Impossibility to $Φ$-Estimation for Grammar-Faithful Speculative Decoding
Title（参考訳）: 不合理性から「$$-Estimation for Grammar-Faithful Speculative Decoding」まで
Authors: Wenhua Nie, Zijie Meng, Kun Zou, Zheng Lin, Ziwei Li, Haoran Zheng, Jyh-Shing Roger Jang, Hao Zhang,
Abstract要約: 文法制約付き生成は、しばしば局所的な語彙マスキングと投機的復号化と組み合わせられる。局所マスクアクセスを持つ投機的デコーダ,レバイアサン拒絶,および局所予測分布$mathrmproj$からのロールバック音質サンプルは,文法条件分布$star$ではなく,いずれかを示す。
参考スコア（独自算出の注目度）: 21.074583041375888
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Grammar-constrained generation is often combined with local vocabulary masking and speculative decoding, but the resulting sampling law is not the grammar-conditional distribution users usually intend. We show that any speculative decoder with local mask access, Leviathan rejection, and rollback soundness samples from the locally projected distribution $μ^{\mathrm{proj}}$ rather than the grammar-conditional distribution $μ^\star$. This extends the GAD impossibility result to speculative decoding; on Dyck grammars with Qwen3-8B, the total-variation gap can reach 0.996. We identify the future-validity function $Φ_t(y)=\Pr_p[\mathrm{valid\ completion}\mid y]$ as the missing correction statistic. The target distribution is a Doob transform of the base model with $h=Φ$, while local masking corresponds to setting $h$ to one. With exact $Φ$, our oracle decoder FVO-Spec samples exactly from $μ^\star$; with approximate $Φ$, we bound the resulting total-variation error. Because exact future validity is hard for general context-free grammars, we evaluate estimator hierarchies on tractable Dyck and finite JSON languages. OneStep reduces Dyck TV by 14% with under 1% throughput overhead, exact dynamic programming reduces it by 97%, and finite-language correction closes JSON gaps to numerical precision. All fidelity claims are scoped to enumerable grammars and token tries.
Abstract（参考訳）: 文法制約付き生成は、しばしば局所的な語彙マスキングと投機的復号法と組み合わせられるが、結果として得られるサンプリング法則は、通常ユーザーが意図する文法条件分布ではない。局所マスクアクセスを持つ投機的デコーダ、レヴィアサンの拒絶、および局所予測分布$μ^{\mathrm{proj}}$からのロールバック音質サンプルは、文法条件分布$μ^\star$ではなく、いずれの投機的デコーダも示す。これにより、GADの不合理性は投機的復号化へと拡張され、Qwen3-8Bのダイク文法では、全変分差は0.996に達する。将来の正則関数 $ _t(y)=\Pr_p[\mathrm{valid\ completion}\mid y]$ を欠落した補正統計量として同定する。ターゲットの分布は、ベースモデルのDoob変換で$h=$$であるのに対して、ローカルマスキングは$h$を1に設定する。正確な$=では、我々のオラクルデコーダ FVO-Spec は$μ^\star$ から正確にサンプリングされる。一般的な文脈自由文法では, 正確な将来性は難しいため, トラクタブルダイクおよび有限JSON言語上の推定器階層を評価する。 OneStepはDyck TVを1%未満のスループットオーバーヘッドで14%削減し、正確な動的プログラミングにより97%削減し、有限言語補正はJSONギャップを数値精度で埋める。すべての忠実なクレームは、エネルブル文法とトークン試行にスコープされる。

論文の概要: Future Validity is the Missing Statistic: From Impossibility to $Φ$-Estimation for Grammar-Faithful Speculative Decoding

関連論文リスト