Fugu-MT 論文翻訳(概要): Diffusion Language Model Parallel Decoding via Product-of-Experts Bridge

論文の概要: Diffusion Language Model Parallel Decoding via Product-of-Experts Bridge

arxiv url: http://arxiv.org/abs/2606.08048v1
Date: Sat, 06 Jun 2026 08:21:06 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-09 14:42:05.693145
Title: Diffusion Language Model Parallel Decoding via Product-of-Experts Bridge
Title（参考訳）: Product-of-Experts Bridgeによる拡散言語モデル並列デコーディング
Authors: Juntong Shi, Brian L. Trippe, Jure Leskovec, Stefano Ermon, Minkai Xu,
Abstract要約: 拡散言語モデル (DLMs) は並列デコーディングによる大幅な速度優位性を提供する。トークン依存関係の欠如は、自動回帰(AR)モデルと比較して生成品質を制限します。最近の進歩は、DLMが提案、ARが目標として、重要サンプリングによってギャップを埋めようとしている。本稿では,生成速度と精度を大幅に向上させる新しいデコードフレームワークPoE-Bridgeを紹介する。
参考スコア（独自算出の注目度）: 93.37920675145553
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Diffusion language models (DLMs) offer substantial speed advantages through parallel decoding, but the lack of token dependencies limits generation quality compared to autoregressive (AR) models. Recent progress attempts to bridge the gap via importance sampling, with DLM being the proposal and AR being the target. However, due to the huge gap between their distributions, the sampling requires a large number of particles and is thus expensive to compute. In this paper, we introduce PoE-Bridge, a novel decoding framework that drastically improves generation speed and accuracy by introducing an intermediate distribution to bridge the gap. The distribution is constructed as a Product-of-Experts (PoE) of the DLM proposal and the AR target. With the intermediate distribution, we first use the DLM to draft multiple continuations in parallel, then apply rejection sampling to verify the drafted tokens and move the resulting candidates toward the PoE. We then use importance sampling to further correct the PoE-aligned candidates toward the AR target. We further propose several improved techniques, including mixed-temperature sampling for enhanced diversity and elastic rejection windows for reducing wasted verification. Empirically, PoE-Bridge achieves significantly improved accuracy with $5\times$ speedup over the standard DLM decoding approach, and recovers at least 95% of the target AR model's performance, efficiently advancing most of the quality gap on challenging mathematical reasoning and coding tasks. Our code is available at https://github.com/juntongshi48/poe-bridge.
Abstract（参考訳）: 拡散言語モデル(DLM)は並列デコーディングによる大幅な速度優位性を提供するが、トークン依存の欠如は自己回帰(AR)モデルと比較して生成品質を制限している。最近の進歩は、DLMが提案、ARが目標として、重要サンプリングによってギャップを埋めようとしている。しかし、それらの分布の間に大きなギャップがあるため、サンプリングには大量の粒子が必要であり、計算に費用がかかる。本稿では,このギャップを埋める中間分布を導入することにより,生成速度と精度を大幅に向上する新しいデコードフレームワークPoE-Bridgeを紹介する。この分布は、DLM提案とARターゲットのProduct-of-Experts (PoE)として構成されている。中間分布では、まずDLMを用いて複数の継続を並列に起草し、次に拒否サンプリングを適用して、起草されたトークンを検証し、結果の候補をPoEへ移動させる。次に、重要サンプリングを使用して、PoE対応候補をARターゲットに向けてさらに修正する。さらに、多様性向上のための混合温度サンプリングや、無駄な検証を減らすための弾性拒絶窓など、いくつかの改良された手法を提案する。実証的には、PoE-Bridgeは標準のDLMデコーディングアプローチよりも5ドル以上で大幅な精度向上を実現し、ターゲットのARモデルの性能の少なくとも95%を回復し、挑戦的な数学的推論やコーディングタスクにおける品質ギャップの大部分を効率的に改善する。私たちのコードはhttps://github.com/juntongshi48/poe-bridgeで利用可能です。

論文の概要: Diffusion Language Model Parallel Decoding via Product-of-Experts Bridge

関連論文リスト