Fugu-MT 論文翻訳(概要): Dystruct: Dynamically Structured Diffusion Language Model Decoding via Bayesian Inference

論文の概要: Dystruct: Dynamically Structured Diffusion Language Model Decoding via Bayesian Inference

arxiv url: http://arxiv.org/abs/2605.09820v1
Date: Sun, 10 May 2026 23:51:14 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-12 23:28:50.434527
Title: Dystruct: Dynamically Structured Diffusion Language Model Decoding via Bayesian Inference
Title（参考訳）: Dystruct: ベイズ推論による動的に構造化された拡散言語モデルデコード
Authors: Bian Sun, Kevin Zhai, Mubarak Shah, Zhenyi Wang,
Abstract要約: 拡散言語モデル (DLM) は自己回帰モデルに代わる有望な代替品として登場した。ほとんどの既存のDLMはデコードに先立って指定された固定生成長に依存しており、現実世界のアプリケーションでは柔軟性が制限されている。本稿では,動的構造推論問題としてフレキシブル長生成を定式化する,非学習型ベイズ構造復号化フレームワークを提案する。
参考スコア（独自算出の注目度）: 51.12849550784653
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Diffusion language models (DLMs) have recently emerged as a promising alternative to autoregressive models, primarily due to their ability to enable parallel decoding. Despite this advantage, most existing DLMs rely on a fixed generation length specified prior to decoding, which restricts their flexibility in real-world applications. While a few recent works attempt to support flexible-length generation, they typically suffer from notable limitations: some require costly retraining to accommodate variable-length outputs, while others depend solely on local confidence signals during decoding. Such local criteria fail to capture the evolving structure of the sequence, often resulting in suboptimal generation quality. In this paper, we propose a training-free, Bayesian structured decoding framework that formulates flexible-length generation as a dynamic structural inference problem. Our approach formulates flexible-length generation as a dynamic structural inference problem, jointly computing the expansion length, the block boundaries, and the decoding schedule. At each window expansion step, the method integrates local uncertainty with structural signals via a unified mechanism that supports dynamic structured generation, including both flexible block expansion and block organization, while maintaining coherence. Extensive experiments across multiple benchmarks demonstrate that our approach significantly improves generation quality and flexibility over existing fixed-length and flexible-length baselines. These results highlight the advantage of Bayesian structured decoding for diffusion language model, providing a principled and efficient solution for structured text generation.
Abstract（参考訳）: 拡散言語モデル(DLM)は、主に並列デコードを可能にする能力のために、自動回帰モデルに代わる有望な代替品として最近登場した。この利点にもかかわらず、ほとんどの既存のDLMはデコードに先立って指定された固定された世代長に依存しており、現実世界のアプリケーションにおける柔軟性を制限している。最近のいくつかの研究はフレキシブル長生成をサポートしようとしているが、典型的には顕著な制限に悩まされている: 可変長出力に対応するためにコストのかかる再訓練を必要とするものもあれば、復号中は局所的な信頼信号にのみ依存するものもある。このような局所的な基準は、配列の進化する構造を捉えず、しばしば最適以下の生成品質をもたらす。本稿では,動的構造推論問題としてフレキシブル長生成を定式化する,学習不要なベイズ構造復号化フレームワークを提案する。提案手法は,動的構造推論問題としてフレキシブル長生成を定式化し,拡張長,ブロック境界,復号化スケジュールを共同計算する。各ウィンドウ展開ステップにおいて、フレキシブルブロック展開とブロック構成の両方を含む動的構造化生成をサポートする統一機構により、コヒーレンスを維持しつつ、局所不確実性を構造信号と統合する。複数のベンチマークにわたる大規模な実験により、我々のアプローチは既存の固定長およびフレキシブル長のベースラインよりも生成品質と柔軟性を著しく改善することを示した。これらの結果は、拡散言語モデルに対するベイズ構造復号法の利点を強調し、構造化テキスト生成の原理的かつ効率的なソリューションを提供する。

論文の概要: Dystruct: Dynamically Structured Diffusion Language Model Decoding via Bayesian Inference

関連論文リスト