Fugu-MT 論文翻訳(概要): Forward-Free Diffusion Language Models

論文の概要: Forward-Free Diffusion Language Models

arxiv url: http://arxiv.org/abs/2606.08357v1
Date: Sat, 06 Jun 2026 22:10:46 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-09 14:42:06.055803
Title: Forward-Free Diffusion Language Models
Title（参考訳）: 前方自由拡散言語モデル
Authors: Haotian Sun, Rushi Qiang, Yuqian Zheng, Bo Dai,
Abstract要約: 拡散言語モデルは反復的記述を通じてテキストを生成する。本研究では,手作業で設計した前方処理を必要としない前方自由拡散言語モデルFReDAを提案する。 FReDAは近傍に非依存で、モデル複雑度を意識し、フレキシブルリファインメントパラメータ化と互換性がある。
参考スコア（独自算出の注目度）: 12.961496586646708
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Diffusion language models generate text through iterative denoising, offering a powerful alternative to autoregressive generation. However, discrete language spaces lack a natural neighborhood structure for defining effective perturbations, so some artificial corruption schemes are proposed in the forward process. Such prescribed forward processes often produce states that are mathematically convenient but misaligned with drafts and errors encountered during generation, resulting in degraded sample quality. To address this limitation, we propose FReDA, a forward-free diffusion language model that eliminates the need for a hand-designed forward process. We formulate diffusion language modeling as recursive distribution refinement, in which model-generated drafts serve as implicit intermediate states, and the learned refinement model progressively moves the draft distribution toward the target distribution. Concretely, FReDA refines drafts by proposing candidate draft sequences and either directly performing self-refinement or selecting among parallel candidates via best-of-N refinement. With this design, FReDA is neighborhood-agnostic, model-complexity-aware, and compatible with flexible refinement parameterizations. Extensive evaluations in the sub-8B regime show that FReDA-4B outperforms larger diffusion base models on reasoning and coding benchmarks, achieving absolute gains of up to 15%, while reaching a 1.5-1.8x average speedup over diffusion baselines and scaling effectively with additional refinement computation.
Abstract（参考訳）: 拡散言語モデルは反復的記述を通じてテキストを生成し、自己回帰生成の強力な代替手段を提供する。しかし、離散言語空間は効果的な摂動を定義する自然な近傍構造を欠いているため、いくつかの人工的な汚いスキームが前方プロセスで提案されている。このような所定の前処理は、しばしば数学的に便利であるが、生成時に遭遇したドラフトやエラーと一致しない状態を生成し、結果としてサンプルの品質が低下する。この制限に対処するため,手作りの前方処理を必要としない前方自由拡散言語モデルFReDAを提案する。本稿では,モデル生成したドラフトが暗黙の中間状態として機能する再帰的分布改善として拡散言語モデリングを定式化し,学習された改善モデルは,段階的に目標分布に向かってドラフト分布を移動させる。具体的には、FReDAは、候補のドラフトシーケンスを提案してドラフトを洗練し、直接自己修正を行うか、ベスト・オブ・Nによる並列候補の選択を行う。この設計により、FReDAは近傍非依存で、モデル複雑度を意識し、フレキシブルな精細化パラメータ化と互換性がある。さらに,FReDA-4Bは,拡散ベースラインよりも1.5-1.8倍のスピードアップを達成し,さらなる改良を加えて,拡張ベースラインのスケーリングを効果的に行うとともに,推論および符号化ベンチマークにおいてより大きな拡散ベースモデルよりも優れることを示した。

論文の概要: Forward-Free Diffusion Language Models

関連論文リスト