Fugu-MT Paper Translation (Abstract): Dream 7B: Diffusion Large Language Models

Paper overview: Dream 7B: Diffusion Large Language Models

arxiv url: http://arxiv.org/abs/2508.15487v1
Date: Thu, 21 Aug 2025 12:09:58 GMT
Status: Translation complete
Last updated in system: 2025-08-22 16:26:46.311089
Title: Dream 7B: Diffusion Large Language Models
Authors: Jiacheng Ye, Zhihui Xie, Lin Zheng, Jiahui Gao, Zirui Wu, Xin Jiang, Zhenguo Li, Lingpeng Kong
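Abstract summary: We introduce Dream 7B, the most powerful open diffusion large language model to date. Our model consistently outperforms existing diffusion language models on general, mathematical, and coding tasks.
Reference score (independently computed attention score): 85.26033751898296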
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: We introduce Dream 7B, the most powerful open diffusion large language model to date. Unlike autoregressive (AR) models that generate tokens sequentially, Dream 7B employs discrete diffusion modeling to refine sequences in parallel through iterative denoising. Our model consistently outperforms existing diffusion language models on general, mathematical, and coding tasks. Dream 7B demonstrates superior planning abilities and inference flexibility, including arbitrary-order generation, infilling capabilities, and tunable quality-speed trade-offs. These results are achieved through simple yet effective training techniques, including AR-based LLM initialization and context-adaptive token-level noise rescheduling. We release both Dream-Base and Dream-Instruct to facilitate further research in diffusion-based language modeling.
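
As a rough illustration of "refining sequences in parallel through iterative denoising" and the tunable quality-speed trade-off mentioned in the abstract, here is a minimal toy sketch. It is not Dream 7B's implementation: the mask-and-unmask scheme, `toy_denoiser`, `iterative_denoise`, and the confidence-based selection rule are all assumptions made for this example, and the "model" simply reads the answer from a known target with random confidences.

```python
import math
import random

MASK = "<mask>"  # hypothetical placeholder for a fully noised position


def toy_denoiser(tokens, target, rng):
    """Hypothetical stand-in for a trained denoising model.

    For every masked position it returns (predicted_token, confidence).
    A real model would predict from context; this toy version copies the
    known target and draws a random confidence.
    """
    preds = {}
    for i, tok in enumerate(tokens):
        if tok == MASK:
            preds[i] = (target[i], rng.random())
    return preds


def iterative_denoise(target, num_steps, seed=0):
    """Start from an all-mask sequence and unmask it in num_steps rounds.

    Each round predicts all masked positions in parallel, then commits
    only the most confident ones, so tokens can appear in any order.
    Fewer steps means more tokens committed per round (faster, coarser).
    """
    rng = random.Random(seed)
    tokens = [MASK] * len(target)
    order = []  # positions committed in each round
    for step in range(num_steps):
        preds = toy_denoiser(tokens, target, rng)
        if not preds:
            break  # nothing left to denoise
        remaining_steps = num_steps - step
        k = math.ceil(len(preds) / remaining_steps)
        chosen = sorted(preds, key=lambda i: preds[i][1], reverse=True)[:k]
        for i in chosen:
            tokens[i] = preds[i][0]
        order.append(sorted(chosen))
    return tokens, order


if __name__ == "__main__":
    sentence = "diffusion models refine text in parallel".split()
    for steps in (1, 3, 6):
        out, order = iterative_denoise(sentence, steps)
        print(steps, "steps ->", order)
```

With `num_steps=1` every position is committed in a single round, while larger values spread the same positions over more rounds. In this toy, the commit order follows the random confidences rather than left-to-right position, which mirrors the idea of arbitrary-order generation in spirit only.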

Related papers