Fugu-MT Paper Translation (Overview): A Survey on Diffusion Language Models

Paper overview: A Survey on Diffusion Language Models

arxiv url: http://arxiv.org/abs/2508.10875v1
Date: Thu, 14 Aug 2025 17:47:22 GMT
Status: Translation complete
Last updated in system: 2025-08-15 22:24:48.43958
Title: A Survey on Diffusion Language Models
Title (reference translation): A Survey on Diffusion Language Models
Authors: Tianyi Li, Mingda Chen, Bowei Guo, Zhiqiang Shen
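Abstract summary: Diffusion language models (DLMs) are an alternative to the dominant autoregressive (AR) paradigm. DLMs have inherent advantages in reducing inference latency and capturing bidirectional context. Recent advances have allowed DLMs to show performance comparable to autoregressive models.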
Reference score (independently computed attention score): 30.00199970146068
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Diffusion Language Models (DLMs) are rapidly emerging as a powerful and promising alternative to the dominant autoregressive (AR) paradigm. By generating tokens in parallel through an iterative denoising process, DLMs possess inherent advantages in reducing inference latency and capturing bidirectional context, thereby enabling fine-grained control over the generation process. While achieving a several-fold speed-up, recent advancements have allowed DLMs to show performance comparable to their autoregressive counterparts, making them a compelling choice for various natural language processing tasks. In this survey, we provide a holistic overview of the current DLM landscape. We trace its evolution and relationship with other paradigms, such as autoregressive and masked language models, and cover both foundational principles and state-of-the-art models. Our work offers an up-to-date, comprehensive taxonomy and an in-depth analysis of current techniques, from pre-training strategies to advanced post-training methods. Another contribution of this survey is a thorough review of DLM inference strategies and optimizations, including improvements in decoding parallelism, caching mechanisms, and generation quality. We also highlight the latest approaches to multimodal extensions of DLMs and delineate their applications across various practical scenarios. Furthermore, our discussion addresses the limitations and challenges of DLMs, including efficiency, long-sequence handling, and infrastructure requirements, while outlining future research directions to sustain progress in this rapidly evolving field. Project GitHub is available at https://github.com/VILA-Lab/Awesome-DLMs.
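Abstract (reference translation): Diffusion language models (DLMs) are rapidly emerging as a powerful and promising alternative to the dominant autoregressive (AR) paradigm. By generating tokens in parallel through an iterative denoising process, DLMs have inherent advantages in reducing inference latency and capturing bidirectional context, which enables fine-grained control over the generation process. While achieving a several-fold speed-up, recent advances have allowed DLMs to show performance comparable to their autoregressive counterparts, making them an attractive choice for a variety of natural language processing tasks. This survey gives an overview of the current DLM landscape. We trace its evolution and its relationship to other paradigms, such as autoregressive and masked language models, and cover both foundational principles and state-of-the-art models. Our work provides an up-to-date, comprehensive taxonomy and a detailed analysis of current techniques, from pre-training strategies to advanced post-training methods. Another contribution of this survey is a thorough review of DLM inference strategies and optimizations, including improvements in decoding parallelism, caching mechanisms, and generation quality. We also highlight the latest approaches to multimodal extensions of DLMs and describe their applications across various practical scenarios. In addition, we discuss the limitations and challenges of DLMs, including efficiency, long-sequence handling, and infrastructure requirements, and outline future research directions to sustain progress in this rapidly evolving field. The project GitHub is available at https://github.com/VILA-Lab/Awesome-DLMs.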

Related papers