Fugu-MT 論文翻訳(概要): Ultra-Fast Language Generation via Discrete Diffusion Divergence Instruct

論文の概要: Ultra-Fast Language Generation via Discrete Diffusion Divergence Instruct

arxiv url: http://arxiv.org/abs/2509.25035v1
Date: Mon, 29 Sep 2025 16:55:44 GMT
ステータス: 翻訳完了
システム内更新日: 2025-09-30 22:32:20.139484
Title: Ultra-Fast Language Generation via Discrete Diffusion Divergence Instruct
Title（参考訳）: 離散拡散分散命令による超高速言語生成
Authors: Haoyang Zheng, Xinyang Liu, Cindy Xiangrui Kong, Nan Jiang, Zheyuan Hu, Weijian Luo, Wei Deng, Guang Lin,
Abstract要約: DiDi-Instructは、高速言語生成モデルにつながるトレーニングベースの手法である。 OpenWebTextでは、DiDi-Instructはすべてのアクセラレーションされた言語生成モデルより優れています。
参考スコア（独自算出の注目度）: 24.431216450821463
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Fast generation of language texts is the holy grail that people pursue in the AI era. In this work, we introduced Discrete Diffusion Divergence Instruct (DiDi-Instruct), a training-based method that leads to fast language generation models by initializing from a pre-trained (masked) discrete diffusion language model (dLLM). The resulting DiDi-Instruct model outperforms the dLLM counterparts and the GPT-2 baseline with 64x acceleration. In the theoretical part of the paper, we build the foundation of DiDi-Instruct in a framework of integral KL-divergence minimization, with practical training algorithms. We also introduce techniques like grouped reward normalization, intermediate-state matching, and the reward-guided ancestral sampler (RGAS) that significantly improve the training stability, the model coverage, and the inference performances. On OpenWebText, DiDi-Instruct outperforms all accelerated language generation models as well as the GPT-2 baseline and the standard dLLMs, achieving sample perplexities ranging from 62.2 (8 NFEs) to 18.4 (128 NFEs). These performance gains are accomplished with a negligible entropy loss of about 1% and 20x less additional training wall-clock time. We further validate the robustness and effectiveness of DiDi-Instruct through extensive ablation studies, model scaling, and the generation of discrete protein sequences. In conclusion, DiDi-Instruct is an efficient yet effective distillation method, enabling language generation in the blink of an eye. We will release both code and models at github.com/haoyangzheng-ai/didi-instruct.
Abstract（参考訳）: 言語のテキストの高速生成は、人々がAI時代に追求する聖杯です。本研究では,事前学習した(仮装した)離散拡散言語モデル(dLLM)から初期化することにより,高速な言語生成モデルを実現するトレーニングベース手法であるDisdisrete Diffusion Divergence Instruct (DiDi-Instruct)を紹介した。結果のDiDi-Instructモデルは、64xアクセラレーションでdLLMとGPT-2ベースラインを上回っている。論文の理論的部分では、実用的な学習アルゴリズムを用いて、KL分割最小化の枠組みを用いて、DiDi-Instructの基礎を構築する。また、グループ化された報酬正規化、中間状態マッチング、およびトレーニング安定性、モデルカバレッジ、推論性能を大幅に向上させる報酬誘導祖先サンプリング(RGAS)といった手法も導入する。 OpenWebText では、DiDi-Instruct は GPT-2 ベースラインと標準 dLLM だけでなく、全ての加速言語生成モデルよりも優れており、62.2 (8 NFEs) から 18.4 (128 NFEs) のサンプルパープレシティを達成している。これらの性能向上は、負のエントロピー損失約1%と20倍のトレーニングウォールタイムで達成される。さらに,DiDi-Instructの堅牢性と有効性について,広範囲なアブレーション研究,モデルスケーリング,離散タンパク質配列の生成を通じて検証した。結論として、DiDi-Instructは効率的かつ効果的な蒸留法であり、目の瞬きにおける言語生成を可能にする。 github.com/haoyangzheng-ai/di-instruct.comで、コードとモデルの両方をリリースします。

論文の概要: Ultra-Fast Language Generation via Discrete Diffusion Divergence Instruct

関連論文リスト