Fugu-MT 論文翻訳(概要): Ultra-Fast Language Generation via Discrete Diffusion Divergence Instruct

論文の概要: Ultra-Fast Language Generation via Discrete Diffusion Divergence Instruct

arxiv url: http://arxiv.org/abs/2509.25035v2
Date: Wed, 01 Oct 2025 17:45:09 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-02 14:33:21.818658
Title: Ultra-Fast Language Generation via Discrete Diffusion Divergence Instruct
Title（参考訳）: 離散拡散分散命令による超高速言語生成
Authors: Haoyang Zheng, Xinyang Liu, Cindy Xiangrui Kong, Nan Jiang, Zheyuan Hu, Weijian Luo, Wei Deng, Guang Lin,
Abstract要約: DiDi-Instructは、高速世代のために数ステップの学生を蒸留するトレーニングベースの方法である。 OpenWebText上でDiDi-Instructは62.2 (8 NFEs)から18.4 (128 NFEs)にパープレキシティを実現するこれらの利得には無視できるエントロピー損失(約1%)が伴い、競合するdLLM蒸留法と比較して、追加のトレーニングウォールタイム時間を20時間以上短縮する。
参考スコア（独自算出の注目度）: 24.431216450821463
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Fast and high-quality language generation is the holy grail that people pursue in the age of AI. In this work, we introduce Discrete Diffusion Divergence Instruct (DiDi-Instruct), a training-based method that initializes from a pre-trained (masked) discrete diffusion language model (dLLM) and distills a few-step student for fast generation. The resulting DiDi-Instruct model achieves comparable or superior performance to its dLLM teacher and the GPT-2 baseline while enabling up to 64$\times$ acceleration. The theoretical foundation of DiDi-Instruct is a novel framework based on integral KL-divergence minimization, which yields a practical training algorithm. We further introduce grouped reward normalization, intermediate-state matching, and the reward-guided ancestral sampler that significantly improve training stability, model coverage, and inference quality. On OpenWebText, DiDi-Instruct achieves perplexity from 62.2 (8 NFEs) to 18.4 (128 NFEs), which outperforms prior accelerated dLLMs and GPT-2 baseline. These gains come with a negligible entropy loss (around $1\%$) and reduce additional training wall-clock time by more than $20\times$ compared to competing dLLM distillation methods. We further validate the robustness and effectiveness of DiDi-Instruct through extensive ablation studies, model scaling, and the generation of discrete protein sequences. In conclusion, DiDi-Instruct is an efficient yet effective distillation method, enabling language generation in the blink of an eye. We will release both code and models at github.com/haoyangzheng-ai/didi-instruct.
Abstract（参考訳）: 高速で高品質な言語生成は、人々がAI時代に追求する聖杯です。本研究では,事前学習された離散拡散言語モデル(dLLM)から初期化して,高速な生成のために数ステップの学生を蒸留するトレーニングベース手法であるDisdisrete Diffusion Divergence Instruct (DiDi-Instruct)を紹介する。 DiDi-Instructモデルは、最大64$\times$Accelerationを可能にしながら、dLLMの教師とGPT-2のベースラインと同等または優れたパフォーマンスを達成する。 DiDi-Instructの理論的基礎は、KL分割最小化に基づく新しいフレームワークであり、実用的なトレーニングアルゴリズムを生成する。さらに,グループ化された報酬正規化,中間状態マッチング,およびトレーニング安定性,モデルカバレッジ,推論品質を著しく向上する報酬誘導祖先サンプリングを導入する。 OpenWebTextでは、DiDi-Instructは62.2 (8 NFEs)から18.4 (128 NFEs)にパープレキシティを達成し、それ以前の加速dLLMとGPT-2ベースラインより優れている。これらの利得には無視できるエントロピー損失(約1\%$)が伴い、競合するdLLM蒸留法と比較して、追加のトレーニングウォールタイム時間を20\times$以上削減する。さらに,DiDi-Instructの堅牢性と有効性について,広範囲なアブレーション研究,モデルスケーリング,離散タンパク質配列の生成を通じて検証した。結論として、DiDi-Instructは効率的かつ効果的な蒸留法であり、目の瞬きにおける言語生成を可能にする。 github.com/haoyangzheng-ai/di-instruct.comで、コードとモデルの両方をリリースします。

論文の概要: Ultra-Fast Language Generation via Discrete Diffusion Divergence Instruct

関連論文リスト