Fugu-MT Paper Translation (Abstract): ParallelBench: Understanding the Trade-offs of Parallel Decoding in Diffusion LLMs

Paper overview: ParallelBench: Understanding the Trade-offs of Parallel Decoding in Diffusion LLMs

arxiv url: http://arxiv.org/abs/2510.04767v1
Date: Mon, 06 Oct 2025 12:41:31 GMT
Status: Translation complete
System update date: 2025-10-07 16:52:59.861074
Title: ParallelBench: Understanding the Trade-offs of Parallel Decoding in Diffusion LLMs
Title (reference translation): ParallelBench: Understanding the Trade-offs of Parallel Decoding in Diffusion LLMs
Authors: Wonjun Kang, Kevin Galim, Seunghyuk Oh, Minjae Lee, Yuchen Zeng, Shuibai Zhang, Coleman Hooper, Yuezhou Hu, Hyung Il Koo, Nam Ik Cho, Kangwook Lee
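Abstract summary: Diffusion LLMs (dLLMs) are attracting growing interest for their potential to dramatically accelerate inference through parallel decoding. Existing work largely overlooks the inherent challenges of this approach, and evaluation on standard benchmarks is not sufficient to capture the quality degradation that parallel decoding causes. We therefore propose ParallelBench, the first benchmark designed specifically for dLLMs. Our findings underscore the need for innovative decoding methods that can overcome the current speed-quality trade-off.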
Reference score (independently computed attention measure): 31.387806058620683
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: While most autoregressive LLMs are constrained to one-by-one decoding, diffusion LLMs (dLLMs) have attracted growing interest for their potential to dramatically accelerate inference through parallel decoding. Despite this promise, the conditional independence assumption in dLLMs causes parallel decoding to ignore token dependencies, inevitably degrading generation quality when these dependencies are strong. However, existing works largely overlook these inherent challenges, and evaluations on standard benchmarks (e.g., math and coding) are not sufficient to capture the quality degradation caused by parallel decoding. To address this gap, we first provide an information-theoretic analysis of parallel decoding. We then conduct case studies on analytically tractable synthetic list operations from both data distribution and decoding strategy perspectives, offering quantitative insights that highlight the fundamental limitations of parallel decoding. Building on these insights, we propose ParallelBench, the first benchmark specifically designed for dLLMs, featuring realistic tasks that are trivial for humans and autoregressive LLMs yet exceptionally challenging for dLLMs under parallel decoding. Using ParallelBench, we systematically analyze both dLLMs and autoregressive LLMs, revealing that: (i) dLLMs under parallel decoding can suffer dramatic quality degradation in real-world scenarios, and (ii) current parallel decoding strategies struggle to adapt their degree of parallelism based on task difficulty, thus failing to achieve meaningful speedup without compromising quality. Our findings underscore the pressing need for innovative decoding methods that can overcome the current speed-quality trade-off. We release our benchmark to help accelerate the development of truly efficient dLLMs.
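Abstract (reference translation): Most autoregressive LLMs are restricted to decoding one token at a time, whereas diffusion LLMs (dLLMs) are drawing growing interest because parallel decoding could dramatically accelerate their inference. Despite this promise, the conditional independence assumption in dLLMs makes parallel decoding ignore dependencies between tokens, which inevitably lowers generation quality when those dependencies are strong. Existing studies, however, largely overlook these inherent challenges, and evaluation on standard benchmarks (for example, math and coding) is not sufficient to capture the quality degradation caused by parallel decoding. To address this gap, we first give an information-theoretic analysis of parallel decoding. We then carry out case studies on analytically tractable synthetic list operations, from the perspectives of both data distribution and decoding strategy, providing quantitative insights that highlight the fundamental limitations of parallel decoding. Building on these insights, we propose ParallelBench, the first benchmark designed specifically for dLLMs, featuring realistic tasks that are trivial for humans and autoregressive LLMs yet exceptionally hard for dLLMs under parallel decoding. Using ParallelBench, we systematically analyze both dLLMs and autoregressive LLMs and find that: (i) dLLMs under parallel decoding can suffer dramatic quality degradation in real-world scenarios, and (ii) current parallel decoding strategies struggle to adapt their degree of parallelism to task difficulty, and so fail to achieve meaningful speedup without compromising quality. Our findings underscore the need for innovative decoding methods that can overcome the current speed-quality trade-off. We release our benchmark to help accelerate the development of truly efficient dLLMs.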
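
The abstract's point that the conditional independence assumption makes parallel decoding ignore token dependencies can be illustrated with a toy two-token case. The following is a minimal Python sketch and is not taken from the paper: the joint distribution and the names `joint`, `marginals`, `parallel_distribution`, and `invalid_mass` are hypothetical, chosen only for illustration. It compares a joint distribution over two adjacent tokens with the distribution obtained when each position is sampled independently from its own marginal, which is one simple way to model decoding both positions in a single parallel step.

```python
from itertools import product

# Hypothetical toy joint distribution over two adjacent tokens (not from the paper).
# The second token is fully determined by the first, so the dependency is strong.
joint = {("New", "York"): 0.5, ("Los", "Angeles"): 0.5}


def marginals(joint):
    """Return the per-position marginals of a two-token joint distribution."""
    first, second = {}, {}
    for (a, b), p in joint.items():
        first[a] = first.get(a, 0.0) + p
        second[b] = second.get(b, 0.0) + p
    return first, second


def parallel_distribution(joint):
    """Distribution obtained when both positions are sampled independently."""
    first, second = marginals(joint)
    return {
        (a, b): pa * pb
        for (a, pa), (b, pb) in product(first.items(), second.items())
    }


def invalid_mass(joint):
    """Probability that independent sampling yields a pair the joint forbids."""
    parallel = parallel_distribution(joint)
    return sum(p for pair, p in parallel.items() if joint.get(pair, 0.0) == 0.0)


print(invalid_mass(joint))  # 0.5
```

In this toy case, half of the probability mass under independent sampling falls on pairs such as ("New", "Angeles") that the joint distribution never produces, whereas decoding the two tokens one after another would avoid them entirely. When the second token does not depend on the first, the same calculation gives zero invalid mass, which matches the abstract's qualifier that the degradation arises when dependencies are strong.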

Related papers