Fugu-MT 論文翻訳(概要): A Survey on Parallel Text Generation: From Parallel Decoding to Diffusion Language Models

論文の概要: A Survey on Parallel Text Generation: From Parallel Decoding to Diffusion Language Models

arxiv url: http://arxiv.org/abs/2508.08712v2
Date: Wed, 13 Aug 2025 13:24:25 GMT
ステータス: 翻訳完了
システム内更新日: 2025-08-14 14:06:00.573174
Title: A Survey on Parallel Text Generation: From Parallel Decoding to Diffusion Language Models
Title（参考訳）: 並列テキスト生成に関する調査:並列復号化から拡散言語モデルへ
Authors: Lingzhe Zhang, Liancheng Fang, Chiming Duan, Minghua He, Leyi Pan, Pei Xiao, Shiyu Huang, Yunpeng Zhai, Xuming Hu, Philip S. Yu, Aiwei Liu,
Abstract要約: トークン・バイ・トークン生成のボトルネックを破り、推論効率を向上させることを目的とした並列テキスト生成技術。既存のアプローチをARベースのパラダイムと非ARベースのパラダイムに分類し、各カテゴリの中核技術について詳細に検討する。我々は、最近の進歩を強調し、オープンな課題を特定し、並列テキスト生成における将来的な研究の方向性を概説する。
参考スコア（独自算出の注目度）: 49.97547209846335
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: As text generation has become a core capability of modern Large Language Models (LLMs), it underpins a wide range of downstream applications. However, most existing LLMs rely on autoregressive (AR) generation, producing one token at a time based on previously generated context-resulting in limited generation speed due to the inherently sequential nature of the process. To address this challenge, an increasing number of researchers have begun exploring parallel text generation-a broad class of techniques aimed at breaking the token-by-token generation bottleneck and improving inference efficiency. Despite growing interest, there remains a lack of comprehensive analysis on what specific techniques constitute parallel text generation and how they improve inference performance. To bridge this gap, we present a systematic survey of parallel text generation methods. We categorize existing approaches into AR-based and Non-AR-based paradigms, and provide a detailed examination of the core techniques within each category. Following this taxonomy, we assess their theoretical trade-offs in terms of speed, quality, and efficiency, and examine their potential for combination and comparison with alternative acceleration strategies. Finally, based on our findings, we highlight recent advancements, identify open challenges, and outline promising directions for future research in parallel text generation. We have also created a GitHub repository for indexing relevant papers and open resources available at https://github.com/zhanglingzhe0820/Awesome-Parallel-Text-Generation.
Abstract（参考訳）: テキスト生成が現代のLarge Language Models (LLM) のコア機能となっているため、幅広い下流アプリケーションを支える。しかし、ほとんどの既存のLCMは自己回帰(AR)生成に依存しており、プロセスの本質的にシーケンシャルな性質のため、以前に生成されたコンテキスト置換に基づいて1つのトークンを同時に生成する。この課題に対処するため、トークン・バイ・トークン生成のボトルネックを突破し、推論効率を向上させるための幅広い手法である、並列テキスト生成を探求する研究者が増えている。関心が高まりつつあるにもかかわらず、特定のテクニックが並列テキスト生成を構成するのか、どのように推論性能を改善するのか、包括的な分析はいまだに残っていない。このギャップを埋めるために,並列テキスト生成手法を体系的に調査する。既存のアプローチをARベースのパラダイムと非ARベースのパラダイムに分類し、各カテゴリの中核技術について詳細に検討する。この分類に続き、速度、品質、効率の観点から理論上のトレードオフを評価し、代替加速戦略と組み合わせて比較する可能性を検討する。最後に,本研究の成果を踏まえて,最近の進歩を強調し,オープン課題を特定し,並列テキスト生成における今後の研究の方向性を概説する。また、関連する論文やオープンリソースをインデックスするGitHubリポジトリをhttps://github.com/zhanglingzhe0820/Awesome-Parallel-Text-Generation.comで作成しました。

論文の概要: A Survey on Parallel Text Generation: From Parallel Decoding to Diffusion Language Models

関連論文リスト