Fugu-MT 論文翻訳(概要): Beyond Autoregression: An Empirical Study of Diffusion Large Language Models for Code Generation

論文の概要: Beyond Autoregression: An Empirical Study of Diffusion Large Language Models for Code Generation

arxiv url: http://arxiv.org/abs/2509.11252v1
Date: Sun, 14 Sep 2025 12:51:06 GMT
ステータス: 翻訳完了
システム内更新日: 2025-09-16 17:26:22.997008
Title: Beyond Autoregression: An Empirical Study of Diffusion Large Language Models for Code Generation
Title（参考訳）: 自己回帰を超えて:コード生成のための拡散大言語モデルに関する実証的研究
Authors: Chengze li, Yitong Zhang, Jia Li, Liyi Cai, Ge Li,
Abstract要約: 既存のLLMは主に自己回帰生成、すなわちコードトークンを左から右に生成する。近年の拡散LDMは有望な代替手段として出現している。コード生成のための拡散LDMを探索する最初の実証的研究について述べる。
参考スコア（独自算出の注目度）: 21.991328297811275
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: LLMs have become the mainstream approaches to code generation. Existing LLMs mainly employ autoregressive generation, i.e. generating code token-by-token from left to right. However, the underlying autoregressive generation has two limitations in code generation. First, autoregressive LLMs only generate a token at each step, showing low efficiency in practice. Second, programming is a non-sequential process involving back-and-forth editing, while autoregressive LLMs only employ the left-to-right generation order. These two intrinsic limitations hinder the further development of LLMs in code generation. Recently, diffusion LLMs have emerged as a promising alternative. Diffusion LLMs address the above limitations with two advances, including multi-token prediction (i.e. generating multiple tokens at each step) and flexible generation order (i.e. flexibly determining which positions to generate tokens). However, there is no systematic study exploring diffusion LLMs in code generation. To bridge the knowledge gap, we present the first empirical study of diffusion LLMs for code generation. Our study involves 9 representative diffusion LLMs and conduct experiments on 4 widely used benchmarks. Based on the results, we summarize the following findings. (1) Existing diffusion LLMs are competitive with autoregressive LLMs with similar sizes. (2) Diffusion LLMs have a stronger length extrapolation ability than autoregressive LLMs and perform better in long code understanding. (3) We explore factors impacting the effectiveness and efficiency of diffusion LLMs, and provide practical guidance. (4) We discuss several promising further directions to improve diffusion LLMs on code generation. We open-source all source code, data, and results to facilitate the following research. The code is publicly available at https://github.com/zhangyitonggg/dllm4code.
Abstract（参考訳）: LLMはコード生成の主流のアプローチとなっている。既存のLLMは主に自己回帰生成、すなわちコードトークンを左から右に生成する。しかし、基盤となる自己回帰生成には、コード生成に2つの制限がある。第一に、自己回帰LDMは各ステップでトークンしか生成せず、実際は低い効率を示す。第二に、プログラミングはバック・フォー・フォー・ザ・編集を含む非シークエンシャルなプロセスであり、自動回帰LDMは左から右への生成順序のみを使用する。これら2つの固有の制限は、コード生成におけるLLMのさらなる発展を妨げる。近年, 拡散LDMは有望な代替手段として出現している。拡散LDMは、マルチトークン予測(各ステップで複数のトークンを生成する)とフレキシブル生成順序(トークンを生成するためにどの位置を柔軟に決定するか)という2つの進歩によって上記の制限に対処する。しかし、コード生成において拡散LDMを探索する体系的な研究は存在しない。知識ギャップを埋めるため,コード生成のための拡散LDMの実証的研究を行った。本研究は,9つの代表拡散LDMと4つの広く使用されているベンチマーク実験に関するものである。この結果をもとに,以下の知見をまとめた。 1) 既存の拡散LDMは, 同様の大きさの自己回帰LDMと競合する。 2)拡散LDMは自己回帰LDMよりも長大な外挿能力を有し,長大なコード理解に優れる。 (3)拡散LDMの有効性と効率に影響を与える要因について検討し,実践的なガイダンスを提供する。 (4) コード生成における拡散LDMを改善するためのいくつかの将来的な方向性について論じる。私たちは、以下の研究を促進するために、すべてのソースコード、データ、結果をオープンソースにしています。コードはhttps://github.com/zhangyitonggg/dllm4codeで公開されている。

論文の概要: Beyond Autoregression: An Empirical Study of Diffusion Large Language Models for Code Generation

関連論文リスト