Fugu-MT 論文翻訳(概要): DiffTester: Accelerating Unit Test Generation for Diffusion LLMs via Repetitive Pattern

論文の概要: DiffTester: Accelerating Unit Test Generation for Diffusion LLMs via Repetitive Pattern

arxiv url: http://arxiv.org/abs/2509.24975v1
Date: Mon, 29 Sep 2025 16:04:18 GMT
ステータス: 翻訳完了
システム内更新日: 2025-09-30 22:32:20.115676
Title: DiffTester: Accelerating Unit Test Generation for Diffusion LLMs via Repetitive Pattern
Title（参考訳）: DiffTester: 反復パターンによる拡散LDMのためのユニットテスト生成の高速化
Authors: Lekang Yang, Yuetong Liu, Yitong Zhang, Jia Li,
Abstract要約: DiffTesterは、単体テスト生成(UTG)におけるdLLMに適したアクセラレーションフレームワークである。 DiffTesterは、出力の品質を損なうことなく、各ステップで生成されるトークンの数を適応的に増加させる。最初はPythonに限られていたTestEvalベンチマークを拡張し、JavaやC++などの追加のプログラミング言語を導入しました。
参考スコア（独自算出の注目度）: 6.901203999358967
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Software development relies heavily on extensive unit testing, which makes the efficiency of automated Unit Test Generation (UTG) particularly important. However, most existing LLMs generate test cases one token at a time in each forward pass, which leads to inefficient UTG. Recently, diffusion LLMs (dLLMs) have emerged, offering promising parallel generation capabilities and showing strong potential for efficient UTG. Despite this advantage, their application to UTG is still constrained by a clear trade-off between efficiency and test quality, since increasing the number of tokens generated in each step often causes a sharp decline in the quality of test cases. To overcome this limitation, we present DiffTester, an acceleration framework specifically tailored for dLLMs in UTG. The key idea of DiffTester is that unit tests targeting the same focal method often share repetitive structural patterns. By dynamically identifying these common patterns through abstract syntax tree analysis during generation, DiffTester adaptively increases the number of tokens produced at each step without compromising the quality of the output. To enable comprehensive evaluation, we extend the original TestEval benchmark, which was limited to Python, by introducing additional programming languages including Java and C++. Extensive experiments on three benchmarks with two representative models show that DiffTester delivers significant acceleration while preserving test coverage. Moreover, DiffTester generalizes well across different dLLMs and programming languages, providing a practical and scalable solution for efficient UTG in software development. Code and data are publicly available at https://github.com/wellbeingyang/DLM4UTG-open .
Abstract（参考訳）: ソフトウェア開発は大規模な単体テストに大きく依存しているため、自動単体テスト生成(UTG)の効率性は特に重要である。しかし、既存のほとんどのLCMは、各前方通過で1回に1つのトークンを発生させ、非効率なUTGをもたらす。近年、拡散LDM(dLLMs)が出現し、有望な並列生成機能を提供し、効率的なUTGの可能性を示している。このような利点にもかかわらず、UTGへの適用は、効率性とテスト品質の明確なトレードオフによって制限されている。この制限を克服するために、UTGのdLLM用に特別に調整されたアクセラレーションフレームワークであるDiffTesterを紹介する。 DiffTesterのキーとなる考え方は、同じフォーカスメソッドをターゲットにした単体テストは、しばしば反復的な構造パターンを共有することである。生成中の抽象構文木解析を通じてこれらの共通パターンを動的に識別することにより、DiffTesterは、出力の品質を損なうことなく、各ステップで生成されるトークンの数を適応的に増加させる。包括的な評価を可能にするため、JavaやC++を含む追加のプログラミング言語を導入することで、Pythonに制限された最初のTestEvalベンチマークを拡張しました。 2つの代表的なモデルを持つ3つのベンチマークに関する大規模な実験は、DiffTesterがテストカバレッジを保ちながら大きな加速を提供することを示している。さらに、DiffTesterは様々なdLLMやプログラミング言語にまたがってうまく一般化し、ソフトウェア開発において効率的なUTGのための実用的でスケーラブルなソリューションを提供する。コードとデータはhttps://github.com/wellbeingyang/DLM4UTG-openで公開されている。

論文の概要: DiffTester: Accelerating Unit Test Generation for Diffusion LLMs via Repetitive Pattern

関連論文リスト