Fugu-MT 論文翻訳(概要): An Empirical Study of Speculative Decoding on Software Engineering Tasks

論文の概要: An Empirical Study of Speculative Decoding on Software Engineering Tasks

arxiv url: http://arxiv.org/abs/2604.26469v1
Date: Wed, 29 Apr 2026 09:26:13 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-30 15:59:36.33222
Title: An Empirical Study of Speculative Decoding on Software Engineering Tasks
Title（参考訳）: ソフトウェア工学タスクにおける投機的復号化に関する実証的研究
Authors: Yijia Li, Junkai Chen, Xing Hu, Xin Xia,
Abstract要約: 大規模言語モデル(LLM)は、関数レベルのコード生成から複雑なリポジトリレベルまで、ソフトウェア工学(SE)タスクに広く使われている。しかしながら、自動回帰推論のレイテンシが高いことは大きなボトルネックであり、インタラクティブな環境へのデプロイを妨げる。本研究は,SEタスクにおける投機的復号化(SD)の有効性を評価するための,最初の体系的実証的研究である。
参考スコア（独自算出の注目度）: 10.08815388769347
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Large Language Models (LLMs) have become widely used for Software Engineering (SE) tasks, spanning from function-level code generation to complex repository-level workflows. However, the high latency of autoregressive inference remains a significant bottleneck, hindering their deployment in interactive environments. While Speculative Decoding (SD) offers a promising technique for lossless acceleration, prior research on long-context repository-level tasks and complex agentic interactions remains limited. To bridge this gap, we present the first systematic empirical study to evaluate the effectiveness of SD in SE tasks. We systematically benchmark a comprehensive spectrum of strategies, encompassing both model-based and model-free methods, across representative generation, editing, and repair scenarios. Our empirical results indicate that SD demonstrates clear potential for accelerating inference, particularly for smaller models that achieve higher speedups than those of their larger counterparts. We find that the effectiveness of SD methods varies across different task scenarios. Model-based approaches are well-suited for code generation, whereas model-free methods are better adapted to repository-level repair and editing scenarios. Furthermore, we observe that the repetitiveness of SE tasks improves the performance of model-free methods. In contrast to natural language tasks, the higher predictability of SE tasks allows for more aggressive hyperparameters. Our findings are summarized as guidelines to help increase inference efficiency for SE scenarios.
Abstract（参考訳）: 大規模言語モデル(LLM)は、関数レベルのコード生成から複雑なリポジトリレベルのワークフローまで、ソフトウェア工学(SE)タスクに広く使われている。しかしながら、自動回帰推論のレイテンシが高いことは大きなボトルネックであり、インタラクティブな環境へのデプロイを妨げる。 Speculative Decoding (SD)は、ロスレスアクセラレーションのための有望なテクニックを提供するが、長いコンテキストのリポジトリレベルのタスクや複雑なエージェント間相互作用に関する以前の研究は限定的のままである。このギャップを埋めるために,本研究は,SEタスクにおけるSDの有効性を評価するための,最初の体系的実証的研究である。我々は、モデルベースとモデルフリーの両方の手法を網羅した総合的な戦略スペクトルを、代表生成、編集、修復シナリオにわたって体系的にベンチマークする。実験の結果、SDは推論を加速させる可能性を示しており、特により大きなモデルよりも高速なモデルでは顕著であることがわかった。 SD手法の有効性はタスクのシナリオによって異なることが判明した。モデルベースのアプローチはコード生成に適していますが、モデルフリーメソッドはリポジトリレベルの修復と編集のシナリオに適しています。さらに,SEタスクの反復性がモデルフリー手法の性能を向上させることが観察された。自然言語タスクとは対照的に、SEタスクの予測可能性の向上により、より積極的なハイパーパラメータが実現される。本研究は,SEシナリオの推論効率向上のためのガイドラインとして要約した。

論文の概要: An Empirical Study of Speculative Decoding on Software Engineering Tasks

関連論文リスト