Fugu-MT 論文翻訳(概要): Bridging Developer Instructions and Code Completion Through Instruction-Aware Fill-in-the-Middle Paradigm

論文の概要: Bridging Developer Instructions and Code Completion Through Instruction-Aware Fill-in-the-Middle Paradigm

arxiv url: http://arxiv.org/abs/2509.24637v1
Date: Mon, 29 Sep 2025 11:45:54 GMT
ステータス: 翻訳完了
システム内更新日: 2025-09-30 22:32:19.948371
Title: Bridging Developer Instructions and Code Completion Through Instruction-Aware Fill-in-the-Middle Paradigm
Title（参考訳）: インストラクション・イン・ザ・ミドルパラダイムによる開発者命令のブリッジとコード補完
Authors: Zhensu Sun, Chengran Yang, Chao Peng, Pengfei Gao, Xiaoning Du, Li Li, David Lo,
Abstract要約: Instruction-aware Fill-in-the-Middle (IFIM)を導入し、コード補完モデルを強化する。 IFIMは明示的な命令セクションを入力に組み込んで、モデルが三つ子から学習できるようにする。結果:IFIMは命令フォロー機能を大幅に改善し、Pass@1スコアは84.6%から93.6%に向上した。
参考スコア（独自算出の注目度）: 14.429414087640886
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Large Language Models (LLMs) have significantly advanced code completion, yet they often fail when the developer's intent is underspecified in the code context. To address this, developers usually add natural language instructions (e.g., comments) into the code context to clarify their intent. However, existing code LLMs applied for code completion systems merely undergo a fill-in-the-middle (FIM) pre-training, which struggles to leverage this information effectively due to the lack of instruction-like training data. Existing instruction-tuning techniques, which improve instruction-following in general code generation, paradoxically degrade FIM performance, forcing a trade-off between instruction-following and infilling capabilities. To address this gap, we introduce Instruction-aware Fill-in-the-Middle (IFIM), an instruction-tuning method specifically designed to enhance FIM code completion models. IFIM extends the conventional FIM training objective by incorporating an explicit instruction section into the input, enabling the model to learn from (prefix, instruction, suffix) triplets. This approach allows the model to effectively leverage developer-provided directives while preserving its core completion abilities when no instructions are present. To facilitate this, we constructed a large-scale dataset by using GPT-4o to generate concise, intent-focused instructions for code infilling examples. We evaluated IFIM by applying it to two popular base models, Deepseek-Coder and Qwen2.5-Coder, on the benchmarks derived from HumanEval-infilling and RepoMasterEval. The results demonstrate that IFIM significantly improves instruction-following capabilities, boosting the Pass@1 score from 84.6% to 93.6% on HumanEval-infilling. Moreover, this enhancement does not compromise the models' original performance on FIM code completion tasks with no instructions provided.
Abstract（参考訳）: 大規模言語モデル(LLM)はコード補完が大幅に進んでいるが、コードのコンテキストにおいて開発者の意図が過小評価されている場合、しばしば失敗する。これを解決するために、開発者は通常、意図を明確にするために、コードコンテキストに自然言語命令(例:コメント)を追加する。しかし、コード補完システムに適用される既存のコードLLMは、単にFIM事前トレーニングを実行しているだけであり、命令のようなトレーニングデータがないため、この情報を効果的に活用するのに苦労している。一般的なコード生成における命令追従を改善する既存の命令追従技術は、パラドックス的にFIM性能を低下させ、命令追従機能とインフィル機能とのトレードオフを強制する。このギャップに対処するために、FIMコード補完モデルを強化するために特別に設計された命令チューニング手法であるIFIM(Instruction-aware Fill-in-the-Middle)を導入する。 IFIMは、明示的な命令セクションを入力に組み込むことで、従来のFIMトレーニング目標を拡張し、モデルが三つ子(前置、命令、接尾辞)から学習できるようにする。このアプローチにより、ディベロッパが提供するディレクティブを効果的に活用でき、命令がない場合にコアコンプリート能力を維持することができる。これを容易にするために, GPT-4oを用いて, コード入力例の簡潔で意図を重視した命令を生成する大規模データセットを構築した。我々は、HumanEval-infillingとRepoMasterEvalから得られたベンチマークに基づいて、それを2つの人気のあるベースモデルであるDeepseek-CoderとQwen2.5-Coderに適用することでIFIMを評価した。その結果、IFIMは命令追跡能力を大幅に改善し、Pass@1のスコアは84.6%から93.6%に向上した。さらに、この拡張は、FIMコード補完タスクにおけるモデルのオリジナルのパフォーマンスを、指示なしで損なうことはない。

論文の概要: Bridging Developer Instructions and Code Completion Through Instruction-Aware Fill-in-the-Middle Paradigm

関連論文リスト