Fugu-MT 論文翻訳(概要): Impact-driven Context Filtering For Cross-file Code Completion

論文の概要: Impact-driven Context Filtering For Cross-file Code Completion

arxiv url: http://arxiv.org/abs/2508.05970v1
Date: Fri, 08 Aug 2025 03:08:19 GMT
ステータス: 翻訳完了
システム内更新日: 2025-08-11 20:39:06.056477
Title: Impact-driven Context Filtering For Cross-file Code Completion
Title（参考訳）: クロスファイルコード補完のためのインパクト駆動型コンテキストフィルタリング
Authors: Yanzhou Li, Shangqing Liu, Kangjie Chen, Tianwei Zhang, Yang Liu,
Abstract要約: Retrieval-augmented Generation (RAG)は、最近リポジトリレベルのコード補完の可能性を示している。検索したクロスファイルコンテキストのコントリビューションをよりよく理解するために,検索した各コードチャンクが完了に与える影響を評価する。これらの結果は、リポジトリレベルのコード補完の正確性、効率、および属性性を高めるためのCODEの可能性を強調します。
参考スコア（独自算出の注目度）: 19.717276276962483
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Retrieval-augmented generation (RAG) has recently demonstrated considerable potential for repository-level code completion, as it integrates cross-file knowledge with in-file preceding code to provide comprehensive contexts for generation. To better understand the contribution of the retrieved cross-file contexts, we introduce a likelihood-based metric to evaluate the impact of each retrieved code chunk on the completion. Our analysis reveals that, despite retrieving numerous chunks, only a small subset positively contributes to the completion, while some chunks even degrade performance. To address this issue, we leverage this metric to construct a repository-level dataset where each retrieved chunk is labeled as positive, neutral, or negative based on its relevance to the target completion. We then propose an adaptive retrieval context filtering framework, CODEFILTER, trained on this dataset to mitigate the harmful effects of negative retrieved contexts in code completion. Extensive evaluation on the RepoEval and CrossCodeLongEval benchmarks demonstrates that CODEFILTER consistently improves completion accuracy compared to approaches without filtering operations across various tasks. Additionally, CODEFILTER significantly reduces the length of the input prompt, enhancing computational efficiency while exhibiting strong generalizability across different models. These results underscore the potential of CODEFILTER to enhance the accuracy, efficiency, and attributability of repository-level code completion.
Abstract（参考訳）: Retrieval-augmented Generation (RAG)は、最近リポジトリレベルのコード補完の可能性を示した。検索したクロスファイルコンテキストのコントリビューションをよりよく理解するために,検索した各コードチャンクが完了に与える影響を評価するための可能性ベースのメトリクスを導入する。分析の結果,多数のチャンクを回収したにもかかわらず,少数のサブセットだけが完了に肯定的な貢献をする一方で,一部のチャンクは性能を低下させることがわかった。この問題に対処するため、我々はこの指標を利用して、検索したチャンクを対象の完了との関係に基づいて、正、中、負のラベルでラベル付けしたリポジトリレベルのデータセットを構築する。次に、このデータセットに基づいて訓練された適応型検索コンテキストフィルタリングフレームワークCODEFILTERを提案し、コード補完における負の検索コンテキストの有害な影響を軽減する。 RepoEvalとCrossCodeLongEvalベンチマークの大規模な評価は、CODEFILTERが様々なタスクにまたがる操作をフィルタリングしないアプローチと比較して、コンプリート精度を一貫して改善していることを示している。さらに、CODEFILTERは入力プロンプトの長さを大幅に削減し、異なるモデル間で強い一般化性を示しながら計算効率を向上する。これらの結果は、リポジトリレベルのコード補完の正確性、効率、属性性を向上するCODEFILTERの可能性を強調している。

論文の概要: Impact-driven Context Filtering For Cross-file Code Completion

関連論文リスト