Fugu-MT 論文翻訳(概要): CodeRAG: Finding Relevant and Necessary Knowledge for Retrieval-Augmented Repository-Level Code Completion

論文の概要: CodeRAG: Finding Relevant and Necessary Knowledge for Retrieval-Augmented Repository-Level Code Completion

arxiv url: http://arxiv.org/abs/2509.16112v1
Date: Fri, 19 Sep 2025 15:57:40 GMT
ステータス: 翻訳完了
システム内更新日: 2025-09-22 18:18:11.239153
Title: CodeRAG: Finding Relevant and Necessary Knowledge for Retrieval-Augmented Repository-Level Code Completion
Title（参考訳）: CodeRAG: Retrieval-Augmented Repository-Level Code Completionの関連知識と必要知識を見つける
Authors: Sheng Zhang, Yifan Ding, Shuquan Lian, Shun Song, Hui Li,
Abstract要約: リポジトリレベルのコード補完は、リポジトリからのより広範な情報に基づいて、未完成のコードを自動的に予測する。 CodeRAGは、リポジトリレベルのコード補完を検索するために必要な知識を特定するためのフレームワークである。
参考スコア（独自算出の注目度）: 11.329578913209623
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Repository-level code completion automatically predicts the unfinished code based on the broader information from the repository. Recent strides in Code Large Language Models (code LLMs) have spurred the development of repository-level code completion methods, yielding promising results. Nevertheless, they suffer from issues such as inappropriate query construction, single-path code retrieval, and misalignment between code retriever and code LLM. To address these problems, we introduce CodeRAG, a framework tailored to identify relevant and necessary knowledge for retrieval-augmented repository-level code completion. Its core components include log probability guided query construction, multi-path code retrieval, and preference-aligned BestFit reranking. Extensive experiments on benchmarks ReccEval and CCEval demonstrate that CodeRAG significantly and consistently outperforms state-of-the-art methods. The implementation of CodeRAG is available at https://github.com/KDEGroup/CodeRAG.
Abstract（参考訳）: リポジトリレベルのコード補完は、リポジトリからのより広範な情報に基づいて、未完成のコードを自動的に予測する。 Code Large Language Models(コードLLM)の最近の進歩は、リポジトリレベルのコード補完方法の開発を加速させ、有望な結果をもたらした。それでも、不適切なクエリ構築、シングルパスコード検索、コードレトリバーとコードLLMの不一致といった問題に悩まされている。これらの問題に対処するため,我々は,リポジトリレベルのコード補完のための関連知識と必要な知識を識別するフレームワークであるCodeRAGを紹介した。その中核となるコンポーネントは、ログ確率ガイドクエリ構築、マルチパスコード検索、および優先順のBestFitの再ランクである。ベンチマークに関する大規模な実験 ReccEval と CCEval は、CodeRAG が最先端の手法を大幅に上回ることを示した。 CodeRAGの実装はhttps://github.com/KDEGroup/CodeRAGで公開されている。

論文の概要: CodeRAG: Finding Relevant and Necessary Knowledge for Retrieval-Augmented Repository-Level Code Completion

関連論文リスト