Fugu-MT Paper Translation (Summary): Enhancing LLM-based Fault Localization with a Functionality-Aware Retrieval-Augmented Generation Framework

Paper summary: Enhancing LLM-based Fault Localization with a Functionality-Aware Retrieval-Augmented Generation Framework

arxiv url: http://arxiv.org/abs/2509.20552v1
Date: Wed, 24 Sep 2025 20:37:11 GMT
Status: translation complete
Last updated in system: 2025-09-26 20:58:12.58377
Title: Enhancing LLM-based Fault Localization with a Functionality-Aware Retrieval-Augmented Generation Framework
Title (reference translation): Enhancing LLM-Based Fault Localization with a Functionality-Aware Retrieval-Augmented Generation Framework
Authors: Xinyu Shi, Zhenhao Li, An Ran Chen
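Abstract summary: FaR-Loc is a framework that enhances method-level fault localization. It consists of three key components: LLM Functionality Extraction, Semantic Dense Retrieval, and LLM Re-ranking. Experiments on the widely used Defects4J benchmark show that FaR-Loc outperforms state-of-the-art LLM-based baselines.
Reference score (independently computed attention metric): 14.287359838639608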
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Fault localization (FL) is a critical but time-consuming task in software debugging, aiming to identify faulty code elements. While recent advances in large language models (LLMs) have shown promise for FL, they often struggle with complex systems due to the lack of project-specific knowledge and the difficulty of navigating large projects. To address these limitations, we propose FaR-Loc, a novel framework that enhances method-level FL by integrating LLMs with retrieval-augmented generation (RAG). FaR-Loc consists of three key components: LLM Functionality Extraction, Semantic Dense Retrieval, and LLM Re-ranking. First, given a failed test and its associated stack trace, the LLM Functionality Extraction module generates a concise natural language description that captures the failing behavior. Next, the Semantic Dense Retrieval component leverages a pre-trained code-understanding encoder to embed both the functionality description (natural language) and the covered methods (code) into a shared semantic space, enabling the retrieval of methods with similar functional behavior. Finally, the LLM Re-ranking module reorders the retrieved methods based on their contextual relevance. Our experiments on the widely used Defects4J benchmark show that FaR-Loc outperforms the state-of-the-art LLM-based baselines SoapFL and AutoFL by 14.6% and 9.1% in Top-1 accuracy and by 19.2% and 22.1% in Top-5 accuracy, respectively. It also surpasses all learning-based and spectrum-based baselines across all Top-N metrics without requiring re-training. Furthermore, we find that pre-trained code embedding models that incorporate code structure, such as UniXcoder, can significantly improve fault localization performance by up to 49.0% in Top-1 accuracy. Finally, we conduct a case study to illustrate the effectiveness of FaR-Loc and to provide insights for its practical application.
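The three stages named in the abstract (functionality extraction, dense retrieval, re-ranking) can be sketched roughly as below. This is only an illustration of the control flow, not the authors' implementation: `embed` is a toy bag-of-tokens stand-in for a pre-trained code encoder such as UniXcoder, and `describe_fn` and `score_fn` are hypothetical callables standing in for the two LLM calls. All function names and sample data are assumptions.

```python
import math
import re
from collections import Counter


def embed(text):
    """Toy stand-in for a pre-trained code encoder (ASSUMPTION, not the paper's model).

    Splits identifiers on camelCase boundaries and counts lowercase tokens.
    """
    return Counter(tok.lower() for tok in re.findall(r"[A-Za-z][a-z]*", text))


def cosine(a, b):
    """Cosine similarity between two sparse token-count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(description, covered_methods, k):
    """Return the names of the k covered methods most similar to the description."""
    query = embed(description)
    ranked = sorted(
        covered_methods,
        key=lambda name: cosine(query, embed(covered_methods[name])),
        reverse=True,
    )
    return ranked[:k]


def rerank(description, candidates, covered_methods, score_fn):
    """Reorder candidates by relevance; score_fn is a placeholder for an LLM judgment."""
    return sorted(
        candidates,
        key=lambda name: score_fn(description, covered_methods[name]),
        reverse=True,
    )


def localize(failed_test, stack_trace, covered_methods, describe_fn, score_fn, k=2):
    """Run the three stages in order and return a ranked list of method names."""
    description = describe_fn(failed_test, stack_trace)   # stage 1: functionality extraction
    candidates = retrieve(description, covered_methods, k)  # stage 2: dense retrieval
    return rerank(description, candidates, covered_methods, score_fn)  # stage 3: re-ranking


# Hypothetical sample data: method name -> source text of methods covered by the failed test.
covered = {
    "DateUtils.parseDate": "public Date parseDate(String text) { return parser.parse(text); }",
    "StringUtils.isEmpty": "public boolean isEmpty(String s) { return s == null || s.length() == 0; }",
    "MathUtils.gcd": "public int gcd(int a, int b) { return b == 0 ? a : gcd(b, a % b); }",
}


def fake_describe(failed_test, stack_trace):
    """Canned description; a real system would prompt an LLM here."""
    return "parse date string into Date object"


def fake_score(description, method_source):
    """Canned relevance score; a real system would prompt an LLM here."""
    return 1.0 if "parse" in method_source else 0.0


ranking = localize(
    "testParseDate",
    "java.text.ParseException at DateUtils.parseDate",
    covered,
    fake_describe,
    fake_score,
)
print(ranking)
```

Passing the LLM steps in as callables keeps the sketch runnable offline; swapping `embed` for a real encoder would only change how the vectors are produced, not the retrieve-then-rerank flow.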
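Abstract (reference translation): Fault localization (FL) is an important but time-consuming task in software debugging that aims to identify faulty code elements. Recent advances in large language models (LLMs) have shown promise for FL, but LLMs often struggle with complex systems because they lack project-specific knowledge and have difficulty navigating large projects. To address these limitations, we propose FaR-Loc, a new framework that improves method-level FL by integrating LLMs with retrieval-augmented generation (RAG). FaR-Loc consists of three key components: LLM Functionality Extraction, Semantic Dense Retrieval, and LLM Re-ranking. First, given a failed test and its stack trace, the LLM Functionality Extraction module generates a concise natural language description that captures the failing behavior. Next, the Semantic Dense Retrieval component uses a pre-trained code-understanding encoder to embed both the functionality description (natural language) and the covered methods (code) into a shared semantic space. Finally, the LLM Re-ranking module reorders the retrieved methods based on their contextual relevance. Experiments on the widely used Defects4J benchmark show that FaR-Loc outperforms the state-of-the-art LLM-based baselines SoapFL and AutoFL by 14.6% and 9.1% in Top-1 accuracy and by 19.2% and 22.1% in Top-5 accuracy, respectively. It also surpasses the learning-based and spectrum-based baselines on all Top-N metrics, with no re-training required. Furthermore, pre-trained code embedding models that incorporate code structure, such as UniXcoder, are found to improve fault localization performance substantially, by up to 49.0% in Top-1 accuracy. Finally, we present a case study that illustrates the effectiveness of FaR-Loc and provides insights for its practical application.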
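The abstract reports results as Top-1, Top-5, and Top-N. A minimal sketch of how such a count is commonly computed follows, assuming the usual fault-localization convention that a bug counts as localized at Top-N when at least one of its faulty methods appears among the first N ranked methods. The abstract does not spell out the exact definition used, and the function name and sample data here are hypothetical.

```python
def top_n_hits(rankings, faulty, n):
    """Count bugs whose first n ranked methods include at least one faulty method.

    rankings: bug id -> ordered list of method names (most suspicious first)
    faulty:   bug id -> set of method names known to be faulty
    """
    return sum(1 for bug, ranked in rankings.items() if set(ranked[:n]) & faulty[bug])


# Hypothetical rankings for two bugs.
rankings = {
    "bug-1": ["A.foo", "B.bar", "C.baz"],
    "bug-2": ["X.run", "Y.stop", "Z.init"],
}
faulty = {
    "bug-1": {"B.bar"},
    "bug-2": {"Z.init"},
}

for n in (1, 2, 3):
    print(f"Top-{n}: {top_n_hits(rankings, faulty, n)} of {len(rankings)} bugs localized")
```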

Related papers