Fugu-MT 論文翻訳(概要): iCoRe: An Iterative Correlation-Aware Retriever for Bug Reproduction Test Generation

論文の概要: iCoRe: An Iterative Correlation-Aware Retriever for Bug Reproduction Test Generation

arxiv url: http://arxiv.org/abs/2604.19224v1
Date: Tue, 21 Apr 2026 08:26:30 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-22 22:41:49.689193
Title: iCoRe: An Iterative Correlation-Aware Retriever for Bug Reproduction Test Generation
Title（参考訳）: iCoRe: バグ再現テスト生成のための反復的相関対応レトリバー
Authors: Junyi Wang, Jialun Cao, Zhongxin Liu,
Abstract要約: iCoReは反復的相関型コンテキスト検索手法である。本手法はFail-to-Passレートをそれぞれ42.0%,52.8%とする。
参考スコア（独自算出の注目度）: 10.824024745675281
License: http://creativecommons.org/licenses/by-nc-nd/4.0/
Abstract: Automatically generating bug reproduction tests (BRT) from issue descriptions is crucial for software maintenance. LLM-based approaches have shown great potential for this task. Their effectiveness heavily relies on retrieving high-quality context from the codebase. The retrieval phase of existing approaches relies on either traditional methods like BM25 or LLM-driven strategies. LLM-based retrieval strategies typically equip an LLM with tools to autonomously explore the repository or select the most relevant files and code snippets from a provided list as context. However, these retrieval methods suffer from three key limitations: 1) They often employ a unified strategy for retrieving both source code and test cases, overlooking their distinct retrieval requirements. 2) They focus solely on semantic similarity while ignoring function call relationships, leading to irrelevant context. 3) The retrieval lacks a feedback loop from the generation phase, preventing it from refining the context based on execution results. These limitations collectively result in low-quality context, thereby hindering the accuracy of bug reproduction. To address these challenges, we propose iCoRe, an iterative, correlation-aware context retrieval approach explicitly aware of three key correlations: 1) between source code and test cases, which requires differentiated retrieval, 2) between textual semantics and function call structures for accurate relevance assessment, and 3) between the retrieval and generation phases, which enables iterative feedback and refinement. To evaluate iCoRe, we integrate it with an LLM-based BRT generator and conduct a comprehensive evaluation on the SWT-bench Lite and TDD-bench Verified benchmarks. Experimental results show that our method achieves a Fail-to-Pass rate of 42.0% and 52.8% respectively, representing 19.7%-31.7% relative improvements over existing retrieval methods.
Abstract（参考訳）: 問題記述からバグ再現テスト(BRT)を自動的に生成することは、ソフトウェアのメンテナンスに不可欠である。 LLMベースのアプローチは、このタスクに大きな可能性を示している。その効果はコードベースから高品質なコンテキストを取得することに大きく依存しています。既存のアプローチの検索フェーズは、BM25 や LLM による戦略のような従来の手法に依存している。 LLMベースの検索戦略は通常、LLMにリポジトリを自律的に探索したり、提供されたリストから最も関連性の高いファイルやコードスニペットをコンテキストとして選択するためのツールを装備する。しかし、これらの検索方法には3つの制限がある。 1) ソースコードとテストケースの両方を検索するための統一的な戦略をしばしば採用する。 2)機能コール関係を無視しながら意味的類似性にのみ焦点を合わせ,無関係な文脈へとつながる。 3)検索には生成フェーズからのフィードバックループが欠如しており,実行結果に基づいてコンテキストを書き換えることができない。これらの制限は全体として、品質の低いコンテキストをもたらすため、バグの再現の精度を損なう。これらの課題に対処するため、我々は3つの重要な相関関係を明確に認識する反復的相関対応コンテキスト検索手法iCoReを提案する。 1) ソースコードとテストケースを区別した検索を必要とする。 2) 正確な関連性評価のためのテキスト意味論と機能呼び出し構造 3) 検索フェーズと生成フェーズの間には, 反復的なフィードバックと改善が可能である。 iCoReを評価するために、LLMベースのBRTジェネレータと統合し、SWT-bench LiteとTDD-bench Verifiedベンチマークで包括的な評価を行う。実験の結果,既存の検索手法と比較して,Fail-to-Passレートが42.0%,Fail-to-Passレートが52.8%,19.7%～31.7%向上したことがわかった。

論文の概要: iCoRe: An Iterative Correlation-Aware Retriever for Bug Reproduction Test Generation

関連論文リスト