Fugu-MT 論文翻訳(概要): SiblingRepair: Sibling-Based Multi-Hunk Repair with Large Language Models

論文の概要: SiblingRepair: Sibling-Based Multi-Hunk Repair with Large Language Models

arxiv url: http://arxiv.org/abs/2605.06209v1
Date: Thu, 07 May 2026 13:14:36 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-08 22:27:11.81974
Title: SiblingRepair: Sibling-Based Multi-Hunk Repair with Large Language Models
Title（参考訳）: SiblingRepair: 大規模言語モデルによる兄弟ベースマルチハンク修復
Authors: Xinyu Liu, Jiayu Ren, Yusen Wang, Qi Xin, Xiaoyuan Xie, Jifeng Xuan,
Abstract要約: 兄弟修復に特化したLLMベースのマルチハンクAPR技術であるSiblingRepairを提案する。不審な場所から始まるSiblingRepairは、トークンと埋め込みベースのコードマッチングを使用して、意味的に関連する兄弟候補を検索する。次にLLMを使用して、障害関連兄弟を識別し、2つの補完的な戦略を通じて一貫したパッチを生成する。
参考スコア（独自算出の注目度）: 8.90705329392185
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Developers often make similar mistakes across code locations implementing related functionalities. These locations, called siblings, share similar issues and require similar fixes. Accurately identifying siblings and consistently repairing them are crucial for automated program repair. Hercules is a SOTA technique designed for sibling repair. However, it is limited by strong assumptions about sibling locations and commit-history availability, rigid AST-based sibling matching, and inflexible template-based patch generation. To address these limitations, we present SiblingRepair, a new LLM-based multi-hunk APR technique specialized for sibling repair. Starting from a suspicious location identified by spectrum-based fault localization, SiblingRepair searches for semantically related sibling candidates using token- and embedding-based code matching, without restricting discovery to failing-test coverage or commit history. It then uses an LLM to identify failure-relevant siblings and generate consistent patches through two complementary strategies: simultaneous repair, which jointly repairs siblings, and iterative repair, which progressively analyzes candidates for patch construction. SiblingRepair further preserves promising patches generated from earlier suspicious locations and combines them into generalized multi-hunk patches. We evaluate SiblingRepair on the Defects4J and GHRB benchmarks. The results show that SiblingRepair substantially outperforms SOTA multi-hunk repair techniques including Hercules. Our evaluation further demonstrates its repair efficiency, the effectiveness of its sibling detection and repair components, and limited impact of the LLM data leakage on the results. Overall, SiblingRepair advances automated sibling and general multi-hunk repair.
Abstract（参考訳）: 開発者はしばしば、関連する機能を実装するコード場所間で同様の間違いを犯す。これらの場所は兄弟姉妹と呼ばれ、同様の問題を共有し、同様の修正が必要である。兄弟姉妹を正確に識別し、継続的に修復することは、プログラムの自動修復に不可欠である。 Herculesは、兄弟の修理用に設計されたSOTA技術である。しかし、兄弟姉妹の位置とコミット履歴の可用性、堅固なASTベースの兄弟マッチング、柔軟性のないテンプレートベースのパッチ生成に関する強い仮定によって制限されている。これらの制約に対処するために、兄弟修復に特化した新しいLLMベースのマルチハンクAPR技術であるSiblingRepairを提案する。スペクトルベースの障害ローカライゼーションによって特定された疑わしい位置から始まるSiblingRepairは、トークンと埋め込みベースのコードマッチングを使用して、発見をフェールテストカバレッジやコミット履歴に制限することなく、意味的に関連する兄弟候補を検索する。次に、LCMを使用して障害関連兄弟を識別し、2つの補完的な戦略により一貫したパッチを生成する。 SiblingRepairはさらに、以前に不審な場所から生成された有望なパッチを保存し、それらを一般化されたマルチハンクパッチに結合する。 The Defects4J and GHRB benchmarks on the SiblingRepair on the Defects4J and GHRB benchmarks。その結果,SyblingRepairはHerculesを含むSOTAマルチハンク補修技術よりも大幅に優れていた。さらに, その修復効率, 兄弟検出および補修部品の有効性, およびLCMデータ漏洩が結果に与える影響について検討した。全体として、SiblingRepairは自動シブリングと一般的なマルチハンク修復を前進させる。

論文の概要: SiblingRepair: Sibling-Based Multi-Hunk Repair with Large Language Models

関連論文リスト