Fugu-MT Paper Translation (Abstract): PracRepair: LLM-Empowered Automated Program Repair Inspired by Human-Like Debugging Practices

Paper overview: PracRepair: LLM-Empowered Automated Program Repair Inspired by Human-Like Debugging Practices

arxiv url: http://arxiv.org/abs/2606.17612v1
Date: Tue, 16 Jun 2026 07:18:37 GMT
Status: Translation complete
System update date: 2026-06-17 17:15:32.327768
Title: PracRepair: LLM-Empowered Automated Program Repair Inspired by Human-Like Debugging Practices
Title (reference translation): PracRepair: LLM-Based Automated Program Repair Inspired by Human-Like Debugging Practices
Authors: Yu Cheng, Zhongxin Liu, Zhenchang Xing, Chao Ni, Qing Huang, Xiaoxue Ren,
Abstract summary: PracRepair is a fully automated program repair framework inspired by human-like debugging practices. It consistently outperforms state-of-the-art baselines and generalizes effectively to RWB (Real-World Bugs).
Reference score (independently computed attention score): 22.432182416621917
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: As software systems grow in scale and complexity, debugging and repair remain costly and time-consuming. Large language models (LLMs) have advanced automated program repair (APR), but existing LLM-based APR approaches still largely rely on static or retrieved context, error messages, and coarse-grained validation outcomes. As a result, they underutilize dynamic information for failure understanding and repair, including failure-execution dynamics and patch-validation dynamics. Effectively leveraging such information, however, is challenging: failure-execution traces are large and noisy, raw static-dynamic context is not self-explanatory, and patch-validation dynamics are often reduced to coarse feedback. To address these challenges, we propose PracRepair, a fully automated LLM-based APR framework inspired by human-like debugging practices. PracRepair constructs an on-demand static-dynamic context from buggy programs and failure executions, performs question-driven failure diagnosis to formulate explicit repair hypotheses, and iteratively refines candidate patches using validation diagnostics and trace-level behavioral changes. Experimental results on Defects4J V1.2 and V2.0 show that PracRepair consistently outperforms state-of-the-art baselines. Specifically, under GPT-3.5, PracRepair correctly fixes 139/136 bugs on Defects4J V1.2/V2.0, while under GPT-4o it further improves to 162/171. Moreover, PracRepair generalizes effectively to RWB (Real-World Bugs), achieving the best performance across multiple foundation models.
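Abstract (reference translation): As software systems grow in scale and complexity, debugging and repair remain costly and time-consuming. Large language models (LLMs) have advanced automated program repair (APR), but existing LLM-based APR approaches still rely heavily on static or retrieved context, error messages, and coarse-grained validation results. As a result, they underuse dynamic information for understanding and repairing failures, such as failure-execution dynamics and patch-validation dynamics. Using such information effectively is difficult, however: failure-execution traces are large and noisy, raw static-dynamic context is not self-explanatory, and patch-validation dynamics are often reduced to coarse feedback. To address these challenges, we propose PracRepair, a fully automated LLM-based APR framework inspired by human-like debugging practices. PracRepair builds on-demand static-dynamic context from buggy programs and failure executions, performs question-driven failure diagnosis to formulate explicit repair hypotheses, and iteratively refines candidate patches using validation diagnostics and trace-level behavioral changes. Experiments on Defects4J V1.2 and V2.0 show that PracRepair consistently outperforms state-of-the-art baselines. Specifically, with GPT-3.5, PracRepair correctly fixes 139 and 136 bugs on Defects4J V1.2 and V2.0 respectively, and with GPT-4o this improves to 162 and 171. PracRepair also generalizes effectively to RWB (Real-World Bugs), achieving the best performance across multiple foundation models.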


Related papers