Fugu-MT Paper Translation (Abstract): SelfHeal: Empirical Fix Pattern Analysis and Bug Repair in LLM Agents

Paper overview: SelfHeal: Empirical Fix Pattern Analysis and Bug Repair in LLM Agents

arxiv url: http://arxiv.org/abs/2604.17699v1
Date: Mon, 20 Apr 2026 01:28:15 GMT
Status: Translation complete
Last updated in system: 2026-04-21 21:52:52.64599
Title: SelfHeal: Empirical Fix Pattern Analysis and Bug Repair in LLM Agents
Title (reference translation): Self-Healing: Fix Pattern Analysis and Bug Repair in LLM Agents
Authors: Niful Islam, Muhammad Anas Raza, Mohammad Wardat
Abstract summary: We study buggy posts and code snippets from Stack Overflow, GitHub, and HuggingFace Forums. We introduce AgentDefect, the first benchmark dataset for bugs in LLM agents. We propose SelfHeal, a multi-agent system designed to fix bugs in LLM agents.
Reference score (independently computed attention score): 3.8743350688734988
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Large Language Models (LLMs) have transformed software development and AI applications. While LLMs are designed for text processing, LLM agents extend this capability by enabling autonomous actions, tool use, and multi-step task completion. As this field grows, developers face new challenges in debugging these complex systems. To address this challenge, we present the first empirical study on bug fix patterns in LLM agents. We study buggy posts and code snippets from three platforms: Stack Overflow, GitHub, and HuggingFace Forums. We examine their fix patterns, the components where fixes are applied, and the programming languages and frameworks involved. Furthermore, we introduce AgentDefect, the first benchmark dataset for bugs in LLM agents. The dataset contains 37 runtime buggy instances along with fixed code and test files. Finally, we present SelfHeal, a multi-agent system designed to fix bugs in LLM agents. The system leverages two independent ReAct agents: the fix agent and the critic agent. These agents use tools that provide both internal knowledge (fix rules) and external knowledge (web search) to propose and validate fixes. Our evaluation shows that SelfHeal with Gemini 3 Pro as the backbone LLM outperforms both baseline and state-of-the-art approaches by a significant margin.
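Abstract (reference translation): Large language models (LLMs) have transformed software development and AI applications. Although LLMs are designed for text processing, LLM agents extend this capability by enabling autonomous actions, tool use, and multi-step task completion. As the field grows, developers face new challenges in debugging these complex systems. To address this challenge, we conduct the first empirical study of bug fix patterns in LLM agents. We study buggy posts and code snippets from three platforms: Stack Overflow, GitHub, and HuggingFace Forums. We examine the fix patterns, the components where fixes are applied, and the programming languages and frameworks involved. In addition, we introduce AgentDefect, the first benchmark dataset for bugs in LLM agents. The dataset contains 37 instances with runtime bugs, along with fixed code and test files. Finally, we present SelfHeal, a multi-agent system designed to fix bugs in LLM agents. The system uses two independent ReAct agents: a fix agent and a critic agent. These agents use tools that provide both internal knowledge (fix rules) and external knowledge (web search) to propose and validate fixes. With Gemini 3 Pro as the backbone LLM, SelfHeal outperforms both baseline and state-of-the-art approaches by a significant margin.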
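
Illustrative sketch (not from the paper): the abstract describes a fix agent and a critic agent that use an internal-knowledge tool (fix rules) and an external-knowledge tool (web search) to propose and validate fixes. The Python below is one possible shape for such a propose-then-validate loop. Every name in it (`FIX_RULES`, `lookup_fix_rule`, `web_search`, `fix_agent`, `critic_agent`, `repair`) is hypothetical. Both agents are deterministic stand-ins rather than LLM-backed ReAct agents, and the search tool is a stub, so this shows only the control flow, not the authors' implementation.

```python
from dataclasses import dataclass
from typing import Callable, Optional, Tuple

# Hypothetical "internal knowledge": error symptom -> fix rule text.
FIX_RULES = {
    "KeyError": "Guard dictionary access with .get() and a default value.",
    "timeout": "Add a retry with a bounded number of attempts.",
}

def lookup_fix_rule(error_message: str) -> Optional[str]:
    """Internal-knowledge tool: first rule whose symptom appears in the error."""
    for symptom, rule in FIX_RULES.items():
        if symptom.lower() in error_message.lower():
            return rule
    return None

def web_search(query: str) -> str:
    """External-knowledge tool: a stub standing in for a real search call."""
    return f"(no live search in this sketch) query={query!r}"

@dataclass
class Proposal:
    patched_code: str
    rationale: str

def fix_agent(buggy_code: str, error_message: str, feedback: str) -> Proposal:
    """Stand-in for the fix agent: consult the tools, then propose a patch.
    A real agent would let an LLM write the patch; this one hard-codes it."""
    rule = lookup_fix_rule(error_message) or web_search(error_message)
    patched = buggy_code.replace('config["model"]', 'config.get("model", "default")')
    return Proposal(patched_code=patched, rationale=f"{rule} {feedback}".strip())

def critic_agent(proposal: Proposal, run_test: Callable[[str], bool]) -> Tuple[bool, str]:
    """Stand-in for the critic agent: validate the patch by running a test."""
    if run_test(proposal.patched_code):
        return True, "accepted"
    return False, "test still fails; try a different change"

def repair(buggy_code: str, error_message: str,
           run_test: Callable[[str], bool], max_rounds: int = 3) -> Optional[Proposal]:
    """Alternate proposing and critiquing until accepted or rounds run out."""
    feedback = ""
    for _ in range(max_rounds):
        proposal = fix_agent(buggy_code, error_message, feedback)
        accepted, feedback = critic_agent(proposal, run_test)
        if accepted:
            return proposal
    return None

# Usage: a toy buggy snippet and a test that executes the candidate patch.
BUGGY = 'config = {}\nmodel = config["model"]\n'

def run_test(code: str) -> bool:
    scope = {}
    try:
        exec(code, scope)
    except Exception:
        return False
    return scope.get("model") == "default"

result = repair(BUGGY, "KeyError: 'model'", run_test)
```

In this sketch the critic accepts a patch only when the test passes, which loosely mirrors the fixed code and test files that the abstract says accompany each AgentDefect instance. How the actual critic agent decides is not specified in the abstract.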

Related papers