Fugu-MT 論文翻訳(概要): StepFinder: A Temporal Semantic Framework for Failure Attribution in Multi-Agent Systems

論文の概要: StepFinder: A Temporal Semantic Framework for Failure Attribution in Multi-Agent Systems

arxiv url: http://arxiv.org/abs/2606.03467v1
Date: Tue, 02 Jun 2026 10:45:49 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-03 22:00:04.946997
Title: StepFinder: A Temporal Semantic Framework for Failure Attribution in Multi-Agent Systems
Title（参考訳）: StepFinder: マルチエージェントシステムにおける障害帰属のための時間的セマンティックフレームワーク
Authors: Taiyu Zhu, Yifan Wu, Weilin Jin, Ying Li, Gang Huang,
Abstract要約: 失敗帰属(Failure Attribution)は、障害の原因となる根本原因のステップを自動的に特定することを目的としたタスクである。既存のフェール帰属法は主に、元の実行軌跡を推論するためにLLMに依存している。我々は、軽量な障害属性フレームワークであるStepFinderを提案する。
参考スコア（独自算出の注目度）: 11.500948775496218
License: http://creativecommons.org/licenses/by/4.0/
Abstract: LLM-based multi-agent systems exhibit remarkable collaborative capabilities in complex multi-step tasks. However, these systems are highly sensitive to single-step execution errors that can propagate through agent interactions and lead to cascading failures. To understand the causes of failure and improve system reliability, failure attribution has been introduced as a task that aims to automatically identify the root cause step responsible for a failure. Existing failure attribution methods mainly rely on LLMs to reason over original execution trajectories, which not only incur high inference costs and latency, but also suffer from interference caused by redundant and noisy execution logs, causing LLMs to struggle in accurately identifying the true root cause step. To address this, we propose StepFinder, a lightweight failure attribution framework. We use LLMs solely during the feature construction phase to encode execution logs into temporal semantic sequences. Subsequently, a parameter-efficient combination of temporal modeling and attention modules is applied to capture the sequential evolution and cross-step dependencies of the trajectories. Finally, the step-level error score is refined through multi-scale differences and position bias, enabling precise root cause identification. Experimental results on the Who&When benchmark demonstrate that StepFinder outperforms LLM-based methods in step-level failure attribution while achieving substantially higher inference efficiency, reducing inference time by 79% compared with the fastest LLM-based method, with no text generation overhead. Our code is available at https://github.com/taiyu-zhu/StepFinder.
Abstract（参考訳）: LLMベースのマルチエージェントシステムは、複雑なマルチステップタスクにおいて顕著な協調機能を示す。しかし、これらのシステムは単一ステップの実行エラーに非常に敏感であり、エージェント間相互作用を通じて伝播し、カスケード障害を引き起こす。障害の原因を理解し、システムの信頼性を向上させるために、障害の原因となる根本原因のステップを自動的に特定するタスクとして、障害帰属が導入された。既存のフェールアトリビューション手法は、主にLSMに頼って、推論コストと遅延を発生させるだけでなく、冗長でノイズの多い実行ログによる干渉に悩まされ、LSMは真の根本原因のステップを正確に特定するのに苦労する。これを解決するために、軽量な障害帰属フレームワークであるStepFinderを提案する。機能構築段階でのみLLMを使用して,実行ログを時間的意味シーケンスにエンコードする。その後、時間的モデリングとアテンションモジュールのパラメータ効率を併用して、軌道の逐次的進化とステップ間の依存関係をキャプチャする。最後に、ステップレベルの誤差スコアをマルチスケールの差分と位置バイアスによって洗練し、正確な根本原因同定を可能にする。 Who&Whenベンチマークの実験結果によると、StepFinderはステップレベルの失敗帰属法よりも高い推論効率を実現し、テキスト生成オーバーヘッドのない高速なLCM法と比較して推論時間を79%削減する。私たちのコードはhttps://github.com/Taiyu-zhu/StepFinder.comから入手可能です。

論文の概要: StepFinder: A Temporal Semantic Framework for Failure Attribution in Multi-Agent Systems

関連論文リスト