Fugu-MT 論文翻訳(概要): MASPrism: Lightweight Failure Attribution for Multi-Agent Systems Using Prefill-Stage Signals

論文の概要: MASPrism: Lightweight Failure Attribution for Multi-Agent Systems Using Prefill-Stage Signals

arxiv url: http://arxiv.org/abs/2605.07509v2
Date: Thu, 14 May 2026 05:20:24 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-16 00:43:04.047634
Title: MASPrism: Lightweight Failure Attribution for Multi-Agent Systems Using Prefill-Stage Signals
Title（参考訳）: MASPrism:プリフィル信号を用いたマルチエージェントシステムにおける軽量故障属性
Authors: Yang Liu, Hongjiang Feng, Junsong Pu, Zhuangbin Chen,
Abstract要約: 我々は,小言語モデル(SLM)のプリフィルステージ信号を用いて,障害帰属を行うフレームワークであるMASPrismを提案する。 MASPrismは各トレースを平均2.66秒で処理し、単一パスのプロンプトベースラインを6.69$times$スピードアップし、出力トークンをゼロにする。
参考スコア（独自算出の注目度）: 5.326315684098781
License: http://creativecommons.org/licenses/by-nc-nd/4.0/
Abstract: Failure attribution in LLM-based multi-agent systems aims to identify the steps that contribute to a failed execution. This task remains difficult because a single execution can contain many agent actions and tool calls, failure evidence can appear many steps after the original mistake, and existing methods often rely on costly agent workflows, replay, or training on synthetic failure logs. To address these challenges, we propose MASPrism, a lightweight framework that performs failure attribution using prefill-stage signals from a small language model (SLM). MASPrism first extracts token-level negative log-likelihood and attention weights during a prefill pass to identify symptom-like steps and earlier candidate sources, without decoding. It then reconstructs a focused diagnostic prompt and performs a second prefill pass to rank failure-source candidates. Using Qwen3-0.6B as the SLM, MASPrism achieves the best performance on three of the four evaluated subsets across Who&When and TRAIL, improving Top-1 accuracy on Who&When-HC by 33.41% over the best baseline. On TRAIL, MASPrism outperforms strong proprietary LLMs, including Gemini-2.5-Pro, with up to 89.50% relative improvement. MASPrism processes each trace in 2.66 seconds on average, achieving a 6.69$\times$ speedup over the single-pass prompting baseline, with zero output tokens. These results show that MASPrism provides an effective and practical framework for failure attribution in long multi-agent execution logs.
Abstract（参考訳）: LLMベースのマルチエージェントシステムにおけるフェールアトリビューションは、フェール実行に寄与するステップを特定することを目的としている。このタスクは、単一の実行に多くのエージェントアクションやツールコールが含まれ、失敗のエビデンスが元のミスの後、多くのステップで現れるため、難しいままです。これらの課題に対処するために,小型言語モデル (SLM) のプリフィルステージ信号を用いて,障害帰属を行う軽量フレームワーク MASPrism を提案する。 MASPrismは、プリフィルパス中にトークンレベルの負のログ類似度と注意重みを抽出し、デコードすることなく、症状のようなステップや初期の候補ソースを特定する。その後、集中した診断プロンプトを再構築し、失敗ソース候補をランク付けするための第2のプリフィルパスを実行する。 Qwen3-0.6BをSLMとして使用することにより、MASPrismはWho&WhenとTRAILの4つの評価されたサブセットの中で最高のパフォーマンスを達成し、Who&When-HCのTop-1精度を33.41%向上させた。 TRAIL では、MASPrism は Gemini-2.5-Pro を含む強力なプロプライエタリ LLM よりも89.50% 向上している。 MASPrismは各トレースを平均2.66秒で処理し、単一パスのプロンプトベースラインを6.69$\times$スピードアップし、出力トークンをゼロにする。これらの結果から,MASPrismは長時間のマルチエージェント実行ログにおいて,障害帰属のための効果的かつ実用的なフレームワークを提供することが示された。

論文の概要: MASPrism: Lightweight Failure Attribution for Multi-Agent Systems Using Prefill-Stage Signals

関連論文リスト