Fugu-MT 論文翻訳(概要): MAS-FIRE: Fault Injection and Reliability Evaluation for LLM-Based Multi-Agent Systems

論文の概要: MAS-FIRE: Fault Injection and Reliability Evaluation for LLM-Based Multi-Agent Systems

arxiv url: http://arxiv.org/abs/2602.19843v1
Date: Mon, 23 Feb 2026 13:47:43 GMT
ステータス: 翻訳完了
システム内更新日: 2026-02-24 17:42:02.833275
Title: MAS-FIRE: Fault Injection and Reliability Evaluation for LLM-Based Multi-Agent Systems
Title（参考訳）: MAS-FIRE: LLMに基づくマルチエージェントシステムの故障注入と信頼性評価
Authors: Jin Jia, Zhiling Deng, Zhuangbin Chen, Yingqi Wang, Zibin Zheng,
Abstract要約: マルチエージェントシステムの障害注入と信頼性評価のための体系的フレームワークMAS-FIREを提案する。エージェント内認知障害とエージェント間協調障害を対象とする15種類の障害分類を定義した。 MAS-FIREを3つの代表的なMASアーキテクチャに適用することにより、フォールトトレラントな動作の豊富なセットを明らかにする。
参考スコア（独自算出の注目度）: 38.44649280816596
License: http://creativecommons.org/licenses/by-nc-nd/4.0/
Abstract: As LLM-based Multi-Agent Systems (MAS) are increasingly deployed for complex tasks, ensuring their reliability has become a pressing challenge. Since MAS coordinate through unstructured natural language rather than rigid protocols, they are prone to semantic failures (e.g., hallucinations, misinterpreted instructions, and reasoning drift) that propagate silently without raising runtime exceptions. Prevailing evaluation approaches, which measure only end-to-end task success, offer limited insight into how these failures arise or how effectively agents recover from them. To bridge this gap, we propose MAS-FIRE, a systematic framework for fault injection and reliability evaluation of MAS. We define a taxonomy of 15 fault types covering intra-agent cognitive errors and inter-agent coordination failures, and inject them via three non-invasive mechanisms: prompt modification, response rewriting, and message routing manipulation. Applying MAS-FIRE to three representative MAS architectures, we uncover a rich set of fault-tolerant behaviors that we organize into four tiers: mechanism, rule, prompt, and reasoning. This tiered view enables fine-grained diagnosis of where and why systems succeed or fail. Our findings reveal that stronger foundation models do not uniformly improve robustness. We further show that architectural topology plays an equally decisive role, with iterative, closed-loop designs neutralizing over 40% of faults that cause catastrophic collapse in linear workflows. MAS-FIRE provides the process-level observability and actionable guidance needed to systematically improve multi-agent systems.
Abstract（参考訳）: LLMベースのMulti-Agent Systems (MAS) は複雑なタスクにますますデプロイされているため、信頼性の確保が課題となっている。 MASは厳密なプロトコルではなく、構造化されていない自然言語を通してコーディネートするため、実行時例外を発生させることなく静かに伝播する意味障害(例えば、幻覚、誤解釈命令、推論ドリフト)が生じる傾向がある。エンドツーエンドのタスクの成功のみを測定する一般的な評価アプローチは、これらの失敗の発生方法や、エージェントがいかに効果的にタスクから回復するかについて、限られた洞察を提供する。このギャップを埋めるため,MASの故障注入と信頼性評価のための体系的枠組みであるMAS-FIREを提案する。我々は,エージェント内認知異常とエージェント間協調障害をカバーする15種類の障害の分類を定義し,迅速な修正,応答書き換え,メッセージルーティング操作という3つの非侵襲的なメカニズムを通じてそれらを注入する。 MAS-FIREを3つの代表的なMASアーキテクチャに適用することにより、私たちが構成するフォールトトレラントな動作の豊富なセットが、メカニズム、ルール、プロンプト、推論の4つの階層にまとめられます。この階層ビューは、システムの成功と失敗の理由を詳細に診断することを可能にする。以上の結果から,基礎モデルが強靭性を均一に改善しないことが明らかとなった。さらに、アーキテクチャトポロジが同様に決定的な役割を果たすことを示し、線形ワークフローにおいて破滅的な崩壊を引き起こす障害の40%以上を、反復的でクローズドループの設計が中和することを示した。 MAS-FIREは、マルチエージェントシステムを体系的に改善するために必要なプロセスレベルの可観測性と実行可能なガイダンスを提供する。

論文の概要: MAS-FIRE: Fault Injection and Reliability Evaluation for LLM-Based Multi-Agent Systems

関連論文リスト