Fugu-MT Paper Translation (Abstract): Empirical Study on Robustness and Resilience in Cooperative Multi-Agent Reinforcement Learning

Paper summary: Empirical Study on Robustness and Resilience in Cooperative Multi-Agent Reinforcement Learning

arxiv url: http://arxiv.org/abs/2510.11824v1
Date: Mon, 13 Oct 2025 18:24:01 GMT
Status: Translation complete
Last updated in system: 2025-10-15 19:02:32.058431
Title: Empirical Study on Robustness and Resilience in Cooperative Multi-Agent Reinforcement Learning
Title (reference translation): An Empirical Study on Robustness and Resilience in Cooperative Multi-Agent Reinforcement Learning
Authors: Simin Li, Zihao Mao, Hanxiao Li, Zonglei Jing, Zhuohang Bian, Jun Guo, Li Wang, Zhuoran Han, Ruixiao Xu, Xin Yu, Chengdong Ma, Yuqing Ma, Bo An, Yaodong Yang, Weifeng Lv, Xianglong Liu
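Abstract summary: Building trustworthy multi-agent reinforcement learning systems requires an understanding of robustness. We conduct a large-scale empirical study comprising over 82,620 experiments to evaluate cooperation, robustness, and resilience in MARL.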
Reference score (independently computed attention score): 37.910012648322265
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: In cooperative Multi-Agent Reinforcement Learning (MARL), it is common practice to tune hyperparameters in ideal simulated environments to maximize cooperative performance. However, policies tuned for cooperation often fail to maintain robustness and resilience under real-world uncertainties. Building trustworthy MARL systems requires a deep understanding of robustness, which ensures stability under uncertainties, and resilience, the ability to recover from disruptions (a concept extensively studied in control systems but largely overlooked in MARL). In this paper, we present a large-scale empirical study comprising over 82,620 experiments to evaluate cooperation, robustness, and resilience in MARL across 4 real-world environments, 13 uncertainty types, and 15 hyperparameters. Our key findings are: (1) Under mild uncertainty, optimizing cooperation improves robustness and resilience, but this link weakens as perturbations intensify. Robustness and resilience also vary by algorithm and uncertainty type. (2) Robustness and resilience do not generalize across uncertainty modalities or agent scopes: policies robust to action noise for all agents may fail under observation noise on a single agent. (3) Hyperparameter tuning is critical for trustworthy MARL: surprisingly, standard practices like parameter sharing, GAE, and PopArt can hurt robustness, while early stopping, high critic learning rates, and Leaky ReLU consistently help. By optimizing hyperparameters only, we observe substantial improvement in cooperation, robustness, and resilience across all MARL backbones, with the phenomenon also generalizing to robust MARL methods across these backbones. Code and results are available at https://github.com/BUAA-TrustworthyMARL/adv_marl_benchmark .
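Abstract (reference translation): In cooperative Multi-Agent Reinforcement Learning (MARL), it is common to tune hyperparameters in ideal simulated environments to maximize cooperative performance. However, policies tuned for cooperation often fail to maintain robustness and resilience under real-world uncertainties. Building trustworthy MARL systems requires a deep understanding of robustness, which ensures stability under uncertainty, and resilience, the ability to recover from disruptions. In this paper, we conduct a large-scale empirical study comprising over 82,620 experiments to evaluate cooperation, robustness, and resilience in MARL across 4 real-world environments, 13 uncertainty types, and 15 hyperparameters. (1) Under mild uncertainty, optimizing cooperation improves robustness and resilience, but this link weakens as perturbations intensify. Robustness and resilience also vary by algorithm and uncertainty type. (2) Robustness and resilience do not generalize across uncertainty modalities or agent scopes: a policy robust to action noise on all agents may fail under observation noise on a single agent. (3) Hyperparameter tuning is critical for trustworthy MARL: surprisingly, standard practices such as parameter sharing, GAE, and PopArt can hurt robustness, while early stopping, high critic learning rates, and Leaky ReLU consistently help. By optimizing hyperparameters alone, cooperation, robustness, and resilience improve substantially across all MARL backbones, and this also generalizes to robust MARL methods built on these backbones. Code and results are available at https://github.com/BUAA-TrustworthyMARL/adv_marl_benchmark .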
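Finding (2) distinguishes the modality of a perturbation (observation vs. action) from its scope (one agent vs. all agents). The following is a minimal sketch of that distinction only, not the paper's benchmark code: the function names, the dictionary-based agent layout, the Gaussian observation noise, and the uniform random action replacement are all assumptions chosen for illustration.

```python
import random
from typing import Dict, List, Set

Obs = Dict[str, List[float]]  # hypothetical layout: agent name -> observation vector
Acts = Dict[str, int]         # hypothetical layout: agent name -> discrete action index


def perturb_observations(obs: Obs, sigma: float, targets: Set[str],
                         rng: random.Random) -> Obs:
    """Add zero-mean Gaussian noise to the observations of targeted agents only."""
    return {
        agent: [x + rng.gauss(0.0, sigma) for x in vec] if agent in targets else list(vec)
        for agent, vec in obs.items()
    }


def perturb_actions(actions: Acts, n_actions: int, flip_prob: float,
                    targets: Set[str], rng: random.Random) -> Acts:
    """With probability flip_prob, replace a targeted agent's action by a uniform random one."""
    return {
        agent: rng.randrange(n_actions)
        if agent in targets and rng.random() < flip_prob else act
        for agent, act in actions.items()
    }


rng = random.Random(0)
obs = {"agent_0": [0.1, 0.2], "agent_1": [0.3, 0.4], "agent_2": [0.5, 0.6]}
actions = {"agent_0": 1, "agent_1": 0, "agent_2": 2}

# Modality: observation noise. Scope: a single agent.
noisy_obs = perturb_observations(obs, sigma=0.5, targets={"agent_1"}, rng=rng)

# Modality: action noise. Scope: all agents.
noisy_actions = perturb_actions(actions, n_actions=3, flip_prob=0.3,
                                targets=set(actions), rng=rng)
```

Keeping `targets` as an explicit argument lets the same policy be evaluated under every combination of modality and scope, which is the kind of comparison the finding refers to.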


Related papers