Fugu-MT 論文翻訳(概要): A Framework for Scalable Heterogeneous Multi-Agent Adversarial Reinforcement Learning in IsaacLab

論文の概要: A Framework for Scalable Heterogeneous Multi-Agent Adversarial Reinforcement Learning in IsaacLab

arxiv url: http://arxiv.org/abs/2510.01264v1
Date: Fri, 26 Sep 2025 03:16:48 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-03 14:32:17.225516
Title: A Framework for Scalable Heterogeneous Multi-Agent Adversarial Reinforcement Learning in IsaacLab
Title（参考訳）: アイザックラボにおけるスケーラブルな異種多言語対応強化学習フレームワーク
Authors: Isaac Peterson, Christopher Allred, Jacob Morrey, Mario Harper,
Abstract要約: マルチエージェント強化学習(MARL)は、動的環境下で協調するロボットシステムの中心である。我々はIsaacLabフレームワークを拡張し、高忠実度物理シミュレーションにおける敵ポリシーのスケーラブルなトレーニングを支援する。
参考スコア（独自算出の注目度）: 1.5749416770494706
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Multi-Agent Reinforcement Learning (MARL) is central to robotic systems cooperating in dynamic environments. While prior work has focused on these collaborative settings, adversarial interactions are equally critical for real-world applications such as pursuit-evasion, security, and competitive manipulation. In this work, we extend the IsaacLab framework to support scalable training of adversarial policies in high-fidelity physics simulations. We introduce a suite of adversarial MARL environments featuring heterogeneous agents with asymmetric goals and capabilities. Our platform integrates a competitive variant of Heterogeneous Agent Reinforcement Learning with Proximal Policy Optimization (HAPPO), enabling efficient training and evaluation under adversarial dynamics. Experiments across several benchmark scenarios demonstrate the framework's ability to model and train robust policies for morphologically diverse multi-agent competition while maintaining high throughput and simulation realism. Code and benchmarks are available at: https://github.com/DIRECTLab/IsaacLab-HARL .
Abstract（参考訳）: MARL(Multi-Agent Reinforcement Learning)は、動的環境下で協調するロボットシステムの中心である。以前の作業では、これらの協調的な設定に重点を置いていたが、追従回避、セキュリティ、競合操作といった現実世界のアプリケーションでは、敵の相互作用が同じように重要である。本研究はIsaacLabフレームワークを拡張し,高忠実度物理シミュレーションにおける敵ポリシーのスケーラブルなトレーニングを支援する。非対称な目標と能力を持つ異種エージェントを特徴とする対向的MARL環境について紹介する。提案プラットフォームは,HAPPO (Herogeneous Agent Reinforcement Learning) とHAPPO (Proximal Policy Optimization) の競合型を統合し,対向力学による効率的な学習と評価を可能にする。いくつかのベンチマークシナリオに対する実験は、高いスループットとシミュレーションリアリズムを維持しながら、形態的に多様なマルチエージェント競合に対する堅牢なポリシーをモデル化し、トレーニングするフレームワークの能力を実証している。コードとベンチマークは、https://github.com/DIRECTLab/IsaacLab-HARL で公開されている。

論文の概要: A Framework for Scalable Heterogeneous Multi-Agent Adversarial Reinforcement Learning in IsaacLab

関連論文リスト