Fugu-MT 論文翻訳(概要): MuJoCo-Drones-Gym: A GPU-Accelerated Multi-Drone Simulator for Control and Reinforcement Learning

論文の概要: MuJoCo-Drones-Gym: A GPU-Accelerated Multi-Drone Simulator for Control and Reinforcement Learning

arxiv url: http://arxiv.org/abs/2606.08039v1
Date: Sat, 06 Jun 2026 07:59:45 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-09 14:42:05.685229
Title: MuJoCo-Drones-Gym: A GPU-Accelerated Multi-Drone Simulator for Control and Reinforcement Learning
Title（参考訳）: MuJoCo-Drones-Gym: 制御と強化学習のためのGPU加速マルチDroneシミュレータ
Authors: Manan Tayal,
Abstract要約: MuJoCo-Drones-Gymは、MuJoCo物理エンジン上に構築されたオープンソースのGymnasium互換のマルチドローン環境である。環境設計、基礎となる物理、クワッドコプターの力学について述べ、制御と学習の例を通してその使い方を説明する。
参考スコア（独自算出の注目度）: 0.0
License: http://creativecommons.org/licenses/by-sa/4.0/
Abstract: Robotic simulators are a cornerstone of modern research in aerial robotics, serving both as a vehicle for the development of new control algorithms and as the data source for training reinforcement learning (RL) policies. Yet, existing quadcopter learning environments often face a trade-off between physical fidelity, multi-agent support, and the throughput required by modern deep RL pipelines. In this paper, we present MuJoCo-Drones-Gym, an open-source Gymnasium-compatible multi-drone environment built on top of the MuJoCo physics engine. MuJoCo-Drones-Gym supports an arbitrary number of Bitcraze Crazyflie 2.x nano-quadcopters and exposes a modular API for selecting (i)~the physics model (rigid-body MuJoCo, explicit Python dynamics, or any subset of ground effect, blade drag, and inter-drone downwash), (ii)~the action interface (per-motor RPMs, collective normalized thrust, velocity setpoints, or PID waypoint commands), and (iii)~the observation space (kinematic state vectors, RGB / depth / segmentation cameras, or neighbourhood adjacency information). A PettingZoo ParallelEnv wrapper enables drop-in multi-agent reinforcement learning, while a suite of seven task environments, hover, velocity tracking, multi-drone hover, waypoint navigation, formation flight, gate racing, and a generic multi-agent template, demonstrates the breadth of the interface. We describe the environment design, the underlying physics and quadcopter dynamics, and illustrate its use through control and learning examples that mirror those of the closely related gym-pybullet-drones project, while taking advantage of MuJoCo's improved contact handling, rendering, and parallelizability.
Abstract（参考訳）: ロボットシミュレータは、新しい制御アルゴリズムを開発するための手段としても、強化学習(RL)ポリシーをトレーニングするためのデータソースとしても機能する。しかし、既存のクアッドコプター学習環境は、物理的忠実さ、マルチエージェントサポート、そして現代の深層RLパイプラインに必要なスループットのトレードオフに直面していることが多い。本稿では,MuJoCo物理エンジン上に構築されたオープンソースのGymnasium互換マルチドローン環境であるMuJoCo-Drones-Gymを紹介する。 MuJoCo-Drones-Gymは任意の数のBitcraze Crazyflie 2.xナノクアッドコプターをサポートし、選択のためのモジュールAPIを公開する。 (i)~物理モデル(剛体 MuJoCo、明示的なPythonダイナミックス、あるいは地面効果のサブセット、ブレードドラッグ、およびドローン間ダウンウォッシュ) (ii)~アクションインターフェース(運動量当たりのRPM、集合正規化推力、速度設定点、PIDウェイポイントコマンド)、 (iii)~観測空間(運動状態ベクトル、RGB/深度/セグメンテーションカメラ、周辺隣接情報) PettingZoo ParallelEnvラッパーは、マルチエージェント強化学習を可能にする一方で、ホバー、ベロシティトラッキング、マルチドローンホバー、ウェイポイントナビゲーション、フォーメーションフライト、ゲートレース、ジェネリックマルチエージェントテンプレートといった7つのタスク環境からなるスイートは、インターフェースの幅を実証している。環境設計,基礎となる物理,クアッドコプターのダイナミクスを解説し,MuJoCoのコンタクトハンドリング,レンダリング,並列化性の向上を活用しながら,近縁なジム-ピブルル-ドロンプロジェクトの状況を反映した制御と学習の例を解説する。

論文の概要: MuJoCo-Drones-Gym: A GPU-Accelerated Multi-Drone Simulator for Control and Reinforcement Learning

関連論文リスト