Fugu-MT 論文翻訳(概要): Network Distributed Multi-Agent Reinforcement Learning for Consensus Control of Quadcopters

論文の概要: Network Distributed Multi-Agent Reinforcement Learning for Consensus Control of Quadcopters

arxiv url: http://arxiv.org/abs/2606.02107v1
Date: Mon, 01 Jun 2026 11:39:40 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-02 21:34:31.892972
Title: Network Distributed Multi-Agent Reinforcement Learning for Consensus Control of Quadcopters
Title（参考訳）: クアッドコプターのコンセンサス制御のためのネットワーク分散マルチエージェント強化学習
Authors: Youssef Mahran, Zeyad Gamal, Aamir Ahmad, Ayman El-Badawy,
Abstract要約: 本稿では,クワッドコプターコンセンサス制御のためのネットワーク分散マルチエージェント強化学習(ND-MARL)フレームワークを提案する。その結果、集中型MARLコントローラと比較して、スムーズなコンセンサストラジェクトリとプランナー-トラッカー統合が示された。
参考スコア（独自算出の注目度）: 1.0499611180329804
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: This paper proposes a Network Distributed Multi-Agent Reinforcement Learning (ND-MARL) framework for quadcopter consensus control. Compared to conventional multi-agent MARL formulations that rely on centralized planning or fully decentralized execution, ND-MARL incorporates the swarm communication graph into the decision process. Under a 2-Neighbor communication topology, each agent observes information of only two neighbors and outputs an action through a distributed policy. A high-level distributed consensus planner is trained using Multi-Agent Soft Actor-Critic (MASAC) and embedded in a hierarchical stack to generate reference target positions tracked by a low-level quadcopter controller. Results demonstrate smooth consensus trajectories and planner-tracker integration when compared to a centralized MARL controller. Most notably, the learned controller exhibits zero-shot scalability, as policies trained on a three-agent system are deployed to swarms of up to 250 agents under the same 2-Neighbor communication topology without retraining or fine-tuning, achieving consistent convergence with increasing steady-state spread at large team sizes due to sparse information propagation. These findings highlight ND-MARL as a stable framework for distributed, communication-aware quadcopter consensus control.
Abstract（参考訳）: 本稿では,クワッドコプターコンセンサス制御のためのネットワーク分散マルチエージェント強化学習(ND-MARL)フレームワークを提案する。 ND-MARLは、中央集権計画や完全な分散実行に依存する従来のマルチエージェントMARLの定式化と比較して、Swarm通信グラフを決定プロセスに組み込んでいる。 2-Neighbor通信トポロジーの下で、各エージェントは2つの隣人の情報のみを観察し、分散ポリシーを介してアクションを出力する。高レベルの分散コンセンサスプランナーは、マルチエージェント・ソフトアクター・クリティカル(MASAC)を用いて訓練され、階層的なスタックに埋め込まれ、低レベルのクアッドコプターコントローラによって追跡される基準目標位置を生成する。その結果、集中型MARLコントローラと比較して、スムーズなコンセンサストラジェクトリとプランナー-トラッカー統合が示された。最も注目すべきは、学習されたコントローラがゼロショットのスケーラビリティを示すことだ。3エージェントシステムでトレーニングされたポリシーは、トレーニングや微調整をすることなく、2つの隣の通信トポロジの下で最大250のエージェントの群にデプロイされる。これらの結果は,分散通信対応クアッドコプターコンセンサス制御のための安定フレームワークとしてND-MARLが注目されている。

論文の概要: Network Distributed Multi-Agent Reinforcement Learning for Consensus Control of Quadcopters

関連論文リスト