Fugu-MT 論文翻訳(概要): MAPLE: Latent Multi-Agent Play for End-to-End Autonomous Driving

論文の概要: MAPLE: Latent Multi-Agent Play for End-to-End Autonomous Driving

arxiv url: http://arxiv.org/abs/2605.14201v2
Date: Tue, 19 May 2026 23:20:07 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-21 14:55:44.110609
Title: MAPLE: Latent Multi-Agent Play for End-to-End Autonomous Driving
Title（参考訳）: MAPLE: エンド・ツー・エンド自動運転のマルチエージェント・プレイ
Authors: Rajeev Yasarla, Deepti Hegde, Hsin-Pai Cheng, Shizhong Han, Yunxiao Shi, Meysam Sadeghigooghari, Hanno Ackermann, Litian Liu, Pranav Desai, Fatih Porikli, Mohammad Ghavamzadeh, Hong Cai,
Abstract要約: 視覚言語-アクション(VLA)モデルは、エンドツーエンドのモーションプランナーとして有効であるが、クローズドループ設定で評価すると不安定である。本稿では, VLAモデルの潜在空間における動的駆動シナリオの, リアクティブでマルチエージェントなロールアウトのための新しいフレームワークMAPLEを提案する。 MAPLEはBench2Driveで最先端の駆動性能を実現し、堅牢なE2E自動運転システムのためのスケーラブルでクローズループなマルチエージェントプレイを実演する。
参考スコア（独自算出の注目度）: 62.43744546817599
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Vision-language-action (VLA) models are effective as end-to-end motion planners, but can be brittle when evaluated in closed-loop settings due to being trained under traditional imitation learning framework. Existing closed-loop supervision approaches lack scalability and fail to completely model a reactive environment. We propose MAPLE, a novel framework for reactive, multi-agent rollout of a dynamic driving scenario in the latent space of the VLA model. The ego vehicle and nearby traffic agents are independently controlled over multi-step horizons, while being reactive to other agents in the scene, enabling closed-loop training. MAPLE consists of two training stages: (1) supervised fine-tuning on the latent rollouts based on ground-truth trajectories, followed by (2) reinforcement learning with global and agent -specific rewards that encourage safety, progress, and interaction realism. We further propose diversity rewards that encourage the model to generate planning behaviors that may not be present in logged driving data. Notably, our closed-loop training framework is scalable and does not require external simulators, which can be computationally expensive to run and have limited visual fidelity to the real-world. MAPLE achieves state-of-the-art driving performance on Bench2Drive and demonstrates scalable, closed-loop multi-agent play for robust E2E autonomous driving systems.
Abstract（参考訳）: 視覚言語アクション(VLA)モデルは、エンドツーエンドのモーションプランナーとして有効であるが、従来の模倣学習フレームワークでトレーニングされているため、クローズドループ設定で評価すると不安定になる可能性がある。既存のクローズドループ監視アプローチにはスケーラビリティがなく、リアクティブ環境を完全にモデル化することができない。本稿では, VLAモデルの潜在空間における動的駆動シナリオの, リアクティブでマルチエージェントなロールアウトのための新しいフレームワークMAPLEを提案する。エゴの車両と近くの交通機関は、シーン内の他のエージェントと反応しながら、複数のステップの水平線で独立に制御され、クローズドループの訓練を可能にしている。 MAPLEは,(1)地道軌道に基づく潜伏ロールアウトの監督的微調整,(2)グローバルとエージェントによる強化学習と,安全性,進歩,相互作用リアリズムの促進という2つの訓練段階から構成される。さらに、ログ化された運転データに存在しないかもしれない計画行動を生成するようモデルに促す多様性報酬を提案する。特に、クローズドループトレーニングフレームワークはスケーラブルであり、外部シミュレータを必要としない。 MAPLEはBench2Driveで最先端の駆動性能を実現し、堅牢なE2E自動運転システムのためのスケーラブルでクローズループなマルチエージェントプレイを実演する。

論文の概要: MAPLE: Latent Multi-Agent Play for End-to-End Autonomous Driving

関連論文リスト