Fugu-MT 論文翻訳(概要): Heterogeneous Self-Play for Realistic Highway Traffic Simulation

論文の概要: Heterogeneous Self-Play for Realistic Highway Traffic Simulation

arxiv url: http://arxiv.org/abs/2604.16406v1
Date: Tue, 31 Mar 2026 19:39:29 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-04 02:32:13.970172
Title: Heterogeneous Self-Play for Realistic Highway Traffic Simulation
Title（参考訳）: 実効性高速道路交通シミュレーションのための不均質なセルフプレイ
Authors: Jinkai Qiu, Alessandro Saviolo, Chaojie Wang, Mingke Wang, Xiaoyu Huang,
Abstract要約: PHASE, Policy for Heterogeneous Agent Self-play on Expresswayは、コンテキスト対応のセルフプレイフレームワークである。例えば、旅客車やトレーラートラックなどの異なる車両のプロフィールを単一のポリシーでサポートしている。 96.3%の成功率でADE/FDEを6.57/12.07mから2.44/5.25mに下げる。
参考スコア（独自算出の注目度）: 42.67401323169807
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Realistic highway simulation is critical for scalable safety evaluation of autonomous vehicles, particularly for interactions that are too rare to study from logged data alone. Yet highway traffic generation remains challenging because it requires broad coverage across speeds and maneuvers, controllable generation of rare safety-critical scenarios, and behavioral credibility in multi-agent interactions. We present PHASE, Policy for Heterogeneous Agent Self-play on Expressway, a context-aware self-play framework that addresses these three requirements through explicit per-agent conditioning for controllability, synthetic scenario generation for broad highway coverage, and closed-loop multi-agent training for realistic interaction dynamics. PHASE further supports different vehicle profiles, for example, passenger cars and articulated trailer trucks, within a single policy via vehicle-aware dynamics and context-conditioned actions, and stabilizes self-play with early termination of unrecoverable states, at-fault collision attribution, highway-aware reward shaping, coupled curricula, and robust policy optimization. Despite being trained only on synthetic data, PHASE transfers zero-shot to 512 unseen high-interaction real scenarios in exiD, achieving a 96.3% success rate and reducing ADE/FDE from 6.57/12.07 m to 2.44/5.25 m relative to a prior self-play baseline. In a learned trajectory embedding space, it also improves behavioral realism over IDM, reducing Frechet trajectory distance by 13.1% and energy distance by 20.2%. These results show that synthetic self-play can provide a scalable route to controllable and realistic highway scenario generation without direct imitation of expert logs.
Abstract（参考訳）: 現実的なハイウェイシミュレーションは、自動運転車のスケーラブルな安全性評価、特にログデータだけでは研究できないインタラクションにとって重要である。しかし、ハイウェイの交通生成は、スピードと操作、レアセーフティクリティカルなシナリオの制御可能な生成、マルチエージェントインタラクションにおける行動信頼性を必要とするため、依然として困難である。提案するPHASE, Policy for Heterogeneous Agent Self-play on Expressway, A context-aware self-play framework, a context-ware self-play framework that handle these three requirements through explicit per-agent conditioning for controllability, a synthetic scenario generation for broad highway coverage, and closed-loop multi-agent training for real interaction dynamics。 PHASEはさらに、旅客車やトレーラートラックなどの車両プロファイルを、車両認識のダイナミクスとコンテキスト条件付きアクションを通じて単一のポリシーでサポートし、保存不能状態の早期終了によるセルフプレイの安定化、オン・フォールト衝突の帰属、ハイウェイ対応の報酬形成、結合されたキュリキュラ、ロバストなポリシー最適化など、異なる車両プロファイルをサポートする。 PHASEは合成データのみに基づいて訓練されているにもかかわらず、ゼロショットをexiDで目にしない高反応の実シナリオ512に転送し、96.3%の成功率を獲得し、ADE/FDEを以前のセルフプレイベースラインに比べて6.57/12.07mから2.44/5.25mに削減した。学習された軌道埋め込み空間では、DM上の行動リアリズムを改善し、フレシェ軌道距離を13.1%、エネルギー距離を20.2%削減する。これらの結果から, 合成自己再生は, 専門家ログを直接模倣することなく, 制御可能かつ現実的なハイウェイシナリオ生成にスケーラブルな経路を提供することが可能であることが示唆された。

論文の概要: Heterogeneous Self-Play for Realistic Highway Traffic Simulation

関連論文リスト