Fugu-MT 論文翻訳(概要): RAP: 3D Rasterization Augmented End-to-End Planning

論文の概要: RAP: 3D Rasterization Augmented End-to-End Planning

arxiv url: http://arxiv.org/abs/2510.04333v1
Date: Sun, 05 Oct 2025 19:31:24 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-07 16:52:59.589911
Title: RAP: 3D Rasterization Augmented End-to-End Planning
Title（参考訳）: RAP:3Dラスタ化でエンド・ツー・エンドプランを拡大
Authors: Lan Feng, Yang Gao, Eloi Zablocki, Quanyi Li, Wuyang Li, Sichao Liu, Matthieu Cord, Alexandre Alahi,
Abstract要約: エンド・ツー・エンドの列車政策の模擬学習は、専門家によるデモンストレーションでのみ行われる。アノテーション付きプリミティブの軽量化により、コストのかかるレンダリングを置き換える3Dラスタライゼーションを提案する。 RAPは最先端のクローズドループとロングテールロバスト性を実現し、4つの主要なベンチマークで第1位となった。
参考スコア（独自算出の注目度）: 104.52778241744522
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Imitation learning for end-to-end driving trains policies only on expert demonstrations. Once deployed in a closed loop, such policies lack recovery data: small mistakes cannot be corrected and quickly compound into failures. A promising direction is to generate alternative viewpoints and trajectories beyond the logged path. Prior work explores photorealistic digital twins via neural rendering or game engines, but these methods are prohibitively slow and costly, and thus mainly used for evaluation. In this work, we argue that photorealism is unnecessary for training end-to-end planners. What matters is semantic fidelity and scalability: driving depends on geometry and dynamics, not textures or lighting. Motivated by this, we propose 3D Rasterization, which replaces costly rendering with lightweight rasterization of annotated primitives, enabling augmentations such as counterfactual recovery maneuvers and cross-agent view synthesis. To transfer these synthetic views effectively to real-world deployment, we introduce a Raster-to-Real feature-space alignment that bridges the sim-to-real gap. Together, these components form Rasterization Augmented Planning (RAP), a scalable data augmentation pipeline for planning. RAP achieves state-of-the-art closed-loop robustness and long-tail generalization, ranking first on four major benchmarks: NAVSIM v1/v2, Waymo Open Dataset Vision-based E2E Driving, and Bench2Drive. Our results show that lightweight rasterization with feature alignment suffices to scale E2E training, offering a practical alternative to photorealistic rendering. Project page: https://alan-lanfeng.github.io/RAP/.
Abstract（参考訳）: エンド・ツー・エンドの列車政策の模擬学習は、専門家によるデモンストレーションでのみ行われる。クローズドループにデプロイされると、そのようなポリシにはリカバリデータがない。有望な方向は、ログされたパスを越えて、代替の視点と軌道を生成することである。以前の研究では、ニューラルレンダリングやゲームエンジンを通じて、フォトリアリスティックなデジタル双生児を探索していたが、これらの手法は明らかに遅くてコストがかかるため、主に評価に使用される。本研究では、エンド・ツー・エンド・プランナーの育成に光リアリズムは不要であると主張する。運転はテクスチャや照明ではなく、幾何学や力学に依存します。そこで本研究では,アノテーション付きプリミティブの軽量ラスタ化に代えて3Dラスタ化を提案する。これらの合成ビューを実世界の展開に効果的に転送するために,シミュレート・トゥ・リアルギャップを橋渡しするRaster-to-Real機能空間アライメントを導入する。これらのコンポーネントは共に、計画のためのスケーラブルなデータ拡張パイプラインであるRasterization Augmented Planning(RAP)を形成する。 RAPは最先端のクローズドループロバスト性とロングテールの一般化を実現し、NAVSIM v1/v2、Waymo Open Dataset VisionベースのE2E Driving、Bench2Driveの4つの主要なベンチマークでランクインしている。以上の結果から,機能アライメントによる軽量ラスタライゼーションはE2Eトレーニングをスケールするのに十分であり,フォトリアリスティックレンダリングの実用的な代替手段であることがわかった。プロジェクトページ: https://alan-lanfeng.github.io/RAP/。

論文の概要: RAP: 3D Rasterization Augmented End-to-End Planning

関連論文リスト