Fugu-MT Paper Translation (Abstract): What Matters for Scalable and Robust Learning in End-to-End Driving Planners?

Paper summary: What Matters for Scalable and Robust Learning in End-to-End Driving Planners?

arxiv url: http://arxiv.org/abs/2603.15185v1
Date: Mon, 16 Mar 2026 12:20:34 GMT
Status: Translation complete
Last updated in system: 2026-03-17 18:28:58.210158
Title: What Matters for Scalable and Robust Learning in End-to-End Driving Planners?
Authors: David Holtz, Niklas Hanselmann, Simon Doll, Marius Cordts, Bernt Schiele
Abstract summary: We re-examine the impact of architectural patterns on closed-loop performance. We introduce BevAD, a lightweight and scalable end-to-end driving architecture.
Reference score (independently computed attention score): 45.17722693412255
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: End-to-end autonomous driving has gained significant attention for its potential to learn robust behavior in interactive scenarios and scale with data. Popular architectures often build on separate modules for perception and planning connected through latent representations, such as bird's-eye-view feature grids, to maintain end-to-end differentiability. This paradigm emerged mostly on open-loop datasets, with evaluation focusing not only on driving performance, but also on intermediate perception tasks. Unfortunately, architectural advances that excel in open-loop often fail to translate to scalable learning of robust closed-loop driving. In this paper, we systematically re-examine the impact of common architectural patterns on closed-loop performance: (1) high-resolution perceptual representations, (2) disentangled trajectory representations, and (3) generative planning. Crucially, our analysis evaluates the combined impact of these patterns, revealing both unexpected limitations and underexplored synergies. Building on these insights, we introduce BevAD, a novel lightweight and highly scalable end-to-end driving architecture. BevAD achieves 72.7% success rate on the Bench2Drive benchmark and demonstrates strong data-scaling behavior using pure imitation learning. Our code and models are publicly available here: https://dmholtz.github.io/bevad/

Related papers