Fugu-MT Paper Translation (Summary): Latent World Models for Automated Driving: A Unified Taxonomy, Evaluation Framework, and Open Challenges

Paper summary: Latent World Models for Automated Driving: A Unified Taxonomy, Evaluation Framework, and Open Challenges

arxiv url: http://arxiv.org/abs/2603.09086v1
Date: Tue, 10 Mar 2026 01:56:17 GMT
Status: Translation complete
Last updated in system: 2026-03-11 15:25:23.938599
Title: Latent World Models for Automated Driving: A Unified Taxonomy, Evaluation Framework, and Open Challenges
Authors: Rongxiang Zeng, Yongqi Dong
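Abstract summary: This paper proposes a unifying latent-space framework that synthesizes recent progress in world models for automated driving. The framework organizes the design space by the target and form of latent representations (latent worlds, latent actions, latent generators; continuous states, discrete tokens, and hybrids) and by structural priors for geometry, topology, and semantics.
Reference score (attention score computed independently by the site): 2.76240219662896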
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Emerging generative world models and vision-language-action (VLA) systems are rapidly reshaping automated driving by enabling scalable simulation, long-horizon forecasting, and capability-rich decision making. Across these directions, latent representations serve as the central computational substrate: they compress high-dimensional multi-sensor observations, enable temporally coherent rollouts, and provide interfaces for planning, reasoning, and controllable generation. This paper proposes a unifying latent-space framework that synthesizes recent progress in world models for automated driving. The framework organizes the design space by the target and form of latent representations (latent worlds, latent actions, latent generators; continuous states, discrete tokens, and hybrids) and by structural priors for geometry, topology, and semantics. Building on this taxonomy, the paper articulates five cross-cutting internal mechanics (i.e., structural isomorphism, long-horizon temporal stability, semantic and reasoning alignment, value-aligned objectives and post-training, as well as adaptive computation and deliberation) and connects these design choices to robustness, generalization, and deployability. The work also proposes concrete evaluation prescriptions, including a closed-loop metric suite and a resource-aware deliberation cost, designed to reduce the open-loop / closed-loop mismatch. Finally, the paper identifies actionable research directions toward advancing latent world models for decision-ready, verifiable, and resource-efficient automated driving.
