Fugu-MT 論文翻訳(概要): Kinematics-Aware Latent World Models for Data-Efficient Autonomous Driving

論文の概要: Kinematics-Aware Latent World Models for Data-Efficient Autonomous Driving

arxiv url: http://arxiv.org/abs/2603.07264v1
Date: Sat, 07 Mar 2026 15:47:54 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-10 15:13:14.185725
Title: Kinematics-Aware Latent World Models for Data-Efficient Autonomous Driving
Title（参考訳）: データ効率の良い自律運転のためのキネマティクスを考慮した潜在世界モデル
Authors: Jiazhuo Li, Linjiang Cao, Qi Liu, Xi Xiong,
Abstract要約: 車両キネマティック情報は観測エンコーダに組み込まれ、物理的に意味のある運動力学における潜伏遷移を基底とする。ジオメトリ・アウェアの監督は、RSSM潜時状態を規則化し、ピクセル再構成を超えてタスク関連空間構造をキャプチャする。
参考スコア（独自算出の注目度）: 8.391716372682316
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Data-efficient learning remains a central challenge in autonomous driving due to the high cost and safety risks of large-scale real-world interaction. Although world-model-based reinforcement learning enables policy optimization through latent imagination, existing approaches often lack explicit mechanisms to encode spatial and kinematic structure essential for driving tasks. In this work, we build upon the Recurrent State-Space Model (RSSM) and propose a kinematics-aware latent world model framework for autonomous driving. Vehicle kinematic information is incorporated into the observation encoder to ground latent transitions in physically meaningful motion dynamics, while geometry-aware supervision regularizes the RSSM latent state to capture task-relevant spatial structure beyond pixel reconstruction. The resulting structured latent dynamics improve long-horizon imagination fidelity and stabilize policy optimization. Experiments in a driving simulation benchmark demonstrate consistent gains over both model-free and pixel-based world-model baselines in terms of sample efficiency and driving performance. Ablation studies further verify that the proposed design enhances spatial representation quality within the latent space. These results suggest that integrating kinematic grounding into RSSM-based world models provides a scalable and physically grounded paradigm for autonomous driving policy learning.
Abstract（参考訳）: データ効率の学習は、大規模な現実世界の対話のコストと安全性のリスクのために、自動運転において依然として中心的な課題である。世界モデルに基づく強化学習は、潜在想像力を通じてポリシーの最適化を可能にするが、既存のアプローチでは、タスクの駆動に不可欠な空間的および運動的構造を符号化する明確なメカニズムが欠如していることが多い。本研究では、リカレントステートスペースモデル(RSSM)を構築し、自律運転のためのキネマティクス対応潜在世界モデルフレームワークを提案する。車両キネマティック情報は観測エンコーダに組み込まれ、物理的に意味のある運動力学における潜時遷移を基底とし、幾何学的監視はRSSM潜時状態を正規化し、画素再構成を超えてタスク関連空間構造をキャプチャする。得られた構造的潜在力学は、長い水平想像力を改善し、ポリシー最適化を安定化させる。ドライビングシミュレーションベンチマークの実験では、サンプル効率と駆動性能の両面において、モデルフリーとピクセルベースの両方のワールドモデルベースラインに対して一貫した利得を示した。アブレーション研究により,提案手法が潜在空間における空間表現の質を高めることが確認された。これらの結果は、キネマティックグラウンドをRSSMベースの世界モデルに統合することで、自律運転政策学習のためのスケーラブルで物理的に基盤付けられたパラダイムが提供されることを示唆している。

論文の概要: Kinematics-Aware Latent World Models for Data-Efficient Autonomous Driving

関連論文リスト