Fugu-MT Paper Translation (Abstract): OmniNWM: Omniscient Driving Navigation World Models

Paper overview: OmniNWM: Omniscient Driving Navigation World Models

arxiv url: http://arxiv.org/abs/2510.18313v1
Date: Tue, 21 Oct 2025 05:49:01 GMT
Status: Translation complete
Last updated in system: 2025-10-25 03:08:12.934229
Title: OmniNWM: Omniscient Driving Navigation World Models
Title (reference translation): OmniNWM: Omniscient Driving Navigation World Models
Authors: Bohan Li, Zhuang Ma, Dalong Du, Baorui Peng, Zhujin Liang, Zhenqiang Liu, Chao Ma, Yueming Jin, Hao Zhao, Wenjun Zeng, Xin Jin
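Abstract summary: We introduce OmniNWM, a panoramic navigation world model that addresses all three dimensions (state, action, and reward) within a unified framework. For state, OmniNWM jointly generates panoramic videos of RGB, semantics, metric depth, and 3D occupancy. For action, we introduce a normalized panoramic Plucker ray-map representation that encodes input trajectories into pixel-level signals.
Reference score (attention metric computed independently by the site): 41.681741324622735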
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Autonomous driving world models are expected to work effectively across three core dimensions: state, action, and reward. Existing models, however, are typically restricted to limited state modalities, short video sequences, imprecise action control, and a lack of reward awareness. In this paper, we introduce OmniNWM, an omniscient panoramic navigation world model that addresses all three dimensions within a unified framework. For state, OmniNWM jointly generates panoramic videos of RGB, semantics, metric depth, and 3D occupancy. A flexible forcing strategy enables high-quality long-horizon auto-regressive generation. For action, we introduce a normalized panoramic Plucker ray-map representation that encodes input trajectories into pixel-level signals, enabling highly precise and generalizable control over panoramic video generation. Regarding reward, we move beyond learning reward functions with external image-based models: instead, we leverage the generated 3D occupancy to directly define rule-based dense rewards for driving compliance and safety. Extensive experiments demonstrate that OmniNWM achieves state-of-the-art performance in video generation, control accuracy, and long-horizon stability, while providing a reliable closed-loop evaluation framework through occupancy-grounded rewards. Project page is available at https://github.com/Arlo0o/OmniNWM.
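As a rough illustration of the action representation mentioned in the abstract, the sketch below builds a per-pixel Plucker ray map (unit ray direction plus moment, m = o x d) for a single pinhole camera using NumPy. The function name `plucker_ray_map`, the pinhole model, and the pose convention are assumptions made for this example. The abstract does not say how the paper normalizes the map or stitches multiple cameras into a panorama, so this sketch only unit-normalizes the ray directions.

```python
import numpy as np

def plucker_ray_map(K, cam_to_world, height, width):
    """Per-pixel Plucker coordinates (direction, moment) for one pinhole camera.

    K: (3, 3) intrinsics; cam_to_world: (4, 4) camera pose (assumed convention).
    Returns an array of shape (height, width, 6).
    """
    # Pixel centers in homogeneous image coordinates.
    u, v = np.meshgrid(np.arange(width) + 0.5, np.arange(height) + 0.5)
    pix = np.stack([u, v, np.ones_like(u)], axis=-1)  # (H, W, 3)

    # Back-project to camera-frame rays, then rotate into the world frame.
    dirs_cam = pix @ np.linalg.inv(K).T
    R, t = cam_to_world[:3, :3], cam_to_world[:3, 3]
    dirs = dirs_cam @ R.T
    dirs = dirs / np.linalg.norm(dirs, axis=-1, keepdims=True)

    # Moment m = o x d, where o is the camera center shared by every ray.
    moment = np.cross(np.broadcast_to(t, dirs.shape), dirs)
    return np.concatenate([dirs, moment], axis=-1)
```

Evaluating this function for each pose along an input trajectory would give one six-channel map per frame, which is one plausible reading of "encoding trajectories into pixel-level signals". With an identity pose the moment channels are all zero, since every ray passes through the origin.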
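Abstract (reference translation): World models for autonomous driving are expected to work effectively across three core dimensions: state, action, and reward. However, existing models are typically limited to a narrow set of state modalities, short video sequences, imprecise action control, and no reward awareness. This paper introduces OmniNWM, an omniscient panoramic navigation world model that addresses all three dimensions within a unified framework. For state, OmniNWM jointly generates panoramic videos of RGB, semantics, metric depth, and 3D occupancy. A flexible forcing strategy enables high-quality long-horizon auto-regressive generation. For action, a normalized panoramic Plucker ray-map representation encodes input trajectories into pixel-level signals, enabling highly precise and generalizable control over panoramic video generation. For reward, rather than learning reward functions with external image-based models, the generated 3D occupancy is used to directly define rule-based dense rewards for driving compliance and safety. Extensive experiments show that OmniNWM achieves state-of-the-art performance in video generation, control accuracy, and long-horizon stability, while providing a reliable closed-loop evaluation framework through occupancy-grounded rewards. The project page is available at https://github.com/Arlo0o/OmniNWM.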
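To make the idea of occupancy-grounded, rule-based rewards more concrete, here is a minimal sketch of one such rule: a per-waypoint collision penalty read directly from a 3D occupancy grid. The abstract does not describe the actual reward rules, so everything here is hypothetical, including the function name `occupancy_rewards`, the grid layout (X, Y, Z), the voxel size, the grid origin, and the choice to treat waypoints outside the grid as free.

```python
import numpy as np

def occupancy_rewards(occupancy, trajectory_xy, voxel_size=0.5, origin_xy=(-40.0, -40.0)):
    """Dense per-waypoint reward: -1.0 if the waypoint falls in an occupied
    bird's-eye-view cell, 0.0 otherwise (hypothetical rule for illustration).

    occupancy: boolean array of shape (X, Y, Z); trajectory_xy: (N, 2) in meters.
    """
    # Collapse the height axis: a BEV cell is blocked if any voxel above it is occupied.
    bev = occupancy.any(axis=-1)

    # Convert metric waypoints to integer grid indices.
    offsets = np.asarray(trajectory_xy, dtype=float) - np.asarray(origin_xy, dtype=float)
    idx = np.floor(offsets / voxel_size).astype(int)

    # Waypoints outside the grid are treated as free (an assumption of this sketch).
    inside = ((idx >= 0) & (idx < np.array(bev.shape))).all(axis=-1)
    hit = np.zeros(len(idx), dtype=bool)
    hit[inside] = bev[idx[inside, 0], idx[inside, 1]]
    return np.where(hit, -1.0, 0.0)
```

Because the rule is evaluated on geometry rather than on a learned image-based scorer, every waypoint gets a reward value, which is one way to read "dense" in the abstract. Compliance rules (for example, staying on drivable surface) would need semantic labels in the grid and are not covered by this sketch.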

