Fugu-MT 論文翻訳(概要): HorizonDrive: Self-Corrective Autoregressive World Model for Long-horizon Driving Simulation

論文の概要: HorizonDrive: Self-Corrective Autoregressive World Model for Long-horizon Driving Simulation

arxiv url: http://arxiv.org/abs/2605.11596v1
Date: Tue, 12 May 2026 06:22:16 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-13 21:48:56.627419
Title: HorizonDrive: Self-Corrective Autoregressive World Model for Long-horizon Driving Simulation
Title（参考訳）: HorizonDrive:ロングホライゾン駆動シミュレーションのための自己補正型自己回帰世界モデル
Authors: Conglang Zhang, Yifan Zhan, Qingjie Wang, Zhanpeng Ouyang, Yu Li, Zihao Yang, Xiaoyang Guo, Weiqiang Ren, Qian Zhang, Zhen Dong, Yinqiang Zheng, Wei Yin, Zhengqing Chen,
Abstract要約: HorizonDriveはAR駆動シミュレーションのためのアンチドリフティングトレーニング・アンド・蒸留フレームワークである。境界メモリ下でのミニスケールARロールアウトをサポートする。 FIDを52%下げ、FVDを37%下げ、AREとDTWを9%下げる。
参考スコア（独自算出の注目度）: 43.56520703300463
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Closed-loop driving simulation requires real-time interaction beyond short offline clips, pushing current driving world models toward autoregressive (AR) rollout. Existing AR distillation approaches typically rely on frame sinks or student-side degradation training. The former transfers poorly to driving due to fast ego-motion and rapid scene changes, while the latter remains bounded by the teacher's single-pass output length and thus provides only a limited supervision horizon. A natural question is: can the teacher itself be extended via AR rollout to provide unbounded-horizon supervision at bounded memory cost? The key difficulty is that a standard teacher drifts under its own predictions, contaminating the supervision it provides. Our key insight is to make the teacher rollout-capable, ensuring reliable supervision from its own AR rollouts. This is instantiated as HorizonDrive, an anti-drifting training-and-distillation framework for AR driving simulation. First, scheduled rollout recovery (SRR) trains the base model to reconstruct ground-truth future clips from prediction-corrupted histories, yielding a teacher that remains stable across long AR rollouts. Second, the rollout-capable teacher is extended via AR rollout, providing long-horizon distribution-matching supervision under bounded memory, while a short-window student aligns to it with teacher rollout DMD (TRD) for efficient real-time deployment. HorizonDrive natively supports minute-scale AR rollout under bounded memory; on nuScenes, HorizonDrive reduces FID by 52% and FVD by 37%, and lowers ARE and DTW by 21% and 9% relative to the strongest long-horizon streaming baselines, while remaining competitive with single-pass driving video generators.
Abstract（参考訳）: クローズドループ駆動シミュレーションは、短いオフラインクリップ以上のリアルタイムインタラクションを必要とし、現在の駆動世界モデルを自動回帰(AR)ロールアウトにプッシュする。既存のAR蒸留手法は一般的にフレームシンクや学生側の劣化訓練に頼っている。前者は高速なエゴモーションと急激なシーンの変化のために運転に不向きであり、後者は教師のシングルパス出力長によって拘束されているため、限られた監督地平線しか提供しない。自然な疑問は、教師自身をARロールアウトを通じて拡張して、境界メモリコストで非境界水平監視を提供できるか、ということです。重要な難点は、標準教師が独自の予測の下で漂流し、それが提供する監督を汚染することである。私たちの重要な洞察は、教師のロールアウトを可能とし、自身のARロールアウトから信頼できる監督を保証することです。これは、AR駆動シミュレーションのためのアンチドリフティングトレーニングと蒸留フレームワークであるHorizonDriveとしてインスタンス化されている。第一に、スケジュールされたロールアウトリカバリ(SRR)がベースモデルをトレーニングし、予測が破損した履歴から地中真直近のクリップを再構築し、長いARロールアウトで安定した教師を生み出す。第二に、ロールアウト可能な教師はARロールアウトにより拡張され、長期の分散マッチング管理を境界メモリ下で提供し、短ウィンドウの学生は教師ロールアウトDMD(TRD)と整列して、効率的なリアルタイムデプロイメントを実現する。 nuScenesでは、HorizonDriveはFIDを52%、FVDを37%削減し、AREとDTWを最強のロングホライゾンストリーミングベースラインに比べて21%、9%下げる一方で、シングルパス駆動ビデオジェネレータと競合する。

論文の概要: HorizonDrive: Self-Corrective Autoregressive World Model for Long-horizon Driving Simulation

関連論文リスト