Fugu-MT 論文翻訳(概要): Generalization of World Models under Environmental Variability for Vision-based Quadrotor Navigation

論文の概要: Generalization of World Models under Environmental Variability for Vision-based Quadrotor Navigation

arxiv url: http://arxiv.org/abs/2606.05015v1
Date: Wed, 03 Jun 2026 15:38:36 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-04 20:44:18.863478
Title: Generalization of World Models under Environmental Variability for Vision-based Quadrotor Navigation
Title（参考訳）: 視覚に基づくクアドロレータナビゲーションのための環境変動下における世界モデルの一般化
Authors: Luca Zanatta, Grzegorz Malczyk, Kostas Alexis,
Abstract要約: 世界モデルは、環境がどのように進化するかを予測する学習された生成モデルであり、サンプル効率のよいロボット学習のための有望なツールとなっている。本研究では,視覚に基づく四角形ナビゲーションをテストベッド問題とし,環境ランダム性の異なるDreamerV3ベースの世界モデルを訓練する。そして、すべての世界モデルと関連するナビゲーションポリシーを、すべてのセンサーが切断される前に実際の感覚入力を2.5秒だけ受信するオープンループランなど、目に見えない環境で実際の四角形に展開します。以上の結果から,SSL事前トレーニング中の世界モデルロバスト性はsim-to-realの強い予測因子であることが示された。
参考スコア（独自算出の注目度）: 10.364779390403337
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: World models, learned generative models that predict how an environment evolves, have become a promising tool for sample-efficient robot learning. Yet how robust they are to environmental variability remains poorly understood. To address this, we conduct a systematic study using vision-based quadrotor navigation as a testbed problem, training DreamerV3-based world models under varying levels of environmental randomness and evaluating them across all levels through cross-environment validation, spanning both Self-Supervised Learning (SSL) pretraining and Reinforcement Learning (RL) fine-tuning. We then deploy all world models and associated navigation policies on a real quadrotor in unseen environments, including an open-loop run where the model receives just 2.5s of real sensory input before all sensors are cut off, leaving the system to navigate entirely in imagination over a 12m traverse. Our results show that world model robustness during SSL pretraining is a strong predictor of sim-to-real transfer: every model that generalized well in cross-environment SSL validation deployed successfully in the real world, passing through gaps as narrow as 0.67m, whereas the model that dominated simulation policy evaluation failed on the real platform. We further identify (a) the discrete latent size and (b) the training-sequence length as the dominant factors governing world model quality.
Abstract（参考訳）: 世界モデルは、環境がどのように進化するかを予測する学習された生成モデルであり、サンプル効率のよいロボット学習のための有望なツールとなっている。しかし、環境変動にどれほど頑丈かはいまだに理解されていない。そこで我々は,視覚に基づく四角形ナビゲーションをテストベッド問題として用い,環境ランダム性の異なるレベルにおけるドリーマーV3ベースの世界モデルを訓練し,自己教師付き学習(SSL)と強化学習(RL)の両方にまたがる環境間検証を通じて,すべてのレベルにわたって評価する。そして、すべての世界モデルと関連するナビゲーションポリシーを、見知らぬ環境で実際の四角形にデプロイします。例えば、すべてのセンサーが切断される前に、モデルが実際の感覚入力を2.5秒だけ受信するオープンループランニングなどです。その結果、SSLプリトレーニング中の世界モデルの堅牢性は、sim-to-real転送の強力な予測要因であることが判明した: クロス環境のSSLバリデーションにおいてよく一般化されたモデルはすべて、0.67mという狭いギャップを通り抜け、実際のプラットフォームではシミュレーションポリシー評価を支配したモデルが失敗した。我々はさらに特定する a) 個別の潜伏サイズと潜伏サイズ b)世界モデル品質を規定する主要な要因はトレーニングシーケンスの長さである。

論文の概要: Generalization of World Models under Environmental Variability for Vision-based Quadrotor Navigation

関連論文リスト