Fugu-MT 論文翻訳(概要): WildOS: Open-Vocabulary Object Search in the Wild

論文の概要: WildOS: Open-Vocabulary Object Search in the Wild

arxiv url: http://arxiv.org/abs/2602.19308v1
Date: Sun, 22 Feb 2026 19:14:00 GMT
ステータス: 翻訳完了
システム内更新日: 2026-02-24 17:42:02.578048
Title: WildOS: Open-Vocabulary Object Search in the Wild
Title（参考訳）: WildOS: オープン語彙オブジェクト検索
Authors: Hardik Shah, Erica Tevere, Deegan Atha, Marcel Kaufmann, Shehryar Khattak, Manthan Patel, Marco Hutter, Jonas Frey, Patrick Spieler,
Abstract要約: この研究は、長距離でオープンなオブジェクト検索のための統一システムWildOSを提示する。基礎モデルに基づく視覚モジュール ExploRFM を用いて、グラフのフロンティアノードをスコアリングする。また、オープン語彙ターゲットクエリの粗い局所化のための粒子フィルタに基づく手法も導入する。
参考スコア（独自算出の注目度）: 12.098091049832965
License: http://creativecommons.org/licenses/by-sa/4.0/
Abstract: Autonomous navigation in complex, unstructured outdoor environments requires robots to operate over long ranges without prior maps and limited depth sensing. In such settings, relying solely on geometric frontiers for exploration is often insufficient. In such settings, the ability to reason semantically about where to go and what is safe to traverse is crucial for robust, efficient exploration. This work presents WildOS, a unified system for long-range, open-vocabulary object search that combines safe geometric exploration with semantic visual reasoning. WildOS builds a sparse navigation graph to maintain spatial memory, while utilizing a foundation-model-based vision module, ExploRFM, to score frontier nodes of the graph. ExploRFM simultaneously predicts traversability, visual frontiers, and object similarity in image space, enabling real-time, onboard semantic navigation tasks. The resulting vision-scored graph enables the robot to explore semantically meaningful directions while ensuring geometric safety. Furthermore, we introduce a particle-filter-based method for coarse localization of the open-vocabulary target query, that estimates candidate goal positions beyond the robot's immediate depth horizon, enabling effective planning toward distant goals. Extensive closed-loop field experiments across diverse off-road and urban terrains demonstrate that WildOS enables robust navigation, significantly outperforming purely geometric and purely vision-based baselines in both efficiency and autonomy. Our results highlight the potential of vision foundation models to drive open-world robotic behaviors that are both semantically informed and geometrically grounded. Project Page: https://leggedrobotics.github.io/wildos/
Abstract（参考訳）: 複雑で非構造的な屋外環境での自律的なナビゲーションは、ロボットが事前の地図や限定的な深度検知なしで長距離で操作する必要がある。このような環境では、探索のための幾何学的フロンティアのみに頼ることは、しばしば不十分である。このような環境では、どこへ行くべきか、何が安全なのかを意味論的に推論する能力は、堅牢で効率的な探索に不可欠である。この研究は、安全な幾何学的探索と意味的な視覚的推論を組み合わせた、長距離でオープンなオブジェクト探索のための統合システムWildOSを提示する。 WildOSは空間記憶を維持するためにスパースナビゲーショングラフを構築し、基盤モデルベースのビジョンモジュールであるExpploRFMを使用してグラフのフロンティアノードをスコアする。 ExploRFMは、画像空間におけるトラバーサビリティ、ビジュアルフロンティア、オブジェクト類似性を同時に予測し、リアルタイム、オンボードのセマンティックナビゲーションタスクを可能にする。結果として得られる視覚マーク付きグラフにより、ロボットは幾何学的安全性を確保しながら意味論的に意味のある方向を探索することができる。さらに,ロボットの直近の深度地平線を超えた候補目標位置を推定し,遠隔の目標に向けて効果的な計画立案を可能にする,オープン語彙ターゲットクエリの粗い局所化のための粒子フィルタに基づく手法を提案する。様々なオフロードおよび都市地形にわたる大規模なクローズドループフィールド実験は、WildOSが堅牢なナビゲーションを可能にし、純粋に幾何学的かつ純粋に視覚に基づくベースラインを効率と自律性の両方で著しく上回っていることを示している。本研究は,視覚基盤モデルが意味的情報と幾何学的基礎を持つオープンワールドロボット行動を促進する可能性を強調した。 Project Page: https://leggedrobotics.github.io/wildos/

論文の概要: WildOS: Open-Vocabulary Object Search in the Wild

関連論文リスト