Fugu-MT Paper Translation (Abstract): SR-Nav: Spatial Relationships Matter for Zero-shot Object Goal Navigation

Paper abstract: SR-Nav: Spatial Relationships Matter for Zero-shot Object Goal Navigation

arxiv url: http://arxiv.org/abs/2603.18443v1
Date: Thu, 19 Mar 2026 03:09:32 GMT
Status: Translation complete
Last updated in system: 2026-03-20 17:19:05.933424
Title: SR-Nav: Spatial Relationships Matter for Zero-shot Object Goal Navigation
Title (reference translation): SR-Nav: Spatial Relationships for Zero-shot Object Goal Navigation
Authors: Leyuan Fang, Zan Mao, Zijing Wang, Yinlong Yan,
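Abstract summary: Zero-shot object-goal navigation aims to find target objects in unseen environments using only egocentric observations. Recent methods leverage the comprehension and reasoning capabilities of foundation models to improve navigation performance. This work proposes Spatial Relation-aware Navigation (SR-Nav), a framework that models both observed and experience-based spatial relationships to strengthen both perception and planning.
Reference score (attention score computed independently by this site): 23.52929533412924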
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Zero-shot object-goal navigation aims to find target objects in unseen environments using only egocentric observation. Recent methods leverage foundation models' comprehension and reasoning capabilities to enhance navigation performance. However, when faced with poor viewpoints or weak semantic cues, foundation models often fail to support reliable reasoning in both perception and planning, resulting in inefficient or failed navigation. We observe that inherent relationships among objects and regions encode structured scene priors, which help agents infer plausible target locations even under partial observations. Motivated by this insight, we propose Spatial Relation-aware Navigation (SR-Nav), a framework that models both observed and experience-based spatial relationships to enhance both perception and planning. Specifically, SR-Nav first constructs a Dynamic Spatial Relationship Graph (DSRG) that encodes the target-centered spatial relationships through the foundation models and updates dynamically with real-time observations. We then introduce a Relation-aware Matching Module. It utilizes relationship matching instead of naive detection, leveraging diverse relationships in the DSRG to verify and correct errors, enhancing visual perception robustness. Finally, we design a Dynamic Relationship Planning Module to reduce the planning search space by dynamically computing the optimal paths based on the DSRG from the current position, thereby guiding planning and reducing exploration redundancy. Experiments on HM3D show that our method achieves state-of-the-art performance in both success rate and navigation efficiency. The code will be publicly available at https://github.com/Mzyw-1314/SR-Nav
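Abstract (reference translation): Zero-shot object-goal navigation aims to find target objects in unseen environments using only egocentric observations. Recent methods leverage the comprehension and reasoning capabilities of foundation models to improve navigation performance. However, when faced with poor viewpoints or weak semantic cues, foundation models often cannot support reliable reasoning in either perception or planning, which leads to inefficient or failed navigation. The authors observe that the inherent relationships among objects and regions encode structured scene priors, which help an agent infer plausible target locations even under partial observation. Motivated by this insight, they propose Spatial Relation-aware Navigation (SR-Nav), a framework that models both observed and experience-based spatial relationships to strengthen both perception and planning. Specifically, SR-Nav first constructs a Dynamic Spatial Relationship Graph (DSRG) that encodes target-centered spatial relationships through foundation models and is updated dynamically with real-time observations. Next, a Relation-aware Matching Module replaces naive detection with relationship matching, using the diverse relationships in the DSRG to verify and correct errors and thereby make visual perception more robust. Finally, a Dynamic Relationship Planning Module reduces the planning search space by dynamically computing optimal paths from the current position based on the DSRG, which guides planning and reduces redundant exploration. Experiments on HM3D show that the method achieves state-of-the-art performance in both success rate and navigation efficiency. The code will be made publicly available at https://github.com/Mzyw-1314/SR-Nav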
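
As a rough illustration of the ideas in the abstract, the following is a minimal sketch of a target-centered relationship graph that is updated from observations, used to re-score a detection, and used to pick the next place to explore. It is not the authors' implementation: the class names, the relation weights, the 50/50 score blend, and the weight-over-distance ranking are all assumptions made for this example.

```python
from __future__ import annotations
from dataclasses import dataclass, field


@dataclass
class Relation:
    """One edge from the target to a related object or region (hypothetical schema)."""
    anchor: str            # e.g. "sofa" or "living room"
    relation: str          # e.g. "near", "in"
    weight: float          # assumed prior strength in [0, 1]
    observed: bool = False # set once the anchor has been seen


@dataclass
class RelationGraph:
    """Toy stand-in for a target-centered spatial relationship graph."""
    target: str
    relations: dict[str, Relation] = field(default_factory=dict)

    def add_prior(self, anchor: str, relation: str, weight: float) -> None:
        # Experience-based prior; in this sketch it is supplied by hand.
        self.relations[anchor] = Relation(anchor, relation, weight)

    def observe(self, anchor: str) -> None:
        # Dynamic update: mark a known anchor as seen in the current view.
        if anchor in self.relations:
            self.relations[anchor].observed = True

    def match_score(self, detector_conf: float, nearby: set[str]) -> float:
        # Blend raw detector confidence with the share of relation weight
        # supported by objects seen near the candidate (assumed 50/50 blend).
        total = sum(r.weight for r in self.relations.values())
        if total == 0:
            return detector_conf
        support = sum(r.weight for a, r in self.relations.items() if a in nearby)
        return 0.5 * detector_conf + 0.5 * support / total

    def next_anchor(self, distances: dict[str, float]) -> str | None:
        # Rank reachable anchors by weight / (1 + distance); an assumed heuristic.
        candidates = [a for a in self.relations if a in distances]
        if not candidates:
            return None
        return max(candidates,
                   key=lambda a: self.relations[a].weight / (1.0 + distances[a]))


graph = RelationGraph(target="tv")
graph.add_prior("sofa", "near", 0.75)
graph.add_prior("living room", "in", 0.25)
graph.observe("sofa")
print(graph.match_score(0.8, {"sofa"}))
print(graph.next_anchor({"sofa": 4.0, "living room": 1.0}))
```

In this toy setup, a candidate detection gains confidence when strongly related objects are visible around it, and exploration is steered toward anchors that are both strongly related to the target and close to the agent.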

Related papers