Fugu-MT 論文翻訳(概要): The Neural Compass: Probabilistic Relative Feature Fields for Robotic Search

論文の概要: The Neural Compass: Probabilistic Relative Feature Fields for Robotic Search

arxiv url: http://arxiv.org/abs/2603.08544v1
Date: Mon, 09 Mar 2026 16:11:05 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-10 15:13:16.395945
Title: The Neural Compass: Probabilistic Relative Feature Fields for Robotic Search
Title（参考訳）: ニューラルコンパス:ロボット検索のための確率的相対的特徴場
Authors: Gabriele Somaschini, Adrian Röfer, Abhinav Valada,
Abstract要約: 本稿では,事前学習された視覚言語モデルから得られた特徴量の相対分布を予測するために訓練された特徴場モデルProReFFを提案する。本稿では,ProReFFが自然の場面で有意な相対的特徴分布を捉えていることを示す。提案するエージェントは,最強のベースラインよりも20%効率が高く,最大80%の人的性能を達成できる。
参考スコア（独自算出の注目度）: 16.604293152314995
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Object co-occurrences provide a key cue for finding objects successfully and efficiently in unfamiliar environments. Typically, one looks for cups in kitchens and views fridges as evidence of being in a kitchen. Such priors have also been exploited in artificial agents, but they are typically learned from explicitly labeled data or queried from language models. It is still unclear whether these relations can be learned implicitly from unlabeled observations alone. In this work, we address this problem and propose ProReFF, a feature field model trained to predict relative distributions of features obtained from pre-trained vision language models. In addition, we introduce a learning-based strategy that enables training from unlabeled and potentially contradictory data by aligning inconsistent observations into a coherent relative distribution. For the downstream object search task, we propose an agent that leverages predicted feature distributions as a semantic prior to guide exploration toward regions with a high likelihood of containing the object. We present extensive evaluations demonstrating that ProReFF captures meaningful relative feature distributions in natural scenes and provides insight into the impact of our proposed alignment step. We further evaluate the performance of our search agent in 100 challenges in the Matterport3D simulator, comparing with feature-based baselines and human participants. The proposed agent is 20% more efficient than the strongest baseline and achieves up to 80% of human performance.
Abstract（参考訳）: オブジェクトの共起は、不慣れな環境でオブジェクトを正しく効率的に見つけるための重要な手がかりを提供する。通常、台所のカップを探し、冷蔵庫を台所にいる証拠とみなす。このような先入観は人工エージェントにも利用されてきたが、典型的には明示的にラベル付けされたデータから学習されるか、言語モデルからクエリされる。これらの関係が、ラベルなしの観測だけで暗黙的に学べるかどうかはまだ不明である。本研究では,この問題に対処し,事前学習された視覚言語モデルから得られる特徴の相対分布を予測するために訓練された特徴場モデルProReFFを提案する。さらに,不整合観測を一貫性のある相対分布に整合させることで,ラベルのない,潜在的に矛盾するデータからのトレーニングを可能にする学習ベースの戦略を導入する。下流オブジェクト探索タスクでは、予測特徴分布を意味として活用し、対象を含む可能性の高い領域への探索を誘導するエージェントを提案する。本稿では,ProReFFが自然の場面で有意な特徴分布を捉えることを実証し,提案するアライメントステップの影響について考察する。そこで我々は,Matterport3Dシミュレータにおける100の課題において,検索エージェントの性能をさらに評価し,特徴に基づくベースラインと人間の参加者との比較を行った。提案するエージェントは,最強のベースラインよりも20%効率が高く,最大80%の人的性能を達成できる。

論文の概要: The Neural Compass: Probabilistic Relative Feature Fields for Robotic Search

関連論文リスト