Fugu-MT Paper Translation (Abstract): R2F: Repurposing Ray Frontiers for LLM-free Object Navigation

Paper overview: R2F: Repurposing Ray Frontiers for LLM-free Object Navigation

arxiv url: http://arxiv.org/abs/2603.08475v1
Date: Mon, 09 Mar 2026 15:10:10 GMT
Status: Translation complete
Last updated in system: 2026-03-10 15:13:16.283785
Title: R2F: Repurposing Ray Frontiers for LLM-free Object Navigation
Authors: Francesco Argenziano, John Mark Alexis Marcelo, Michele Brienza, Abdel Hakim Drid, Emanuele Musumeci, Daniele Nardi, Domenico D. Bloisi, Vincenzo Suriani
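Abstract summary: Vision-Language Models (VLMs) and Large Language Models (LLMs) are now widely used as high-level decision-makers rather than as end-to-end policies. This work develops an LLM-free framework for indoor open-vocabulary object navigation. Experiments in Habitat-sim and on a real robotic platform demonstrate competitive state-of-the-art zero-shot performance with real-time execution.
Reference score (attention score computed independently by this site): 1.4755786263360526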
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Zero-shot open-vocabulary object navigation has progressed rapidly with the emergence of large Vision-Language Models (VLMs) and Large Language Models (LLMs), now widely used as high-level decision-makers instead of end-to-end policies. Although effective, such systems often rely on iterative large-model queries at inference time, introducing latency and computational overhead that limit real-time deployment. To address this problem, we repurpose ray frontiers (R2F), a recently proposed frontier-based exploration paradigm, to develop an LLM-free framework for indoor open-vocabulary object navigation. While ray frontiers were originally used to bias exploration using semantic cues carried along rays, we reinterpret frontier regions as explicit, direction-conditioned semantic hypotheses that serve as navigation goals. Language-aligned features accumulated along out-of-range rays are stored sparsely at frontiers, where each region maintains multiple directional embeddings encoding plausible unseen content. In this way, navigation then reduces to embedding-based frontier scoring and goal tracking within a classical mapping and planning pipeline, eliminating iterative large-model reasoning. We further introduce R2F-VLN, a lightweight extension for free-form language instructions using syntactic parsing and relational verification without additional VLM or LLM components. Experiments in Habitat-sim and on a real robotic platform demonstrate competitive state-of-the-art zero-shot performance with real-time execution, achieving up to 6 times faster runtime than VLM-based alternatives.
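
Illustrative sketch (not from the paper): the abstract describes navigation as embedding-based frontier scoring, where each frontier region holds several directional embeddings. The Python below is a minimal sketch of that idea under stated assumptions. It assumes cosine similarity against a query embedding and a max over each frontier's directions; the abstract specifies neither the similarity measure nor the aggregation rule. The names `Frontier`, `score_frontier`, and `select_goal` are hypothetical, and the toy 3-D vectors stand in for real language-aligned features.

```python
import math
from dataclasses import dataclass, field


def cosine(a, b):
    """Cosine similarity of two equal-length vectors; 0.0 if either is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


@dataclass
class Frontier:
    """Hypothetical frontier record: a map position plus one embedding per ray direction."""
    position: tuple
    directional_embeddings: list = field(default_factory=list)


def score_frontier(frontier, query_embedding):
    """Score a frontier by its best-matching direction (max rule is an assumption)."""
    if not frontier.directional_embeddings:
        return float("-inf")
    return max(cosine(e, query_embedding) for e in frontier.directional_embeddings)


def select_goal(frontiers, query_embedding):
    """Return the highest-scoring frontier, or None when no frontiers exist."""
    if not frontiers:
        return None
    return max(frontiers, key=lambda f: score_frontier(f, query_embedding))


# Toy data: the query points along the first axis.
query = [1.0, 0.0, 0.0]
frontiers = [
    Frontier((0.0, 2.0), [[0.0, 1.0, 0.0], [0.2, 0.9, 0.1]]),
    Frontier((3.0, 1.0), [[0.9, 0.1, 0.0], [0.0, 0.0, 1.0]]),
]
goal = select_goal(frontiers, query)
print(goal.position)  # (3.0, 1.0)
```

In a full pipeline, the selected frontier position would be handed to a classical planner as the navigation goal and re-scored as the map updates; that part is omitted here.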


Related papers