Fugu-MT 論文翻訳(概要): Seeing Fast and Slow: Bimodal 3D Scene Graphs for Open-set Tasks

論文の概要: Seeing Fast and Slow: Bimodal 3D Scene Graphs for Open-set Tasks

arxiv url: http://arxiv.org/abs/2605.31067v2
Date: Tue, 02 Jun 2026 08:08:50 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-03 18:57:50.256494
Title: Seeing Fast and Slow: Bimodal 3D Scene Graphs for Open-set Tasks
Title（参考訳）: 高速かつスローに見る: オープンセットタスクのためのバイモーダルな3Dシーングラフ
Authors: Marcel Bartholomeus Prasetyo, Shrutika Vishal Thengane, A Manicka Praveen, Yi Loo, Malika Meghjani,
Abstract要約: BiMoSGは、オープンセットタスクのためのバイモーダルな3Dシーングラフ生成アプローチである。提案する3次元シーングラフ生成手法は,オープンソースの最先端手法よりもはるかに高速であることを示す。
参考スコア（独自算出の注目度）: 2.5641128800447937
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Open-set task execution can significantly benefit from seamlessly switching between coarse and fine scene representations depending on the context and the evolving information as the robot explores the environment. For example, it is often sufficient to start with a coarse scene representation initially and only employ a finer, more granular scene representation when the robot encounters regions which are likely to contain the task relevant objects. Hence, in this work, we propose BiMoSG, a bimodal 3D scene graph generation approach for open-set tasks. BiMoSG employs a "fast" mode by default to efficiently generate a coarse 3D scene graph and can switch to a "slow" mode for generating a finer open vocabulary 3D scene graph of task relevant objects. We demonstrate that our proposed 3D scene graph generation approach is significantly faster than the open-source state-of-the-art approaches. This allows us to integrate the scene graph generation process with task execution for real-time deployment.
Abstract（参考訳）: オープンセットタスク実行は、ロボットが環境を探索する際に、コンテキストや進化する情報に応じて、粗いシーン表現と細かなシーン表現をシームレスに切り替えることの恩恵を受ける。例えば、まずは粗いシーン表現から始めるのに十分であり、ロボットがタスク関連オブジェクトを含む可能性のある領域に遭遇したときには、より微細で粒度の細かいシーン表現のみを使用する。そこで本研究では,オープンセットタスクのためのバイモーダル3次元シーングラフ生成手法であるBiMoSGを提案する。 BiMoSGはデフォルトで「高速」モードを使用して、粗い3Dシーングラフを効率よく生成し、タスク関連オブジェクトのより細かいオープンな3Dシーングラフを生成する「スロー」モードに切り替えることができる。提案する3次元シーングラフ生成手法は,オープンソースの最先端手法よりもはるかに高速であることを示す。これにより、シーングラフ生成プロセスとタスク実行を統合することで、リアルタイムデプロイメントを可能にします。

論文の概要: Seeing Fast and Slow: Bimodal 3D Scene Graphs for Open-set Tasks

関連論文リスト