Fugu-MT 論文翻訳(概要): Hierarchical Object Representation for Spatial Robot Perception: Points, Meshes, and Superquadrics

論文の概要: Hierarchical Object Representation for Spatial Robot Perception: Points, Meshes, and Superquadrics

arxiv url: http://arxiv.org/abs/2606.01545v1
Date: Mon, 01 Jun 2026 01:45:47 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-02 21:34:29.858645
Title: Hierarchical Object Representation for Spatial Robot Perception: Points, Meshes, and Superquadrics
Title（参考訳）: 空間ロボット知覚のための階層的オブジェクト表現:ポイント,メッシュ,スーパークワッドリック
Authors: Ceng Zhang, Wan Su, Mohamed Samshad, Gregory S. Chirikjian, Rajat Talak,
Abstract要約: 階層型3Dシーングラフ(3DSG)は、長期的な自律性のためのアクション可能でスケーラブルな表現として登場した。高忠実度オブジェクトレベルの再構成に利用できる階層的オブジェクト表現を導入する。提案手法は,現在最先端のオブジェクトベースのマップアライメント手法 ROMAN よりも優れていることを示す。
参考スコア（独自算出の注目度）: 7.735876865294663
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Hierarchical 3D Scene Graphs (3DSG) have emerged as an actionable and scalable representation for long-term autonomy incorporating metric, semantic, and topological information in the scene. However, the question of geometric representation of objects in 3DSG has been overlooked as most methods use simplified geometric models such as partial point clouds or 3D bounding boxes. In this work, we introduce a hierarchical object representation that can be leveraged for high-fidelity object-level reconstruction, object-based robust re-localization or map alignment, and efficient and analytical collision checking for safe robot navigation planning in dense and cluttered environments. The representation is structurally organized into four distinct layers, progressively abstracting the scene from raw sensor data to dense 3D meshes to analytical primitives such as superquadrics, which provide a sparse and analytical representation for object geometry. We develop a pipeline that builds the hierarchical object representation from RGB-D image stream captured by a robot, and demonstrate its working in real-world open-set object scenes in both indoor and outdoor environments. Extensive experiments across diverse datasets including HOPE, ReplicaCAD, Kimera-Multi, and NUS Campus Dataset collected using Unitree B2 Robot validate our pipeline in both indoor and outdoor environments. We show that our superquadric-based map alignment method outperforms the current state-of-the-art object based map alignment method ROMAN. Our code can be found at https://github.com/perceptica-robotics/Hickory.
Abstract（参考訳）: 階層型3Dシーングラフ(3DSG)は、計量、意味、トポロジ情報をシーンに組み込んだ長期的な自律性のためのアクション可能でスケーラブルな表現として登場した。しかし、3DSGにおけるオブジェクトの幾何学的表現の問題は、ほとんどの手法が部分点雲や3D境界ボックスのような単純な幾何学的モデルを用いているため、見過ごされている。本研究では,高忠実度オブジェクトレベルの再構成,オブジェクトベースのロバストな再局在化やマップアライメント,密集した環境下での安全なロボットナビゲーション計画のための効率的かつ解析的な衝突チェックなどに活用できる階層型オブジェクト表現を提案する。この表現は4つの異なる層に構造的に整理され、シーンを生のセンサーデータから高密度の3Dメッシュへ徐々に抽象化し、スーパークワッドリックのような分析的プリミティブへと変換し、オブジェクト幾何学のスパースで分析的な表現を提供する。我々は,ロボットが捉えたRGB-D画像ストリームから階層的オブジェクト表現を構築するパイプラインを開発し,室内環境と屋外環境の両方において,実際のオープンセットオブジェクトシーンで動作することを示す。 HOPE、ReplicaCAD、Kimera-Multi、NUS Campus Datasetといったさまざまなデータセットにわたる大規模な実験は、Unitree B2 Robotを使って収集され、私たちのパイプラインを屋内と屋外の両方の環境で検証しました。提案手法は,現在最先端のオブジェクトベースのマップアライメント手法 ROMAN よりも優れていることを示す。私たちのコードはhttps://github.com/perceptica-robotics/Hickory.comにある。

論文の概要: Hierarchical Object Representation for Spatial Robot Perception: Points, Meshes, and Superquadrics

関連論文リスト