Fugu-MT Paper Translation (Abstract): PhysGraph: A Physics-aware 3D Scene Graph for Perception and Reasoning

Paper abstract: PhysGraph: A Physics-aware 3D Scene Graph for Perception and Reasoning

arxiv url: http://arxiv.org/abs/2606.08655v1
Date: Sun, 07 Jun 2026 14:50:44 GMT
Status: Translation complete
Last updated in system: 2026-06-09 14:42:06.318773
Title: PhysGraph: A Physics-aware 3D Scene Graph for Perception and Reasoning
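Title (reference translation): PhysGraph: A Physics-aware 3D Scene Graph for Perception and Reasoning
Authors: Haoyu Li, Aaron Thomas, Shuyan Zhou, Xianyi Cheng
Abstract summary: This paper presents PhysGraph, a framework that unifies symbolic reasoning with structured 3D geometry. PhysGraph achieves state-of-the-art results in semantic segmentation, multi-object mass estimation, and articulation prediction.
Reference score (independently computed attention measure): 24.361346589238035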
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: To perform a wide range of daily tasks, robots need to construct a 3D representation that is semantically rich, physically grounded, and structured enough to support task planning and affordance prediction. However, existing approaches primarily focus on semantic retrieval, often overlooking physical and kinematic factors. Methods that attempt to model physical properties typically rely on narrow training sets or single-object modeling, limiting scalability and generalization across diverse object types. To address these challenges, we present PhysGraph, a framework that unifies symbolic reasoning with structured 3D geometry to model kinematic and physical properties in cluttered scenes. Given RGB-D observations, PhysGraph reconstructs object-centric 3D geometry and associates object instances across views. It then decomposes objects into functional parts and infers materials and articulations through visual reasoning. Evaluated on both synthetic and real-world datasets, PhysGraph achieves state-of-the-art results in semantic segmentation, multi-object mass estimation, and articulation prediction. With its simple yet effective design, PhysGraph produces physically consistent and semantically structured scene graphs, serving as a structured 3D representation for downstream tasks such as constraint-aware 3D affordance prediction and real-to-sim transfer, both of which are demonstrated in our experiments.
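Abstract (reference translation): To perform a wide range of daily tasks, robots need to build a 3D representation that is semantically rich, physically grounded, and structured enough to support task planning and affordance prediction. Existing approaches, however, focus mainly on semantic retrieval and often overlook physical and kinematic factors. Methods that try to model physical properties typically depend on narrow training sets or single-object modeling, which limits scalability and generalization across diverse object types. To address these challenges, we introduce PhysGraph, a framework that unifies symbolic reasoning with structured 3D geometry to model the kinematic and physical properties of cluttered scenes. From RGB-D observations, PhysGraph reconstructs object-centric 3D geometry and associates object instances across views. It then decomposes objects into functional parts and infers materials and articulations through visual reasoning. Evaluated on both synthetic and real-world datasets, PhysGraph achieves state-of-the-art results in semantic segmentation, multi-object mass estimation, and articulation prediction. With a simple yet effective design, PhysGraph produces physically consistent, semantically structured scene graphs that serve as a structured 3D representation for downstream tasks such as constraint-aware 3D affordance prediction and real-to-sim transfer.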
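
Illustrative sketch (not from the paper): the abstract describes a scene graph whose object nodes are decomposed into functional parts carrying materials and articulations. The Python below is one way such a structure could be laid out. Every name in it (`Part`, `Articulation`, `ObjectNode`, `SceneGraph`, `build_example`), the density table, and the density-times-volume mass rule are assumptions made for illustration; the abstract does not state how PhysGraph stores its graph or estimates mass.

```python
from dataclasses import dataclass, field

# Hypothetical material densities in kg/m^3 (rough illustrative values,
# not taken from the paper).
DENSITY_KG_M3 = {"wood": 700.0, "steel": 7850.0, "plastic": 950.0}


@dataclass
class Part:
    """A functional part of an object, with an inferred material."""
    name: str
    material: str
    volume_m3: float

    def mass_kg(self) -> float:
        # Assumed rule: mass = density * volume.
        return DENSITY_KG_M3[self.material] * self.volume_m3


@dataclass
class Articulation:
    """A kinematic link between two parts of the same object."""
    parent: str
    child: str
    joint_type: str  # e.g. "revolute" or "prismatic"


@dataclass
class ObjectNode:
    """An object instance: its parts plus the joints connecting them."""
    label: str
    parts: list[Part] = field(default_factory=list)
    articulations: list[Articulation] = field(default_factory=list)

    def mass_kg(self) -> float:
        return sum(part.mass_kg() for part in self.parts)


@dataclass
class SceneGraph:
    """Object nodes plus (subject, predicate, object) relation edges."""
    objects: dict[str, ObjectNode] = field(default_factory=dict)
    relations: list[tuple[str, str, str]] = field(default_factory=list)

    def add_relation(self, subject: str, predicate: str, obj: str) -> None:
        # Reject edges that mention objects missing from the graph.
        if subject not in self.objects or obj not in self.objects:
            raise KeyError("both ends of a relation must be in the graph")
        self.relations.append((subject, predicate, obj))

    def total_mass_kg(self) -> float:
        return sum(node.mass_kg() for node in self.objects.values())


def build_example() -> SceneGraph:
    """Build a toy scene: a cabinet with a hinged door, and a cup on top."""
    cabinet = ObjectNode(
        label="cabinet",
        parts=[Part("body", "wood", 0.02), Part("door", "wood", 0.005)],
        articulations=[Articulation("body", "door", "revolute")],
    )
    cup = ObjectNode(label="cup", parts=[Part("shell", "plastic", 0.0001)])
    graph = SceneGraph(objects={"cabinet": cabinet, "cup": cup})
    graph.add_relation("cup", "on_top_of", "cabinet")
    return graph
```

Calling `build_example()` returns a two-object graph. A downstream planner could read `graph.relations` to learn that the cup rests on the cabinet, and the door's `revolute` joint to learn how the door may move.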

Related papers