Fugu-MT Paper Translation (Abstract): RelWitness: Open-Vocabulary 3D Scene Graph Generation with Visual-Geometric Relation Witnesses

Paper abstract: RelWitness: Open-Vocabulary 3D Scene Graph Generation with Visual-Geometric Relation Witnesses

arxiv url: http://arxiv.org/abs/2605.20823v1
Date: Wed, 20 May 2026 07:18:56 GMT
Status: Translation complete
Last updated in system: 2026-05-21 19:19:56.548862
Title: RelWitness: Open-Vocabulary 3D Scene Graph Generation with Visual-Geometric Relation Witnesses
Title (reference translation): RelWitness: Open-Vocabulary 3D Scene Graph Generation Using Visual-Geometric Relation Witnesses
Authors: Minh Anh Nguyen, Quang Huy Tran, Bao Ngoc Le, Tuan Kiet Pham, Sui Yang Guang
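Abstract summary: We propose a framework for generating open-vocabulary 3D scene graphs from posed RGB-D sequences under incomplete relation supervision. The key concept is a relation witness: a concrete visual-geometric cue that makes a relation observable in the captured scene. RelWitness constructs relation witness records from RGB views, depth maps, reconstructed 3D geometry, role-sensitive text, object-prior null views, and multi-view consistency.
Reference score (independently computed attention score): 1.3468350096927395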
License: http://creativecommons.org/licenses/by-sa/4.0/
Abstract: Open-vocabulary 3D scene graph generation seeks to describe object instances and their relations with flexible natural-language predicates. The central difficulty is not only vocabulary expansion, but supervision reliability: relation annotations in 3D scene graph datasets are selective, and many valid object-pair relations are unannotated. We propose RelWitness, a framework for open-vocabulary 3D scene graph generation from posed RGB-D sequences under incomplete relation supervision. The key concept is a relation witness: a concrete visual-geometric cue that makes a relation observable in the captured scene. Support relations require contact and vertical ordering; containment requires enclosure; proximity requires metric closeness; orientation requires facing direction; and stable relations should persist across views where both objects are visible. RelWitness constructs relation witness records from RGB views, depth maps, reconstructed 3D geometry, role-sensitive text, object-prior null views, and multi-view consistency. A visual-geometric witness verifier assigns unannotated relation candidates to verified missing positives, reliable negatives, or uncertain unlabeled cases. A witness-guided positive-unlabeled objective then learns from incomplete annotations without turning every missing label into a negative. We further introduce witness-consistent decoding and an RGB-D missing-relation audit protocol. Simulated manuscript-planning experiments on 3DSSG/3RScan and ScanNet-derived open-vocabulary splits show the intended behavior: improved unseen-relation recognition, higher witness precision, lower hallucination, and reduced redundant relation phrases. All numerical results are planning values and must be replaced by reproduced measurements before submission.
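Abstract (reference translation): Open-vocabulary 3D scene graph generation seeks to describe object instances and their relations using flexible natural-language predicates. Relation annotations in 3D scene graph datasets are selective, and many valid object-pair relations are unannotated. The proposed RelWitness is a framework for open-vocabulary 3D scene graph generation from posed RGB-D sequences under incomplete relation supervision. The key concept is a relation witness: a concrete visual-geometric cue that makes a relation observable in the captured scene. Support relations require contact and vertical ordering, containment requires enclosure, proximity requires metric closeness, and orientation requires facing direction. RelWitness constructs relation witness records from RGB views, depth maps, reconstructed 3D geometry, role-sensitive text, object-prior null views, and multi-view consistency. A visual-geometric witness verifier assigns unannotated relation candidates to verified missing positives, reliable negatives, or uncertain unlabeled cases. A witness-guided positive-unlabeled objective learns from incomplete annotations without turning every missing label into a negative. In addition, witness-consistent decoding and an RGB-D missing-relation audit protocol are introduced. Simulated manuscript-planning experiments on 3DSSG/3RScan and ScanNet-derived open-vocabulary splits show the intended behavior: improved unseen-relation recognition, higher witness precision, lower hallucination, and fewer redundant relation phrases. All numerical results are planning values and must be replaced by reproduced measurements before submission.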
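
Illustrative sketch (not from the paper): the abstract names the geometric conditions behind several witnesses (contact and vertical ordering for support, enclosure for containment, metric closeness for proximity) and a verifier that sorts unannotated candidates into three groups. The Python below is one minimal way such checks might look. It assumes axis-aligned bounding boxes in metres with z pointing up. Every function name, tolerance, and threshold here is hypothetical and chosen only for illustration. The orientation witness, the RGB and text cues, and the learned verifier described in the abstract are not modeled.

```python
from dataclasses import dataclass
from typing import Sequence, Tuple

Vec3 = Tuple[float, float, float]


@dataclass(frozen=True)
class Box:
    """Axis-aligned 3D box in metres (assumption: z is the up axis)."""
    lo: Vec3
    hi: Vec3


def _overlap_1d(a_lo: float, a_hi: float, b_lo: float, b_hi: float) -> float:
    """Length of the overlap between two 1D intervals (0 if disjoint)."""
    return max(0.0, min(a_hi, b_hi) - max(a_lo, b_lo))


def support_witness(top: Box, base: Box, contact_tol: float = 0.03) -> bool:
    """Contact plus vertical ordering: top's underside sits on base's top face."""
    footprint = (_overlap_1d(top.lo[0], top.hi[0], base.lo[0], base.hi[0])
                 * _overlap_1d(top.lo[1], top.hi[1], base.lo[1], base.hi[1]))
    vertical_gap = abs(top.lo[2] - base.hi[2])
    return footprint > 0.0 and vertical_gap <= contact_tol


def containment_witness(inner: Box, outer: Box) -> bool:
    """Enclosure: inner lies inside outer along all three axes."""
    return all(outer.lo[i] <= inner.lo[i] and inner.hi[i] <= outer.hi[i]
               for i in range(3))


def box_distance(a: Box, b: Box) -> float:
    """Euclidean gap between two boxes (0 if they touch or overlap)."""
    gaps = [max(0.0, a.lo[i] - b.hi[i], b.lo[i] - a.hi[i]) for i in range(3)]
    return sum(g * g for g in gaps) ** 0.5


def proximity_witness(a: Box, b: Box, max_gap: float = 0.5) -> bool:
    """Metric closeness: the boxes are within max_gap metres of each other."""
    return box_distance(a, b) <= max_gap


def triage(witness_by_view: Sequence[bool], min_views: int = 2,
           pos_ratio: float = 0.8, neg_ratio: float = 0.2) -> str:
    """Sort an unannotated candidate by how consistently its witness holds
    across the views in which both objects are visible (hypothetical rule)."""
    if len(witness_by_view) < min_views:
        return "uncertain_unlabeled"
    ratio = sum(witness_by_view) / len(witness_by_view)
    if ratio >= pos_ratio:
        return "verified_missing_positive"
    if ratio <= neg_ratio:
        return "reliable_negative"
    return "uncertain_unlabeled"


# Toy scene: a cup resting on a table.
cup = Box((0.4, 0.4, 0.75), (0.5, 0.5, 0.85))
table = Box((0.0, 0.0, 0.0), (1.0, 1.0, 0.74))
```

With these toy boxes, `support_witness(cup, table)` and `proximity_witness(cup, table)` hold while `containment_witness(cup, table)` does not, and a candidate whose witness holds in only half of its views stays in the uncertain group rather than being treated as a negative.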
