Fugu-MT 論文翻訳(概要): ReScene4D: Temporally Consistent Semantic Instance Segmentation of Evolving Indoor 3D Scenes

論文の概要: ReScene4D: Temporally Consistent Semantic Instance Segmentation of Evolving Indoor 3D Scenes

arxiv url: http://arxiv.org/abs/2601.11508v1
Date: Fri, 16 Jan 2026 18:45:19 GMT
ステータス: 翻訳完了
システム内更新日: 2026-01-19 20:21:50.6014
Title: ReScene4D: Temporally Consistent Semantic Instance Segmentation of Evolving Indoor 3D Scenes
Title（参考訳）: ReScene4D: 進化する屋内3Dシーンの時間的に連続したセマンティックセマンティックセマンティックセグメンテーション
Authors: Emily Steiner, Jianhao Zheng, Henry Howard-Jenkins, Chris Xie, Iro Armeni,
Abstract要約: 時間的にスパースな4D屋内セマンティック・インスタンス・セグメンテーション(SIS)のタスクを導入し、形式化する。本稿では,ReScene4Dを提案する。ReScene4Dは,高密度な観測を必要とせずに,3DSISアーキテクチャを4DSISに適用する新しい手法である。この課題を評価するために、時間的アイデンティティ整合性に報いるため、mAPを拡張した新しい計量 t-mAP を定義する。
参考スコア（独自算出の注目度）: 11.119542051581917
License: http://creativecommons.org/licenses/by-nc-sa/4.0/
Abstract: Indoor environments evolve as objects move, appear, or disappear. Capturing these dynamics requires maintaining temporally consistent instance identities across intermittently captured 3D scans, even when changes are unobserved. We introduce and formalize the task of temporally sparse 4D indoor semantic instance segmentation (SIS), which jointly segments, identifies, and temporally associates object instances. This setting poses a challenge for existing 3DSIS methods, which require a discrete matching step due to their lack of temporal reasoning, and for 4D LiDAR approaches, which perform poorly due to their reliance on high-frequency temporal measurements that are uncommon in the longer-horizon evolution of indoor environments. We propose ReScene4D, a novel method that adapts 3DSIS architectures for 4DSIS without needing dense observations. It explores strategies to share information across observations, demonstrating that this shared context not only enables consistent instance tracking but also improves standard 3DSIS quality. To evaluate this task, we define a new metric, t-mAP, that extends mAP to reward temporal identity consistency. ReScene4D achieves state-of-the-art performance on the 3RScan dataset, establishing a new benchmark for understanding evolving indoor scenes.
Abstract（参考訳）: オブジェクトが動く、現れる、または消えるにつれて、屋内環境は進化する。これらのダイナミクスをキャプチャするには、変更が観測されていない場合でも、断続的にキャプチャされた3Dスキャン間で時間的に一貫したインスタンスIDを維持する必要がある。本研究では,4次元屋内セマンティック・インスタンス・セグメンテーション(SIS)のタスクを導入,形式化し,オブジェクト・インスタンスを共同でセグメント化し,識別し,時間的に関連付ける。この設定は、時間的推論の欠如により離散的なマッチングステップを必要とする既存の3DSIS法と、4D LiDAR法では、屋内環境の長期水平進化において珍しい高周波時間的測定に依存しないため、性能が低下する4D LiDAR法に課題を提起する。本稿では,ReScene4Dを提案する。ReScene4Dは,高密度な観測を必要とせずに,3DSISアーキテクチャを4DSISに適用する新しい手法である。この共有コンテキストは、一貫したインスタンス追跡を可能にするだけでなく、標準的な3DSISの品質も向上する。この課題を評価するために、時間的アイデンティティ整合性に報いるため、mAPを拡張した新しい計量 t-mAP を定義する。 ReScene4Dは3RScanデータセット上で最先端のパフォーマンスを実現し、進化する屋内シーンを理解するための新しいベンチマークを確立する。

論文の概要: ReScene4D: Temporally Consistent Semantic Instance Segmentation of Evolving Indoor 3D Scenes

関連論文リスト