Fugu-MT Paper Translation (Summary): I-Scene: 3D Instance Models are Implicit Generalizable Spatial Learners

Paper summary: I-Scene: 3D Instance Models are Implicit Generalizable Spatial Learners

arxiv url: http://arxiv.org/abs/2512.13683v1
Date: Mon, 15 Dec 2025 18:59:13 GMT
Status: Translation complete
Last updated in system: 2025-12-16 17:54:56.830338
Title: I-Scene: 3D Instance Models are Implicit Generalizable Spatial Learners
Title (reference translation): I-Scene: 3D instance models are implicit, generalizable spatial learners
Authors: Lu Ling, Yunhao Ge, Yichen Sheng, Aniket Bera
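Abstract summary: Generalization remains the central challenge for interactive 3D scene generation. We reprogram a pre-trained 3D instance generator to act as a scene-level learner. We show that spatial reasoning still emerges even when the training scenes consist of randomly composed objects.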
Reference score (independently computed attention level): 21.18471823625016
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Generalization remains the central challenge for interactive 3D scene generation. Existing learning-based approaches ground spatial understanding in limited scene datasets, restricting generalization to new layouts. We instead reprogram a pre-trained 3D instance generator to act as a scene-level learner, replacing dataset-bounded supervision with model-centric spatial supervision. This reprogramming unlocks the generator's transferable spatial knowledge, enabling generalization to unseen layouts and novel object compositions. Remarkably, spatial reasoning still emerges even when the training scenes are randomly composed objects. This demonstrates that the generator's transferable scene prior provides a rich learning signal for inferring proximity, support, and symmetry from purely geometric cues. Replacing the widely used canonical space, we instantiate this insight with a view-centric formulation of the scene space, yielding a fully feed-forward, generalizable scene generator that learns spatial relations directly from the instance model. Quantitative and qualitative results show that a 3D instance generator is an implicit spatial learner and reasoner, pointing toward foundation models for interactive 3D scene understanding and generation. Project page: https://luling06.github.io/I-Scene-project/
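Abstract (reference translation): Generalization remains the central challenge in interactive 3D scene generation. Existing learning-based approaches ground their spatial understanding in limited scene datasets, which restricts generalization to new layouts. Instead, we reprogram a pre-trained 3D instance generator to act as a scene-level learner, replacing dataset-bounded supervision with model-centric spatial supervision. This reprogramming unlocks the generator's transferable spatial knowledge and enables generalization to unseen layouts and novel object compositions. Notably, spatial reasoning still emerges even when the training scenes are randomly composed objects. This shows that the generator's transferable scene prior provides a rich learning signal for inferring proximity, support, and symmetry from purely geometric cues. We instantiate this insight with a view-centric formulation of the scene space, in place of the widely used canonical space, yielding a fully feed-forward, generalizable scene generator that learns spatial relations directly from the instance model. Quantitative and qualitative results show that a 3D instance generator is an implicit spatial learner and reasoner, pointing toward foundation models for interactive 3D scene understanding and generation. Project page: https://luling06.github.io/I-Scene-project/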

Related papers