Fugu-MT 論文翻訳(概要): Rein3D: Reinforced 3D Indoor Scene Generation with Panoramic Video Diffusion Models

論文の概要: Rein3D: Reinforced 3D Indoor Scene Generation with Panoramic Video Diffusion Models

arxiv url: http://arxiv.org/abs/2604.10578v2
Date: Tue, 14 Apr 2026 11:24:24 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-15 14:01:13.335762
Title: Rein3D: Reinforced 3D Indoor Scene Generation with Panoramic Video Diffusion Models
Title（参考訳）: Rein3D:パノラマビデオ拡散モデルによる3次元室内シーンの強化
Authors: Dehui Wang, Congsheng Xu, Rong Wei, Yue Shi, Shoufa Chen, Dingxiang Luo, Tianshuo Yang, Xiaokang Yang, Wei Sui, Yusen Qin, Rui Tang, Yao Mu,
Abstract要約: Rein3Dは、完全な360度屋内環境を再構築するフレームワークである。我々は不完全なパノラマ動画のレンダリングに放射状探査戦略を採用している。これらの配列はパノラマビデオ・ビデオ拡散モデルにより復元され、高忠実度幾何やテクスチャを合成するためにビデオ超解像によりさらに拡張される。
参考スコア（独自算出の注目度）: 38.45163624089566
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: The growing demand for Embodied AI and VR applications has highlighted the need for synthesizing high-quality 3D indoor scenes from sparse inputs. However, existing approaches struggle to infer massive amounts of missing geometry in large unseen areas while maintaining global consistency, often producing locally plausible but globally inconsistent reconstructions. We present Rein3D, a framework that reconstructs full 360-degree indoor environments by coupling explicit 3D Gaussian Splatting (3DGS) with temporally coherent priors from video diffusion models. Our approach follows a "restore-and-refine" paradigm: we employ a radial exploration strategy to render imperfect panoramic videos along trajectories starting from the origin, effectively uncovering occluded regions from a coarse 3DGS initialization. These sequences are restored by a panoramic video-to-video diffusion model and further enhanced via video super-resolution to synthesize high-fidelity geometry and textures. Finally, these refined videos serve as pseudo-ground truths to update the global 3D Gaussian field. To support this task, we construct PanoV2V-15K, a dataset of over 15K paired clean and degraded panoramic videos for diffusion-based scene restoration. Experiments demonstrate that Rein3D produces photorealistic and globally consistent 3D scenes and significantly improves long-range camera exploration compared with existing baselines.
Abstract（参考訳）: Embodied AIとVRアプリケーションへの需要が高まり、スパース入力から高品質な3D屋内シーンを合成する必要性が浮き彫りになっている。しかし、既存のアプローチは、大きめの目に見えない領域で大量の欠落した幾何を推測するのに苦労し、大域的な一貫性を維持し、しばしば局所的に可塑性であるが、大域的に一貫性のない再構築を生み出す。ビデオ拡散モデルから時間的コヒーレントな先行情報と明示的な3Dガウススプラッティング(3DGS)を結合することにより,全360度屋内環境を再構築するフレームワークRein3Dを提案する。我々は、原点から始まる不完全なパノラマ動画を軌道に沿ってレンダリングするために放射状探索戦略を採用し、粗い3DGS初期化から隠蔽領域を効果的に発見する。これらの配列はパノラマビデオ・ビデオ拡散モデルにより復元され、高忠実度幾何やテクスチャを合成するためにビデオ超解像によりさらに拡張される。最後に、これらの改良されたビデオは、グローバルな3Dガウスの分野を更新するための偽の地上真実として機能する。この課題を支援するために,拡散型シーン復元のための15K以上のペア・ペア・クリーン・デグレード・パノラマ映像のデータセットであるPanoV2V-15Kを構築した。実験により、Rein3Dはフォトリアリスティックでグローバルに一貫した3Dシーンを生成し、既存のベースラインと比較して長距離カメラの探索を大幅に改善することが示された。

論文の概要: Rein3D: Reinforced 3D Indoor Scene Generation with Panoramic Video Diffusion Models

関連論文リスト