Fugu-MT 論文翻訳(概要): DiffInDScene: Diffusion-based High-Quality 3D Indoor Scene Generation

論文の概要: DiffInDScene: Diffusion-based High-Quality 3D Indoor Scene Generation

arxiv url: http://arxiv.org/abs/2306.00519v4
Date: Wed, 29 Nov 2023 03:55:48 GMT
ステータス: 翻訳完了
システム内更新日: 2023-12-01 13:04:38.572910
Title: DiffInDScene: Diffusion-based High-Quality 3D Indoor Scene Generation
Title（参考訳）: DiffInDScene:拡散に基づく高品質3D室内シーン生成
Authors: Xiaoliang Ju, Zhaoyang Huang, Yijin Li, Guofeng Zhang, Yu Qiao, Hongsheng Li
Abstract要約: DiffInDSceneは高品質な屋内シーン生成問題に取り組むための新しいフレームワークである。 KinectFusionのインクリメンタルアライメントとローカルTSDFボリュームの融合に着想を得て,拡散型SDF融合法を提案する。生成した結果から,3次元空間で高品位な部屋生成を実現することが可能であることが示唆された。
参考スコア（独自算出の注目度）: 52.57289166214808
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: We present DiffInDScene, a novel framework for tackling the problem of high-quality 3D indoor scene generation, which is challenging due to the complexity and diversity of the indoor scene geometry. Although diffusion-based generative models have previously demonstrated impressive performance in image generation and object-level 3D generation, they have not yet been applied to room-level 3D generation due to their computationally intensive costs. In DiffInDScene, we propose a cascaded 3D diffusion pipeline that is efficient and possesses strong generative performance for Truncated Signed Distance Function (TSDF). The whole pipeline is designed to run on a sparse occupancy space in a coarse-to-fine fashion. Inspired by KinectFusion's incremental alignment and fusion of local TSDF volumes, we propose a diffusion-based SDF fusion approach that iteratively diffuses and fuses local TSDF volumes, facilitating the generation of an entire room environment. The generated results demonstrate that our work is capable to achieve high-quality room generation directly in three-dimensional space, starting from scratch. In addition to the scene generation, the final part of DiffInDScene can be used as a post-processing module to refine the 3D reconstruction results from multi-view stereo. According to the user study, the mesh quality generated by our DiffInDScene can even outperform the ground truth mesh provided by ScanNet. Please visit our project page for the latest progress and demonstrations: https://github.com/AkiraHero/diffindscene.
Abstract（参考訳）: DiffInDSceneは高品質な屋内シーン生成問題に対処するための新しいフレームワークであり、室内シーンの複雑さと多様性のため課題である。拡散型生成モデルは画像生成やオブジェクトレベルの3D生成において,これまでは顕著な性能を示してきたが,計算集約コストのため,まだ部屋レベルの3D生成には適用されていない。差分法では,tsdf (truncated signed distance function) の高効率かつ強力な生成性能を有するカスケード3次元拡散パイプラインを提案する。パイプライン全体は、粗大な方法でスパース占有空間で実行されるように設計されている。 KinectFusionのインクリメンタルアライメントと局所TSDFボリュームの融合にインスパイアされた本研究では,局所TSDFボリュームを反復的に拡散・融合させる拡散型SDF融合手法を提案する。生成した結果は,スクラッチから始めて,高品位な部屋生成を3次元空間で直接実現できることを実証した。シーン生成に加えて、DiffInDSceneの最終部分は、マルチビューステレオから3D再構成結果を洗練するための後処理モジュールとして使用できる。ユーザ調査によると、私たちのDiffInDSceneによって生成されたメッシュ品質は、ScanNetが提供する地上の真理メッシュよりも優れています。最新の進捗とデモについては、プロジェクトページを参照してください。

論文の概要: DiffInDScene: Diffusion-based High-Quality 3D Indoor Scene Generation

関連論文リスト