RoomPlanner: Explicit Layout Planner for Easier LLM-Driven 3D Room Generation
- URL: http://arxiv.org/abs/2511.17048v1
- Date: Fri, 21 Nov 2025 08:47:32 GMT
- Title: RoomPlanner: Explicit Layout Planner for Easier LLM-Driven 3D Room Generation
- Authors: Wenzhuo Sun, Mingjian Liang, Wenxuan Song, Xuelian Cheng, Zongyuan Ge,
- Abstract summary: We propose RoomPlanner, the first fully automatic 3D room generation framework for creating realistic indoor scenes with only short text as input. Our framework can generate explicit layout criteria for rational spatial placement without any manual layout design or panoramic image guidance. Our method can produce geometrically rational 3D indoor scenes, surpassing prior approaches in both rendering speed and visual quality while preserving editability.
- Score: 14.363615726338773
- License: http://creativecommons.org/licenses/by-sa/4.0/
- Abstract: In this paper, we propose RoomPlanner, the first fully automatic 3D room generation framework for painlessly creating realistic indoor scenes with only short text as input. Without any manual layout design or panoramic image guidance, our framework can generate explicit layout criteria for rational spatial placement. We begin by introducing a hierarchical structure of language-driven agent planners that can automatically parse short and ambiguous prompts into detailed scene descriptions. These descriptions include raw spatial and semantic attributes for each object and the background, which are then used to initialize 3D point clouds. To position objects within bounded environments, we implement two arrangement constraints that iteratively optimize spatial arrangements, ensuring a collision-free and accessible layout solution. In the final rendering stage, we propose a novel AnyReach Sampling strategy for camera trajectory, along with the Interval Timestep Flow Sampling (ITFS) strategy, to efficiently optimize the coarse 3D Gaussian scene representation. These approaches help reduce the total generation time to under 30 minutes. Extensive experiments demonstrate that our method can produce geometrically rational 3D indoor scenes, surpassing prior approaches in both rendering speed and visual quality while preserving editability. The code will be available soon.
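The abstract describes arrangement constraints that iteratively optimize object positions inside a bounded room until the layout is collision-free. The paper's code is not yet released, so the following is only a minimal sketch of that general idea, not the authors' method: it assumes objects are reduced to 2D axis-aligned footprints on the floor plane and ignores the accessibility constraint. The names `Box`, `overlap`, `clamp_to_room`, and `resolve_layout` are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Box:
    """Axis-aligned floor footprint of one object (hypothetical helper type)."""
    name: str
    x: float  # center x in meters
    y: float  # center y in meters
    w: float  # extent along x
    d: float  # extent along y


def overlap(a, b):
    """Return penetration depths (ox, oy), or None if the boxes do not intersect."""
    ox = (a.w + b.w) / 2 - abs(a.x - b.x)
    oy = (a.d + b.d) / 2 - abs(a.y - b.y)
    return (ox, oy) if ox > 0 and oy > 0 else None


def clamp_to_room(box, room_w, room_d):
    """Keep the whole footprint inside the room rectangle [0, room_w] x [0, room_d]."""
    box.x = min(max(box.x, box.w / 2), room_w - box.w / 2)
    box.y = min(max(box.y, box.d / 2), room_d - box.d / 2)


def resolve_layout(boxes, room_w, room_d, max_iters=200, eps=1e-3):
    """Iteratively push overlapping boxes apart; True if a collision-free layout is found."""
    for _ in range(max_iters):
        # Boundary constraint: applied first so a clean pass implies a valid layout.
        for box in boxes:
            clamp_to_room(box, room_w, room_d)
        moved = False
        for i in range(len(boxes)):
            for j in range(i + 1, len(boxes)):
                a, b = boxes[i], boxes[j]
                pen = overlap(a, b)
                if pen is None:
                    continue
                moved = True
                ox, oy = pen
                # Collision constraint: separate along the axis of least penetration.
                if ox <= oy:
                    sign = 1.0 if a.x <= b.x else -1.0
                    a.x -= sign * (ox / 2 + eps)
                    b.x += sign * (ox / 2 + eps)
                else:
                    sign = 1.0 if a.y <= b.y else -1.0
                    a.y -= sign * (oy / 2 + eps)
                    b.y += sign * (oy / 2 + eps)
        if not moved:
            return True
    return False
```

Calling `resolve_layout(boxes, room_w, room_d)` mutates the boxes in place and reports whether the iteration settled within `max_iters`. A crowded room may have no valid layout, in which case the function returns `False` rather than looping forever.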
Related papers
- PSGS: Text-driven Panorama Sliding Scene Generation via Gaussian Splatting [18.048020748522312]
We propose PSGS, a framework for high-fidelity panoramic scene generation.
First, a novel two-layer optimization architecture generates semantically coherent panoramas.
Second, our panorama sliding mechanism initializes globally consistent 3D Gaussian Splatting point clouds.
arXiv Detail & Related papers (2026-01-31T02:34:46Z)
- HLG: Comprehensive 3D Room Construction via Hierarchical Layout Generation [31.010614667725843]
Hierarchical Layout Generation (HLG) is a novel method for fine-grained 3D scene generation.
HLG is the first to adopt a coarse-to-fine hierarchical approach, refining scene layouts from large-scale furniture placement to intricate object arrangements.
We show superior performance in generating realistic indoor scenes compared to existing methods.
arXiv Detail & Related papers (2025-08-25T09:32:57Z)
- OptiScene: LLM-driven Indoor Scene Layout Generation via Scaled Human-aligned Data Synthesis and Multi-Stage Preference Optimization [54.60030826635478]
Existing indoor layout generation methods fall into two categories: prompt-driven and learning-based.
We present 3D-SynthPlace, a large-scale dataset that combines synthetic layouts generated via a 'GPT synthesize, Human inspect' pipeline.
We introduce OptiScene, a strong open-source LLM optimized for indoor layout generation.
arXiv Detail & Related papers (2025-06-09T09:13:06Z)
- CHOrD: Generation of Collision-Free, House-Scale, and Organized Digital Twins for 3D Indoor Scenes with Controllable Floor Plans and Optimal Layouts [21.63819006421225]
We introduce CHOrD, a novel framework for scalable synthesis of 3D indoor scenes.
CHOrD creates house-scale, collision-free, and hierarchically structured indoor digital twins.
arXiv Detail & Related papers (2025-03-15T02:05:10Z)
- Layout2Scene: 3D Semantic Layout Guided Scene Generation via Geometry and Appearance Diffusion Priors [52.63385546943866]
We present a text-to-scene generation method (namely, Layout2Scene) using additional semantic layout as the prompt to inject precise control of 3D object positions.
To fully leverage 2D diffusion priors in geometry and appearance generation, we introduce a semantic-guided geometry diffusion model and a semantic-geometry guided diffusion model.
Our method can generate more plausible and realistic scenes as compared to state-of-the-art approaches.
arXiv Detail & Related papers (2025-01-05T12:20:13Z)
- Prim2Room: Layout-Controllable Room Mesh Generation from Primitives [90.5012354166981]
Prim2Room is a framework for controllable room mesh generation leveraging 2D layout conditions and 3D primitive retrieval.
We introduce an adaptive viewpoint selection algorithm that allows the system to generate the furniture texture and geometry from more favorable views.
Our method not only enhances the accuracy and aesthetic appeal of generated 3D scenes but also provides a user-friendly platform for detailed room design.
arXiv Detail & Related papers (2024-09-09T07:25:47Z)
- GALA3D: Towards Text-to-3D Complex Scene Generation via Layout-guided Generative Gaussian Splatting [52.150502668874495]
We present GALA3D, generative 3D GAussians with LAyout-guided control, for effective compositional text-to-3D generation.
GALA3D is a user-friendly, end-to-end framework for state-of-the-art scene-level 3D content generation and controllable editing.
arXiv Detail & Related papers (2024-02-11T13:40:08Z)
- SceneWiz3D: Towards Text-guided 3D Scene Composition [134.71933134180782]
Existing approaches either leverage large text-to-image models to optimize a 3D representation or train 3D generators on object-centric datasets.
We introduce SceneWiz3D, a novel approach to synthesize high-fidelity 3D scenes from text.
arXiv Detail & Related papers (2023-12-13T18:59:30Z)
- 3D Sketch-aware Semantic Scene Completion via Semi-supervised Structure Prior [50.73148041205675]
The goal of the Semantic Scene Completion (SSC) task is to simultaneously predict a completed 3D voxel representation of volumetric occupancy and semantic labels of objects in the scene from a single-view observation.
We propose to devise a new geometry-based strategy to embed depth information with low-resolution voxel representation.
Our proposed geometric embedding works better than the depth feature learning from habitual SSC frameworks.
arXiv Detail & Related papers (2020-03-31T09:33:46Z)
- Scan2Plan: Efficient Floorplan Generation from 3D Scans of Indoor Scenes [9.71137838903781]
Scan2Plan is a novel approach for accurate estimation of a floorplan from a 3D scan of the structural elements of indoor environments.
The proposed method incorporates a two-stage approach where the initial stage clusters an unordered point cloud representation of the scene.
The subsequent stage estimates a closed perimeter, parameterized by a simple polygon, for each individual room.
The final floorplan is simply an assembly of all such room perimeters in the global co-ordinate system.
arXiv Detail & Related papers (2020-03-16T17:59:41Z)
This list is automatically generated from the titles and abstracts of the papers in this site.