Related papers: Recent Advance in 3D Object and Scene Generation: A Survey

URL: http://arxiv.org/abs/2504.11734v1
Date: Wed, 16 Apr 2025 03:22:06 GMT
Title: Recent Advance in 3D Object and Scene Generation: A Survey
Authors: Xiang Tang, Ruotong Li, Xiaopeng Fan
Abstract summary: This survey aims to provide readers with a structured understanding of state-of-the-art 3D generation technologies. We focus on three dominant paradigms: layout-guided compositional synthesis, 2D prior-based scene generation, and rule-driven modeling.
Score: 14.673302810271219
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: In recent years, the demand for 3D content has grown exponentially with the intelligent upgrading of interactive media, extended reality (XR), and Metaverse industries. To overcome the limitations of traditional manual modeling approaches, such as labor-intensive workflows and prolonged production cycles, revolutionary advances have been achieved through the convergence of novel 3D representation paradigms and artificial intelligence generative technologies. In this survey, we conduct a systematic review of the cutting-edge achievements in static 3D object and scene generation, and establish a comprehensive technical framework through systematic categorization. Specifically, we begin our analysis with mainstream 3D object representations, followed by an in-depth exploration of two principal technical pathways in object generation: data-driven supervised learning methods and deep generative model-based approaches. Regarding scene generation, we focus on three dominant paradigms: layout-guided compositional synthesis, 2D prior-based scene generation, and rule-driven modeling. Finally, we critically examine persistent challenges in 3D generation and propose potential research directions for future investigation. This survey aims to provide readers with a structured understanding of state-of-the-art 3D generation technologies while inspiring researchers to undertake more exploration in this domain.

Related papers

Advances in Feed-Forward 3D Reconstruction and View Synthesis: A Survey [154.50661618628433]
3D reconstruction and view synthesis are foundational problems in computer vision, graphics, and immersive technologies such as augmented reality (AR), virtual reality (VR), and digital twins. Recent advances in feed-forward approaches, driven by deep learning, have revolutionized this field by enabling fast and generalizable 3D reconstruction and view synthesis.
arXiv Detail & Related papers (2025-07-19T06:13:25Z)
3D Shape Generation: A Survey [0.6445605125467574]
Recent advances in deep learning have transformed the field of 3D shape generation. This survey organizes the discussion around three core components: shape representations, generative modeling approaches, and evaluation protocols. We identify open challenges and outline future research directions that could drive progress in controllable, efficient, and high-quality 3D shape generation.
arXiv Detail & Related papers (2025-06-27T23:06:06Z)
3D Scene Generation: A Survey [41.202497008985425]
3D scene generation seeks to synthesize spatially structured, semantically meaningful, and photorealistic environments for applications such as immersive media, robotics, autonomous driving, and embodied AI. This review organizes recent advances in 3D scene generation and highlights promising directions at the intersection of generative AI, 3D vision, and embodied intelligence.
arXiv Detail & Related papers (2025-05-08T17:59:54Z)
Advances in 4D Generation: A Survey [20.285058992203442]
4D generation focuses on creating dynamic 3D assets with consistency based on user input. We summarize five major challenges of 4D generation: consistency, controllability, diversity, efficiency, and fidelity. We provide an in-depth discussion of the obstacles currently hindering the development of 4D generation.
arXiv Detail & Related papers (2025-03-18T17:59:51Z)
3D Representation Methods: A Survey [0.0]
3D representation has experienced significant advancements, driven by the increasing demand for high-fidelity 3D models in various applications. This review examines the development and current state of 3D representation methods, highlighting their research trajectories, innovations, strengths, and weaknesses.
arXiv Detail & Related papers (2024-10-09T02:01:05Z)
3D Gaussian Splatting: Survey, Technologies, Challenges, and Opportunities [57.444435654131006]
3D Gaussian Splatting (3DGS) has emerged as a prominent technique with the potential to become a mainstream method for 3D representations. This survey aims to analyze existing 3DGS-related works from multiple intersecting perspectives.
arXiv Detail & Related papers (2024-07-24T16:53:17Z)
A Survey On Text-to-3D Contents Generation In The Wild [5.875257756382124]
3D content creation plays a vital role in various applications, such as gaming, robotics simulation, and virtual reality. To address this challenge, text-to-3D generation technologies have emerged as a promising solution for automating 3D creation.
arXiv Detail & Related papers (2024-05-15T15:23:22Z)
Scalable Scene Modeling from Perspective Imaging: Physics-based Appearance and Geometry Inference [3.2229099973277076]
This dissertation presents a fraction of contributions that advance 3D scene modeling to its state of the art. In contrast to the prevailing deep learning methods, as a core contribution, this thesis aims to develop algorithms that follow first principles.
arXiv Detail & Related papers (2024-04-01T17:09:40Z)
A Comprehensive Survey of 3D Dense Captioning: Localizing and Describing Objects in 3D Scenes [80.20670062509723]
3D dense captioning is an emerging vision-language bridging task that aims to generate detailed descriptions for 3D scenes. It presents significant potential and challenges due to its closer representation of the real world compared to 2D visual captioning. Despite the popularity and success of existing methods, there is a lack of comprehensive surveys summarizing the advancements in this field.
arXiv Detail & Related papers (2024-03-12T10:04:08Z)
A Comprehensive Survey on 3D Content Generation [148.434661725242]
3D content generation shows both academic and practical value. A new taxonomy is proposed that categorizes existing approaches into three types: 3D native generative methods, 2D prior-based 3D generative methods, and hybrid 3D generative methods.
arXiv Detail & Related papers (2024-02-02T06:20:44Z)
Advances in 3D Generation: A Survey [54.95024616672868]
The field of 3D content generation is developing rapidly, enabling the creation of increasingly high-quality and diverse 3D models. Specifically, we introduce the 3D representations that serve as the backbone for 3D generation. We provide a comprehensive overview of the rapidly growing literature on generation methods, categorized by the type of algorithmic paradigms.
arXiv Detail & Related papers (2024-01-31T13:06:48Z)
Towards AI-Architecture Liberty: A Comprehensive Survey on Design and Generation of Virtual Architecture by Deep Learning [23.58793497403681]
3D shape generation techniques leveraging deep learning have garnered significant interest from both the computer vision and architectural design communities. We review 149 related articles covering architectural design, 3D shape techniques, and virtual environments. We highlight four important enablers of ubiquitous interaction with immersive systems in deep learning-assisted architectural generation.
arXiv Detail & Related papers (2023-04-30T15:38:36Z)
3D Object Detection from Images for Autonomous Driving: A Survey [68.33502122185813]
3D object detection from images is one of the fundamental and challenging problems in autonomous driving. More than 200 works have studied this problem from 2015 to 2021, encompassing a broad spectrum of theories, algorithms, and applications. We provide the first comprehensive survey of this novel and continuously growing research field, summarizing the most commonly used pipelines for image-based 3D detection.
arXiv Detail & Related papers (2022-02-07T07:12:24Z)
Recent Advances in Monocular 2D and 3D Human Pose Estimation: A Deep Learning Perspective [69.44384540002358]
We provide a comprehensive and holistic 2D-to-3D perspective to tackle this problem. We categorize the mainstream and milestone approaches since the year 2014 under unified frameworks. We also summarize the pose representation styles, benchmarks, evaluation metrics, and the quantitative performance of popular approaches.
arXiv Detail & Related papers (2021-04-23T11:07:07Z)

This list is automatically generated from the titles and abstracts of the papers on this site.