FullPart: Generating each 3D Part at Full Resolution
- URL: http://arxiv.org/abs/2510.26140v1
- Date: Thu, 30 Oct 2025 04:51:05 GMT
- Title: FullPart: Generating each 3D Part at Full Resolution
- Authors: Lihe Ding, Shaocong Dong, Yaokun Li, Chenjian Gao, Xiao Chen, Rui Han, Yihao Kuang, Hong Zhang, Bo Huang, Zhanpeng Huang, Zibin Wang, Dan Xu, Tianfan Xue,
- Abstract summary: FullPart is a novel framework that combines both implicit and explicit paradigms.<n>It generates detailed parts, each within its own fixed full-resolution voxel grid.<n>To tackle the scarcity of reliable part data, we present PartVerse-XL, the largest human-annotated 3D part dataset to date with 40K objects and 320K parts.
- Score: 33.98986324893688
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Part-based 3D generation holds great potential for various applications. Previous part generators that represent parts using implicit vector-set tokens often suffer from insufficient geometric details. Another line of work adopts an explicit voxel representation but shares a global voxel grid among all parts; this often causes small parts to occupy too few voxels, leading to degraded quality. In this paper, we propose FullPart, a novel framework that combines both implicit and explicit paradigms. It first derives the bounding box layout through an implicit box vector-set diffusion process, a task that implicit diffusion handles effectively since box tokens contain little geometric detail. Then, it generates detailed parts, each within its own fixed full-resolution voxel grid. Instead of sharing a global low-resolution space, each part in our method - even small ones - is generated at full resolution, enabling the synthesis of intricate details. We further introduce a center-point encoding strategy to address the misalignment issue when exchanging information between parts of different actual sizes, thereby maintaining global coherence. Moreover, to tackle the scarcity of reliable part data, we present PartVerse-XL, the largest human-annotated 3D part dataset to date with 40K objects and 320K parts. Extensive experiments demonstrate that FullPart achieves state-of-the-art results in 3D part generation. We will release all code, data, and model to benefit future research in 3D part generation.
Related papers
- PartNeXt: A Next-Generation Dataset for Fine-Grained and Hierarchical 3D Part Understanding [65.55036443711528]
PartNeXt is a next-generation dataset with over 23,000 high-quality, textured 3D models annotated with fine-grained, hierarchical part labels across 50 categories.<n>We benchmark PartNeXt on two tasks: (1) class-agnostic part segmentation, where state-of-the-art methods struggle with fine-grained and leaf-level parts, and (2) 3D part-centric question answering, a new benchmark for 3D-LLMs that reveals significant gaps in open-vocabulary part grounding.
arXiv Detail & Related papers (2025-10-23T03:06:08Z) - X-Part: high fidelity and structure coherent shape decomposition [21.08914191612629]
X-Part is a controllable generative model designed to decompose a holistic 3D object into semantically meaningful parts.<n>X-Part exploits the bounding box as prompts for the part generation and injects point-wise semantic features for meaningful decomposition.<n>This work establishes a new paradigm for creating production-ready, editable, and structurally sound 3D assets.
arXiv Detail & Related papers (2025-09-10T14:37:02Z) - Efficient Part-level 3D Object Generation via Dual Volume Packing [51.712083185732894]
We propose a new end-to-end framework for part-level 3D object generation.<n>Our method generates high-quality 3D objects with an arbitrary number of complete and semantically meaningful parts.
arXiv Detail & Related papers (2025-06-11T17:55:03Z) - HoloPart: Generative 3D Part Amodal Segmentation [23.84639726216676]
3D part amodal segmentation is a challenging but crucial task for 3D content creation and understanding.<n>Inspired by 2D amodal segmentation, we introduce this novel task to the 3D domain and propose a practical, two-stage approach.<n>We introduce new benchmarks based on the ABO and Part-Tiny datasets and demonstrate that HoloPart significantly outperforms state-of-the-art shape completion methods.
arXiv Detail & Related papers (2025-04-10T17:53:31Z) - PartGen: Part-level 3D Generation and Reconstruction with Multi-View Diffusion Models [63.1432721793683]
We introduce PartGen, a novel approach that generates 3D objects composed of meaningful parts starting from text, an image, or an unstructured 3D object.<n>We evaluate our method on generated and real 3D assets and show that it outperforms segmentation and part-extraction baselines by a large margin.
arXiv Detail & Related papers (2024-12-24T18:59:43Z) - 3D Part Segmentation via Geometric Aggregation of 2D Visual Features [57.20161517451834]
Supervised 3D part segmentation models are tailored for a fixed set of objects and parts, limiting their transferability to open-set, real-world scenarios.<n>Recent works have explored vision-language models (VLMs) as a promising alternative, using multi-view rendering and textual prompting to identify object parts.<n>To address these limitations, we propose COPS, a COmprehensive model for Parts that blends semantics extracted from visual concepts and 3D geometry to effectively identify object parts.
arXiv Detail & Related papers (2024-12-05T15:27:58Z) - Part123: Part-aware 3D Reconstruction from a Single-view Image [54.589723979757515]
Part123 is a novel framework for part-aware 3D reconstruction from a single-view image.
We introduce contrastive learning into a neural rendering framework to learn a part-aware feature space.
A clustering-based algorithm is also developed to automatically derive 3D part segmentation results from the reconstructed models.
arXiv Detail & Related papers (2024-05-27T07:10:21Z) - 3D Sketch-aware Semantic Scene Completion via Semi-supervised Structure
Prior [50.73148041205675]
The goal of the Semantic Scene Completion (SSC) task is to simultaneously predict a completed 3D voxel representation of volumetric occupancy and semantic labels of objects in the scene from a single-view observation.
We propose to devise a new geometry-based strategy to embed depth information with low-resolution voxel representation.
Our proposed geometric embedding works better than the depth feature learning from habitual SSC frameworks.
arXiv Detail & Related papers (2020-03-31T09:33:46Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.