Neural Part Priors: Learning to Optimize Part-Based Object Completion in
RGB-D Scans
- URL: http://arxiv.org/abs/2203.09375v2
- Date: Sat, 25 Mar 2023 01:54:06 GMT
- Title: Neural Part Priors: Learning to Optimize Part-Based Object Completion in
RGB-D Scans
- Authors: Alexey Bokhovkin, Angela Dai
- Abstract summary: We propose to leverage large-scale synthetic datasets of 3D shapes annotated with part information to learn Neural Part Priors.
We can optimize over the learned part priors in order to fit to real-world scanned 3D scenes at test time.
Experiments on the ScanNet dataset demonstrate that NPPs significantly outperforms state of the art in part decomposition and object completion.
- Score: 27.377128012679076
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: 3D object recognition has seen significant advances in recent years, showing
impressive performance on real-world 3D scan benchmarks, but lacking in object
part reasoning, which is fundamental to higher-level scene understanding such
as inter-object similarities or object functionality. Thus, we propose to
leverage large-scale synthetic datasets of 3D shapes annotated with part
information to learn Neural Part Priors (NPPs), optimizable spaces
characterizing geometric part priors. Crucially, we can optimize over the
learned part priors in order to fit to real-world scanned 3D scenes at test
time, enabling robust part decomposition of the real objects in these scenes
that also estimates the complete geometry of the object while fitting
accurately to the observed real geometry. Moreover, this enables global
optimization over geometrically similar detected objects in a scene, which
often share strong geometric commonalities, enabling scene-consistent part
decompositions. Experiments on the ScanNet dataset demonstrate that NPPs
significantly outperforms state of the art in part decomposition and object
completion in real-world scenes.
Related papers
- Enhancing Generalizability of Representation Learning for Data-Efficient 3D Scene Understanding [50.448520056844885]
We propose a generative Bayesian network to produce diverse synthetic scenes with real-world patterns.
A series of experiments robustly display our method's consistent superiority over existing state-of-the-art pre-training approaches.
arXiv Detail & Related papers (2024-06-17T07:43:53Z) - Generalizing Single-View 3D Shape Retrieval to Occlusions and Unseen
Objects [32.32128461720876]
Single-view 3D shape retrieval is a challenging task that is increasingly important with the growth of available 3D data.
We systematically evaluate single-view 3D shape retrieval along three different axes: the presence of object occlusions and truncations, generalization to unseen 3D shape data, and generalization to unseen objects in the input images.
arXiv Detail & Related papers (2023-12-31T05:39:38Z) - LASA: Instance Reconstruction from Real Scans using A Large-scale
Aligned Shape Annotation Dataset [17.530432165466507]
We present a novel Cross-Modal Shape Reconstruction (DisCo) method and an Occupancy-Guided 3D Object Detection (OccGOD) method.
Our methods achieve state-of-the-art performance in both instance-level scene reconstruction and 3D object detection tasks.
arXiv Detail & Related papers (2023-12-19T18:50:10Z) - Semi-Weakly Supervised Object Kinematic Motion Prediction [56.282759127180306]
Given a 3D object, kinematic motion prediction aims to identify the mobile parts as well as the corresponding motion parameters.
We propose a graph neural network to learn the map between hierarchical part-level segmentation and mobile parts parameters.
The network predictions yield a large scale of 3D objects with pseudo labeled mobility information.
arXiv Detail & Related papers (2023-03-31T02:37:36Z) - 3D Video Object Detection with Learnable Object-Centric Global
Optimization [65.68977894460222]
Correspondence-based optimization is the cornerstone for 3D scene reconstruction but is less studied in 3D video object detection.
We propose BA-Det, an end-to-end optimizable object detector with object-centric temporal correspondence learning and featuremetric object bundle adjustment.
arXiv Detail & Related papers (2023-03-27T17:39:39Z) - RandomRooms: Unsupervised Pre-training from Synthetic Shapes and
Randomized Layouts for 3D Object Detection [138.2892824662943]
A promising solution is to make better use of the synthetic dataset, which consists of CAD object models, to boost the learning on real datasets.
Recent work on 3D pre-training exhibits failure when transfer features learned on synthetic objects to other real-world applications.
In this work, we put forward a new method called RandomRooms to accomplish this objective.
arXiv Detail & Related papers (2021-08-17T17:56:12Z) - Discovering 3D Parts from Image Collections [98.16987919686709]
We tackle the problem of 3D part discovery from only 2D image collections.
Instead of relying on manually annotated parts for supervision, we propose a self-supervised approach.
Our key insight is to learn a novel part shape prior that allows each part to fit an object shape faithfully while constrained to have simple geometry.
arXiv Detail & Related papers (2021-07-28T20:29:16Z) - Towards Part-Based Understanding of RGB-D Scans [43.4094489272776]
We propose the task of part-based scene understanding of real-world 3D environments.
From an RGB-D scan of a scene, we detect objects, and for each object predict its decomposition into geometric part masks.
We leverage an intermediary part graph representation to enable robust completion as well as building of part priors.
arXiv Detail & Related papers (2020-12-03T17:30:02Z) - Learning Unsupervised Hierarchical Part Decomposition of 3D Objects from
a Single RGB Image [102.44347847154867]
We propose a novel formulation that allows to jointly recover the geometry of a 3D object as a set of primitives.
Our model recovers the higher level structural decomposition of various objects in the form of a binary tree of primitives.
Our experiments on the ShapeNet and D-FAUST datasets demonstrate that considering the organization of parts indeed facilitates reasoning about 3D geometry.
arXiv Detail & Related papers (2020-04-02T17:58:05Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.