Category-Agnostic 6D Pose Estimation with Conditional Neural Processes
- URL: http://arxiv.org/abs/2206.07162v2
- Date: Thu, 19 Oct 2023 09:32:50 GMT
- Title: Category-Agnostic 6D Pose Estimation with Conditional Neural Processes
- Authors: Yumeng Li, Ning Gao, Hanna Ziesche, Gerhard Neumann
- Abstract summary: We present a novel meta-learning approach for 6D pose estimation on unknown objects.
Our algorithm learns object representations in a category-agnostic way, which endows it with strong generalization capabilities across object categories.
- Score: 19.387280883044482
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: We present a novel meta-learning approach for 6D pose estimation on unknown objects. In contrast to "instance-level" and "category-level" pose estimation methods, our algorithm learns object representations in a category-agnostic way, which endows it with strong generalization capabilities across object categories. Specifically, we employ a neural process-based meta-learning approach to train an encoder that captures the texture and geometry of an object in a latent representation, based on very few RGB-D images and ground-truth keypoints. The latent representation is then used by a simultaneously meta-trained decoder to predict the 6D pose of the object in new images. Furthermore, we propose a novel geometry-aware decoder for keypoint prediction using a Graph Neural Network (GNN), which explicitly takes the geometric constraints specific to each object into consideration. To evaluate our algorithm, we conduct extensive experiments on the LINEMOD dataset and on our new fully-annotated synthetic datasets generated from Multiple Categories in Multiple Scenes (MCMS). Experimental results demonstrate that our model performs well on unseen objects with very different shapes and appearances. Remarkably, our model also shows robust performance on occluded scenes, although it was trained entirely on occlusion-free data. To our knowledge, this is the first work exploring cross-category-level 6D pose estimation.
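A minimal sketch of the pipeline the abstract describes is given below, assuming PyTorch. It is not the authors' released code: the feature dimensions, the mean-aggregation, and the MLP keypoint head are illustrative placeholders, and the paper's geometry-aware GNN decoder is approximated here by a plain MLP.

```python
# Illustrative CNP-style sketch, NOT the authors' implementation.
# All dimensions and heads are assumptions; the paper's GNN decoder
# is replaced by a plain MLP for brevity.
import torch
import torch.nn as nn

class ContextEncoder(nn.Module):
    """Embeds each (image feature, ground-truth keypoint) context pair,
    then mean-aggregates into a single order-invariant latent."""
    def __init__(self, feat_dim=256, kp_dim=3, latent_dim=128):
        super().__init__()
        self.mlp = nn.Sequential(
            nn.Linear(feat_dim + kp_dim, 256), nn.ReLU(),
            nn.Linear(256, latent_dim),
        )

    def forward(self, feats, keypoints):
        # feats: (N_ctx, feat_dim), keypoints: (N_ctx, kp_dim)
        pair = torch.cat([feats, keypoints], dim=-1)
        return self.mlp(pair).mean(dim=0)  # (latent_dim,) CNP mean-pooling

class KeypointDecoder(nn.Module):
    """Predicts keypoint locations for a query view, conditioned on the latent."""
    def __init__(self, feat_dim=256, latent_dim=128, n_kp=8):
        super().__init__()
        self.mlp = nn.Sequential(
            nn.Linear(feat_dim + latent_dim, 256), nn.ReLU(),
            nn.Linear(256, n_kp * 3),
        )

    def forward(self, query_feat, z):
        x = torch.cat([query_feat, z], dim=-1)
        return self.mlp(x).view(-1, 3)  # (n_kp, 3) predicted keypoints

# --- toy usage, with random tensors standing in for real RGB-D features ---
encoder, decoder = ContextEncoder(), KeypointDecoder()
z = encoder(torch.randn(5, 256), torch.randn(5, 3))  # 5 context views
kps = decoder(torch.randn(256), z)                   # (8, 3) keypoints for a query view
```

The mean over context embeddings is what makes the latent invariant to the number and order of context views, the defining property of a conditional neural process. The final 6D pose would then be recovered from the predicted keypoints (see the generic PnP sketch after the related-papers list below).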
Related papers
- FoundationPose: Unified 6D Pose Estimation and Tracking of Novel Objects [55.77542145604758]
FoundationPose is a unified foundation model for 6D object pose estimation and tracking.
Our approach can be instantly applied at test-time to a novel object without fine-tuning.
arXiv Detail & Related papers (2023-12-13T18:28:09Z)
- MegaPose: 6D Pose Estimation of Novel Objects via Render & Compare [84.80956484848505]
MegaPose is a method to estimate the 6D pose of novel objects, that is, objects unseen during training.
First, we present a 6D pose refiner based on a render&compare strategy which can be applied to novel objects.
Second, we introduce a novel approach for coarse pose estimation which leverages a network trained to classify whether the pose error between a synthetic rendering and an observed image of the same object can be corrected by the refiner.
arXiv Detail & Related papers (2022-12-13T19:30:03Z)
- Robust Category-Level 6D Pose Estimation with Coarse-to-Fine Rendering of Neural Features [17.920305227880245]
We consider the problem of category-level 6D pose estimation from a single RGB image.
Our approach represents an object category as a cuboid mesh and learns a generative model of the neural feature activations at each mesh vertex.
Our experiments demonstrate improved category-level 6D pose estimation performance compared to prior work.
arXiv Detail & Related papers (2022-09-12T21:31:36Z)
- Unseen Object 6D Pose Estimation: A Benchmark and Baselines [62.8809734237213]
We propose a new task that enables and facilitates algorithms to estimate the 6D pose of novel objects during testing.
We collect a dataset with both real and synthetic images and up to 48 unseen objects in the test set.
By training an end-to-end 3D correspondences network, our method finds corresponding points between an unseen object and a partial view RGBD image accurately and efficiently.
arXiv Detail & Related papers (2022-06-23T16:29:53Z)
- Semantic keypoint-based pose estimation from single RGB frames [64.80395521735463]
We present an approach to estimating the continuous 6-DoF pose of an object from a single RGB image.
The approach combines semantic keypoints predicted by a convolutional network (convnet) with a deformable shape model.
We show that our approach can accurately recover the 6-DoF object pose for both instance- and class-based scenarios; a generic sketch of this kind of keypoints-to-pose (PnP) step appears after this list.
arXiv Detail & Related papers (2022-04-12T15:03:51Z)
- OVE6D: Object Viewpoint Encoding for Depth-based 6D Object Pose Estimation [12.773040823634908]
We propose a universal framework, called OVE6D, for model-based 6D object pose estimation from a single depth image and a target object mask.
Our model is trained using purely synthetic data rendered from ShapeNet, and, unlike most of the existing methods, it generalizes well on new real-world objects without any fine-tuning.
We show that OVE6D outperforms some contemporary deep learning-based pose estimation methods specifically trained for individual objects or datasets with real-world training data.
arXiv Detail & Related papers (2022-03-02T12:51:33Z)
- DONet: Learning Category-Level 6D Object Pose and Size Estimation from Depth Observation [53.55300278592281]
We propose a method of Category-level 6D Object Pose and Size Estimation (COPSE) from a single depth image.
Our framework makes inferences based on the rich geometric information of the object in the depth channel alone.
Our framework competes with state-of-the-art approaches that require labeled real-world images.
arXiv Detail & Related papers (2021-06-27T10:41:50Z)
- Shape Prior Deformation for Categorical 6D Object Pose and Size Estimation [62.618227434286]
We present a novel learning approach to recover the 6D poses and sizes of unseen object instances from an RGB-D image.
We propose a deep network to reconstruct the 3D object model by explicitly modeling the deformation from a pre-learned categorical shape prior.
arXiv Detail & Related papers (2020-07-16T16:45:05Z)
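Several entries above, including the main paper and the semantic-keypoint approach, ultimately recover a 6-DoF pose from 2D-3D keypoint correspondences. The snippet below is a generic illustration of that final step using OpenCV's standard solvePnP; all keypoint coordinates and camera intrinsics are made-up placeholders, not values from any of the papers.

```python
# Generic keypoints-to-pose step via PnP (OpenCV). A standard technique,
# not code from any paper listed above; all numbers are placeholders.
import cv2
import numpy as np

# Hypothetical 3D keypoints in the object frame (e.g. from a shape model)...
object_points = np.array([
    [0.0, 0.0, 0.0], [0.1, 0.0, 0.0], [0.0, 0.1, 0.0],
    [0.0, 0.0, 0.1], [0.1, 0.1, 0.0], [0.1, 0.0, 0.1],
], dtype=np.float64)

# ...and their detected 2D projections in the query image (placeholders).
image_points = np.array([
    [320.0, 240.0], [400.0, 238.0], [322.0, 160.0],
    [318.0, 300.0], [402.0, 158.0], [398.0, 298.0],
], dtype=np.float64)

# Pinhole camera intrinsics (fx, fy, cx, cy) -- placeholder values.
K = np.array([[600.0,   0.0, 320.0],
              [  0.0, 600.0, 240.0],
              [  0.0,   0.0,   1.0]])

ok, rvec, tvec = cv2.solvePnP(object_points, image_points, K, distCoeffs=None)
if ok:
    R, _ = cv2.Rodrigues(rvec)  # rotation matrix of the recovered 6D pose
    print("R =\n", R, "\nt =", tvec.ravel())
```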