Category-Level 6D Object Pose Estimation via Cascaded Relation and
Recurrent Reconstruction Networks
- URL: http://arxiv.org/abs/2108.08755v1
- Date: Thu, 19 Aug 2021 15:46:52 GMT
- Title: Category-Level 6D Object Pose Estimation via Cascaded Relation and
Recurrent Reconstruction Networks
- Authors: Jiaze Wang, Kai Chen, Qi Dou
- Abstract summary: Category-level 6D pose estimation is fundamental to many scenarios such as robotic manipulation and augmented reality.
We achieve accurate category-level 6D pose estimation via cascaded relation and recurrent reconstruction networks.
Our method exceeds the latest state-of-the-art SPD by $4.9%$ and $17.7%$ on the CAMERA25 dataset.
- Score: 22.627704070200863
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Category-level 6D pose estimation, aiming to predict the location and
orientation of unseen object instances, is fundamental to many scenarios such
as robotic manipulation and augmented reality, yet still remains unsolved.
Precisely recovering instance 3D model in the canonical space and accurately
matching it with the observation is an essential point when estimating 6D pose
for unseen objects. In this paper, we achieve accurate category-level 6D pose
estimation via cascaded relation and recurrent reconstruction networks.
Specifically, a novel cascaded relation network is dedicated for advanced
representation learning to explore the complex and informative relations among
instance RGB image, instance point cloud and category shape prior. Furthermore,
we design a recurrent reconstruction network for iterative residual refinement
to progressively improve the reconstruction and correspondence estimations from
coarse to fine. Finally, the instance 6D pose is obtained leveraging the
estimated dense correspondences between the instance point cloud and the
reconstructed 3D model in the canonical space. We have conducted extensive
experiments on two well-acknowledged benchmarks of category-level 6D pose
estimation, with significant performance improvement over existing approaches.
On the representatively strict evaluation metrics of $3D_{75}$ and $5^{\circ}2
cm$, our method exceeds the latest state-of-the-art SPD by $4.9\%$ and $17.7\%$
on the CAMERA25 dataset, and by $2.7\%$ and $8.5\%$ on the REAL275 dataset.
Codes are available at https://wangjiaze.cn/projects/6DPoseEstimation.html.
Related papers
- Improving 2D-3D Dense Correspondences with Diffusion Models for 6D
Object Pose Estimation [9.760487761422326]
Estimating 2D-3D correspondences between RGB images and 3D space is a fundamental problem in 6D object pose estimation.
Recent pose estimators use dense correspondence maps and Point-to-Point algorithms to estimate object poses.
Recent advancements in image-to-image translation have led to diffusion models being the superior choice when evaluated on benchmarking datasets.
arXiv Detail & Related papers (2024-02-09T14:27:40Z) - RelPose++: Recovering 6D Poses from Sparse-view Observations [66.6922660401558]
We address the task of estimating 6D camera poses from sparse-view image sets (2-8 images)
We build on the recent RelPose framework which learns a network that infers distributions over relative rotations over image pairs.
Our final system results in large improvements in 6D pose prediction over prior art on both seen and unseen object categories.
arXiv Detail & Related papers (2023-05-08T17:59:58Z) - Coupled Iterative Refinement for 6D Multi-Object Pose Estimation [64.7198752089041]
Given a set of known 3D objects and an RGB or RGB-D input image, we detect and estimate the 6D pose of each object.
Our approach iteratively refines both pose and correspondence in a tightly coupled manner, allowing us to dynamically remove outliers to improve accuracy.
arXiv Detail & Related papers (2022-04-26T18:00:08Z) - FS6D: Few-Shot 6D Pose Estimation of Novel Objects [116.34922994123973]
6D object pose estimation networks are limited in their capability to scale to large numbers of object instances.
In this work, we study a new open set problem; the few-shot 6D object poses estimation: estimating the 6D pose of an unknown object by a few support views without extra training.
arXiv Detail & Related papers (2022-03-28T10:31:29Z) - NeRF-Pose: A First-Reconstruct-Then-Regress Approach for
Weakly-supervised 6D Object Pose Estimation [44.42449011619408]
We present a weakly-supervised reconstruction-based pipeline, named NeRF-Pose, which needs only 2D object segmentation and known relative camera poses during training.
A NeRF-enabled RAN+SAC algorithm is used to estimate stable and accurate pose from the predicted correspondences.
Experiments on LineMod-Occlusion show that the proposed method has state-of-the-art accuracy in comparison to the best 6D pose estimation methods.
arXiv Detail & Related papers (2022-03-09T15:28:02Z) - Leveraging SE(3) Equivariance for Self-Supervised Category-Level Object
Pose Estimation [30.04752448942084]
Category-level object pose estimation aims to find 6D object poses of previously unseen object instances from known categories without access to object CAD models.
We propose for the first time a self-supervised learning framework to estimate category-level 6D object pose from single 3D point clouds.
arXiv Detail & Related papers (2021-10-30T06:46:44Z) - FS-Net: Fast Shape-based Network for Category-Level 6D Object Pose
Estimation with Decoupled Rotation Mechanism [49.89268018642999]
We propose a fast shape-based network (FS-Net) with efficient category-level feature extraction for 6D pose estimation.
The proposed method achieves state-of-the-art performance in both category- and instance-level 6D object pose estimation.
arXiv Detail & Related papers (2021-03-12T03:07:24Z) - Next-best-view Regression using a 3D Convolutional Neural Network [0.9449650062296823]
We propose a data-driven approach to address the next-best-view problem.
The proposed approach trains a 3D convolutional neural network with previous reconstructions in order to regress the btxtposition of the next-best-view.
We have validated the proposed approach making use of two groups of experiments.
arXiv Detail & Related papers (2021-01-23T01:50:26Z) - Spatial Attention Improves Iterative 6D Object Pose Estimation [52.365075652976735]
We propose a new method for 6D pose estimation refinement from RGB images.
Our main insight is that after the initial pose estimate, it is important to pay attention to distinct spatial features of the object.
We experimentally show that this approach learns to attend to salient spatial features and learns to ignore occluded parts of the object, leading to better pose estimation across datasets.
arXiv Detail & Related papers (2021-01-05T17:18:52Z) - Shape Prior Deformation for Categorical 6D Object Pose and Size
Estimation [62.618227434286]
We present a novel learning approach to recover the 6D poses and sizes of unseen object instances from an RGB-D image.
We propose a deep network to reconstruct the 3D object model by explicitly modeling the deformation from a pre-learned categorical shape prior.
arXiv Detail & Related papers (2020-07-16T16:45:05Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.