DualPoseNet: Category-level 6D Object Pose and Size Estimation using
Dual Pose Network with Refined Learning of Pose Consistency
- URL: http://arxiv.org/abs/2103.06526v1
- Date: Thu, 11 Mar 2021 08:33:47 GMT
- Title: DualPoseNet: Category-level 6D Object Pose and Size Estimation using
Dual Pose Network with Refined Learning of Pose Consistency
- Authors: Jiehong Lin, Zewei Wei, Zhihao Li, Songcen Xu, Kui Jia, Yuanqing Li
- Abstract summary: Category-level 6D object pose and size estimation is to predict 9 degrees-of-freedom (9DoF) pose configurations of rotation, translation, and size for object instances.
We propose a new method of Dual Pose Network with refined learning of pose consistency for this task, shortened as DualPoseNet.
- Score: 30.214100288708163
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Category-level 6D object pose and size estimation is to predict 9
degrees-of-freedom (9DoF) pose configurations of rotation, translation, and
size for object instances observed in single, arbitrary views of cluttered
scenes. It extends previous related tasks with learning of the two additional
rotation angles. This seemingly small difference poses technical challenges due
to the learning and prediction in the full rotation space of SO(3). In this
paper, we propose a new method of Dual Pose Network with refined learning of
pose consistency for this task, shortened as DualPoseNet. DualPoseNet stacks
two parallel pose decoders on top of a shared pose encoder, where the implicit
decoder predicts object poses with a working mechanism different from that of
the explicit one; they thus impose complementary supervision on the training of
pose encoder. We construct the encoder based on spherical convolutions, and
design a module of Spherical Fusion wherein for a better embedding of
pose-sensitive features from the appearance and shape observations. Given no
the testing CAD models, it is the novel introduction of the implicit decoder
that enables the refined pose prediction during testing, by enforcing the
predicted pose consistency between the two decoders using a self-adaptive loss
term. Thorough experiments on the benchmark 9DoF object pose datasets of
CAMERA25 and REAL275 confirm efficacy of our designs. DualPoseNet outperforms
existing methods with a large margin in the regime of high precision.
Related papers
- SecondPose: SE(3)-Consistent Dual-Stream Feature Fusion for Category-Level Pose Estimation [79.12683101131368]
Category-level object pose estimation, aiming to predict the 6D pose and 3D size of objects from known categories, typically struggles with large intra-class shape variation.
We present SecondPose, a novel approach integrating object-specific geometric features with semantic category priors from DINOv2.
arXiv Detail & Related papers (2023-11-18T17:14:07Z) - RelPose++: Recovering 6D Poses from Sparse-view Observations [66.6922660401558]
We address the task of estimating 6D camera poses from sparse-view image sets (2-8 images)
We build on the recent RelPose framework which learns a network that infers distributions over relative rotations over image pairs.
Our final system results in large improvements in 6D pose prediction over prior art on both seen and unseen object categories.
arXiv Detail & Related papers (2023-05-08T17:59:58Z) - CheckerPose: Progressive Dense Keypoint Localization for Object Pose
Estimation with Graph Neural Network [66.24726878647543]
Estimating the 6-DoF pose of a rigid object from a single RGB image is a crucial yet challenging task.
Recent studies have shown the great potential of dense correspondence-based solutions.
We propose a novel pose estimation algorithm named CheckerPose, which improves on three main aspects.
arXiv Detail & Related papers (2023-03-29T17:30:53Z) - DCL-Net: Deep Correspondence Learning Network for 6D Pose Estimation [43.963630959349885]
We introduce a new method of Deep Correspondence Learning Network for direct 6D object pose estimation, shortened as DCL-Net.
We show that DCL-Net outperforms existing methods on three benchmarking datasets, including YCB-Video, LineMOD, and Oclussion-LineMOD.
arXiv Detail & Related papers (2022-10-11T08:04:40Z) - Coupled Iterative Refinement for 6D Multi-Object Pose Estimation [64.7198752089041]
Given a set of known 3D objects and an RGB or RGB-D input image, we detect and estimate the 6D pose of each object.
Our approach iteratively refines both pose and correspondence in a tightly coupled manner, allowing us to dynamically remove outliers to improve accuracy.
arXiv Detail & Related papers (2022-04-26T18:00:08Z) - SO-Pose: Exploiting Self-Occlusion for Direct 6D Pose Estimation [98.83762558394345]
SO-Pose is a framework for regressing all 6 degrees-of-freedom (6DoF) for the object pose in a cluttered environment from a single RGB image.
We introduce a novel reasoning about self-occlusion, in order to establish a two-layer representation for 3D objects.
Cross-layer consistencies that align correspondences, self-occlusion and 6D pose, we can further improve accuracy and robustness.
arXiv Detail & Related papers (2021-08-18T19:49:29Z) - DSC-PoseNet: Learning 6DoF Object Pose Estimation via Dual-scale
Consistency [43.09728251735362]
We present a two-step pose estimation framework to attain 6DoF object poses from 2D object bounding-boxes.
In the first step, the framework learns to segment objects from real and synthetic data.
In the second step, we design a dual-scale pose estimation network, namely DSC-PoseNet.
Our method outperforms state-of-the-art models trained on synthetic data by a large margin.
arXiv Detail & Related papers (2021-04-08T10:19:35Z) - CAPTRA: CAtegory-level Pose Tracking for Rigid and Articulated Objects
from Point Clouds [97.63549045541296]
We propose a unified framework that can handle 9DoF pose tracking for novel rigid object instances and per-part pose tracking for articulated objects.
Our method achieves new state-of-the-art performance on category-level rigid object pose (NOCS-REAL275) and articulated object pose benchmarks (SAPIEN, BMVC) at the fastest FPS 12.
arXiv Detail & Related papers (2021-04-08T00:14:58Z) - A Pose Proposal and Refinement Network for Better Object Pose Estimation [0.5735035463793008]
We present a novel, end-to-end 6D object pose estimation method that operates on RGB inputs.
Our proposed pipeline outperforms state-of-the-art RGB-based methods with competitive runtime performance.
arXiv Detail & Related papers (2020-04-11T23:13:54Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.