A Pose Proposal and Refinement Network for Better Object Pose Estimation
- URL: http://arxiv.org/abs/2004.05507v2
- Date: Wed, 7 Oct 2020 15:41:11 GMT
- Title: A Pose Proposal and Refinement Network for Better Object Pose Estimation
- Authors: Ameni Trabelsi, Mohamed Chaabane, Nathaniel Blanchard and Ross
Beveridge
- Abstract summary: We present a novel, end-to-end 6D object pose estimation method that operates on RGB inputs.
Our proposed pipeline outperforms state-of-the-art RGB-based methods with competitive runtime performance.
- Score: 0.5735035463793008
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In this paper, we present a novel, end-to-end 6D object pose estimation
method that operates on RGB inputs. Our approach is composed of 2 main
components: the first component classifies the objects in the input image and
proposes an initial 6D pose estimate through a multi-task, CNN-based
encoder/multi-decoder module. The second component, a refinement module,
includes a renderer and a multi-attentional pose refinement network, which
iteratively refines the estimated poses by utilizing both appearance features
and flow vectors. Our refiner takes advantage of the hybrid representation of
the initial pose estimates to predict the relative errors with respect to the
target poses. It is further augmented by a spatial multi-attention block that
emphasizes objects' discriminative feature parts. Experiments on three
benchmarks for 6D pose estimation show that our proposed pipeline outperforms
state-of-the-art RGB-based methods with competitive runtime performance.
Related papers
- RelPose++: Recovering 6D Poses from Sparse-view Observations [66.6922660401558]
We address the task of estimating 6D camera poses from sparse-view image sets (2-8 images)
We build on the recent RelPose framework which learns a network that infers distributions over relative rotations over image pairs.
Our final system results in large improvements in 6D pose prediction over prior art on both seen and unseen object categories.
arXiv Detail & Related papers (2023-05-08T17:59:58Z) - Robust Category-Level 6D Pose Estimation with Coarse-to-Fine Rendering
of Neural Features [17.920305227880245]
We consider the problem of category-level 6D pose estimation from a single RGB image.
Our approach represents an object category as a cuboid mesh and learns a generative model of the neural feature activations at each mesh.
Our experiments demonstrate an enhanced category-level 6D pose estimation performance compared to prior work.
arXiv Detail & Related papers (2022-09-12T21:31:36Z) - Coupled Iterative Refinement for 6D Multi-Object Pose Estimation [64.7198752089041]
Given a set of known 3D objects and an RGB or RGB-D input image, we detect and estimate the 6D pose of each object.
Our approach iteratively refines both pose and correspondence in a tightly coupled manner, allowing us to dynamically remove outliers to improve accuracy.
arXiv Detail & Related papers (2022-04-26T18:00:08Z) - ZebraPose: Coarse to Fine Surface Encoding for 6DoF Object Pose
Estimation [76.31125154523056]
We present a discrete descriptor, which can represent the object surface densely.
We also propose a coarse to fine training strategy, which enables fine-grained correspondence prediction.
arXiv Detail & Related papers (2022-03-17T16:16:24Z) - Single-stage Keypoint-based Category-level Object Pose Estimation from
an RGB Image [27.234658117816103]
We propose a single-stage, keypoint-based approach for category-level object pose estimation.
The proposed network performs 2D object detection, detects 2D keypoints, estimates 6-DoF pose, and regresses relative bounding cuboid dimensions.
We conduct extensive experiments on the challenging Objectron benchmark, outperforming state-of-the-art methods on the 3D IoU metric.
arXiv Detail & Related papers (2021-09-13T17:55:00Z) - SO-Pose: Exploiting Self-Occlusion for Direct 6D Pose Estimation [98.83762558394345]
SO-Pose is a framework for regressing all 6 degrees-of-freedom (6DoF) for the object pose in a cluttered environment from a single RGB image.
We introduce a novel reasoning about self-occlusion, in order to establish a two-layer representation for 3D objects.
Cross-layer consistencies that align correspondences, self-occlusion and 6D pose, we can further improve accuracy and robustness.
arXiv Detail & Related papers (2021-08-18T19:49:29Z) - FS-Net: Fast Shape-based Network for Category-Level 6D Object Pose
Estimation with Decoupled Rotation Mechanism [49.89268018642999]
We propose a fast shape-based network (FS-Net) with efficient category-level feature extraction for 6D pose estimation.
The proposed method achieves state-of-the-art performance in both category- and instance-level 6D object pose estimation.
arXiv Detail & Related papers (2021-03-12T03:07:24Z) - DualPoseNet: Category-level 6D Object Pose and Size Estimation using
Dual Pose Network with Refined Learning of Pose Consistency [30.214100288708163]
Category-level 6D object pose and size estimation is to predict 9 degrees-of-freedom (9DoF) pose configurations of rotation, translation, and size for object instances.
We propose a new method of Dual Pose Network with refined learning of pose consistency for this task, shortened as DualPoseNet.
arXiv Detail & Related papers (2021-03-11T08:33:47Z) - Spatial Attention Improves Iterative 6D Object Pose Estimation [52.365075652976735]
We propose a new method for 6D pose estimation refinement from RGB images.
Our main insight is that after the initial pose estimate, it is important to pay attention to distinct spatial features of the object.
We experimentally show that this approach learns to attend to salient spatial features and learns to ignore occluded parts of the object, leading to better pose estimation across datasets.
arXiv Detail & Related papers (2021-01-05T17:18:52Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.