SSP-Pose: Symmetry-Aware Shape Prior Deformation for Direct
Category-Level Object Pose Estimation
- URL: http://arxiv.org/abs/2208.06661v1
- Date: Sat, 13 Aug 2022 14:37:31 GMT
- Title: SSP-Pose: Symmetry-Aware Shape Prior Deformation for Direct
Category-Level Object Pose Estimation
- Authors: Ruida Zhang, Yan Di, Fabian Manhardt, Federico Tombari, Xiangyang Ji
- Abstract summary: Category-level pose estimation is a challenging problem due to intra-class shape variations.
We propose an end-to-end trainable network SSP-Pose for category-level pose estimation.
SSP-Pose produces superior performance compared with competitors with a real-time inference speed at about 25Hz.
- Score: 77.88624073105768
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: Category-level pose estimation is a challenging problem due to intra-class
shape variations. Recent methods deform pre-computed shape priors to map the
observed point cloud into the normalized object coordinate space and then
retrieve the pose via post-processing, i.e., Umeyama's Algorithm. The
shortcomings of this two-stage strategy lie in two aspects: 1) The surrogate
supervision on the intermediate results can not directly guide the learning of
pose, resulting in large pose error after post-processing. 2) The inference
speed is limited by the post-processing step. In this paper, to handle these
shortcomings, we propose an end-to-end trainable network SSP-Pose for
category-level pose estimation, which integrates shape priors into a direct
pose regression network. SSP-Pose stacks four individual branches on a shared
feature extractor, where two branches are designed to deform and match the
prior model with the observed instance, and the other two branches are applied
for directly regressing the totally 9 degrees-of-freedom pose and performing
symmetry reconstruction and point-wise inlier mask prediction respectively.
Consistency loss terms are then naturally exploited to align the outputs of
different branches and promote the performance. During inference, only the
direct pose regression branch is needed. In this manner, SSP-Pose not only
learns category-level pose-sensitive characteristics to boost performance but
also keeps a real-time inference speed. Moreover, we utilize the symmetry
information of each category to guide the shape prior deformation, and propose
a novel symmetry-aware loss to mitigate the matching ambiguity. Extensive
experiments on public datasets demonstrate that SSP-Pose produces superior
performance compared with competitors with a real-time inference speed at about
25Hz.
Related papers
- Towards Cross-View-Consistent Self-Supervised Surround Depth Estimation [9.569646683579899]
Self-Supervised Surround Depth Estimation from consecutive images offers an economical alternative.
Previous SSSDE methods have proposed different mechanisms to fuse information across images, but few of them explicitly consider the cross-view constraints.
This paper proposes an efficient and consistent pose estimation design and two loss functions to enhance cross-view consistency for SSSDE.
arXiv Detail & Related papers (2024-07-04T16:29:05Z) - DR-Pose: A Two-stage Deformation-and-Registration Pipeline for
Category-level 6D Object Pose Estimation [9.074911955402495]
Category-level object pose estimation involves estimating the 6D pose and the 3D metric size of objects from predetermined categories.
Recent approaches take categorical shape prior information as reference to improve pose estimation accuracy.
We propose a two-stage deformation-and registration pipeline called DR-Pose, which consists of completion-aided deformation stage and scaled registration stage.
arXiv Detail & Related papers (2023-09-05T03:24:09Z) - IMP: Iterative Matching and Pose Estimation with Adaptive Pooling [34.36397639248686]
We propose an textbfefficient IMP, called EIMP, to dynamically discard keypoints without potential matches.
Experiments on YFCC100m, Scannet, and Aachen Day-Night datasets demonstrate that the proposed method outperforms previous approaches in terms of accuracy and efficiency.
arXiv Detail & Related papers (2023-04-28T13:25:50Z) - Generalizing Neural Human Fitting to Unseen Poses With Articulated SE(3)
Equivariance [48.39751410262664]
ArtEq is a part-based SE(3)-equivariant neural architecture for SMPL model estimation from point clouds.
Experimental results show that ArtEq generalizes to poses not seen during training, outperforming state-of-the-art methods by 44% in terms of body reconstruction accuracy.
arXiv Detail & Related papers (2023-04-20T17:58:26Z) - RBP-Pose: Residual Bounding Box Projection for Category-Level Pose
Estimation [103.74918834553247]
Category-level object pose estimation aims to predict the 6D pose as well as the 3D metric size of arbitrary objects from a known set of categories.
Recent methods harness shape prior adaptation to map the observed point cloud into the canonical space and apply Umeyama algorithm to recover the pose and size.
We propose a novel geometry-guided Residual Object Bounding Box Projection network RBP-Pose that jointly predicts object pose and residual vectors.
arXiv Detail & Related papers (2022-07-30T14:45:20Z) - CATRE: Iterative Point Clouds Alignment for Category-level Object Pose
Refinement [52.41884119329864]
Category-level object pose and size refiner CATRE is able to iteratively enhance pose estimate from point clouds to produce accurate results.
Our approach remarkably outperforms state-of-the-art methods on REAL275, CAMERA25, and LM benchmarks up to a speed of 85.32Hz.
arXiv Detail & Related papers (2022-07-17T05:55:00Z) - Uncertainty-Aware Adaptation for Self-Supervised 3D Human Pose
Estimation [70.32536356351706]
We introduce MRP-Net that constitutes a common deep network backbone with two output heads subscribing to two diverse configurations.
We derive suitable measures to quantify prediction uncertainty at both pose and joint level.
We present a comprehensive evaluation of the proposed approach and demonstrate state-of-the-art performance on benchmark datasets.
arXiv Detail & Related papers (2022-03-29T07:14:58Z) - Locally Aware Piecewise Transformation Fields for 3D Human Mesh
Registration [67.69257782645789]
We propose piecewise transformation fields that learn 3D translation vectors to map any query point in posed space to its correspond position in rest-pose space.
We show that fitting parametric models with poses by our network results in much better registration quality, especially for extreme poses.
arXiv Detail & Related papers (2021-04-16T15:16:09Z) - Deep Semantic Matching with Foreground Detection and Cycle-Consistency [103.22976097225457]
We address weakly supervised semantic matching based on a deep network.
We explicitly estimate the foreground regions to suppress the effect of background clutter.
We develop cycle-consistent losses to enforce the predicted transformations across multiple images to be geometrically plausible and consistent.
arXiv Detail & Related papers (2020-03-31T22:38:09Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.