Certifiable 3D Object Pose Estimation: Foundations, Learning Models, and
Self-Training
- URL: http://arxiv.org/abs/2206.11215v4
- Date: Fri, 28 Apr 2023 19:47:34 GMT
- Title: Certifiable 3D Object Pose Estimation: Foundations, Learning Models, and
Self-Training
- Authors: Rajat Talak, Lisa Peng, and Luca Carlone
- Abstract summary: We consider a certifiable object pose estimation problem, where -- given a partial point cloud of an object -- the goal is to provide a certificate of correctness for the resulting estimate.
We propose C-3PO, a semantic-keypoint-based pose estimation model, augmented with the two certificates.
- Score: 23.802602957611676
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: We consider a certifiable object pose estimation problem, where -- given a
partial point cloud of an object -- the goal is to not only estimate the object
pose, but also to provide a certificate of correctness for the resulting
estimate. Our first contribution is a general theory of certification for
end-to-end perception models. In particular, we introduce the notion of
$\zeta$-correctness, which bounds the distance between an estimate and the
ground truth. We show that $\zeta$-correctness can be assessed by implementing
two certificates: (i) a certificate of observable correctness, that asserts if
the model output is consistent with the input data and prior information, (ii)
a certificate of non-degeneracy, that asserts whether the input data is
sufficient to compute a unique estimate. Our second contribution is to apply
this theory and design a new learning-based certifiable pose estimator. We
propose C-3PO, a semantic-keypoint-based pose estimation model, augmented with
the two certificates, to solve the certifiable pose estimation problem. C-3PO
also includes a keypoint corrector, implemented as a differentiable
optimization layer, that can correct large detection errors (e.g. due to the
sim-to-real gap). Our third contribution is a novel self-supervised training
approach that uses our certificate of observable correctness to provide the
supervisory signal to C-3PO during training. In it, the model trains only on
the observably correct input-output pairs, in each training iteration. As
training progresses, we see that the observably correct input-output pairs
grow, eventually reaching near 100% in many cases. Our experiments show that
(i) standard semantic-keypoint-based methods outperform more recent
alternatives, (ii) C-3PO further improves performance and significantly
outperforms all the baselines, and (iii) C-3PO's certificates are able to
discern correct pose estimates.
Related papers
- Certifying Robustness of Learning-Based Keypoint Detection and Pose Estimation Methods [9.953693315812995]
This work addresses the certification of robustness of vision-based two-stage 6D object pose estimation.
The core idea is to transform the certification of local robustness into neural network verification for classification tasks.
arXiv Detail & Related papers (2024-07-31T19:02:54Z) - Q-REG: End-to-End Trainable Point Cloud Registration with Surface
Curvature [81.25511385257344]
We present a novel solution, Q-REG, which utilizes rich geometric information to estimate the rigid pose from a single correspondence.
Q-REG allows to formalize the robust estimation as an exhaustive search, hence enabling end-to-end training.
We demonstrate in the experiments that Q-REG is agnostic to the correspondence matching method and provides consistent improvement both when used only in inference and in end-to-end training.
arXiv Detail & Related papers (2023-09-27T20:58:53Z) - A Correct-and-Certify Approach to Self-Supervise Object Pose Estimators
via Ensemble Self-Training [26.47895284071508]
Real-world robotics applications demand object pose estimation methods that work reliably across a variety of scenarios.
Our first contribution is to develop a robust corrector module that corrects pose estimates using depth information.
Our second contribution is an ensemble self-training approach that simultaneously trains multiple pose estimators in a self-supervised manner.
arXiv Detail & Related papers (2023-02-12T23:02:03Z) - Selective classification using a robust meta-learning approach [28.460912135533988]
We propose a novel instance-conditioned reweighting approach that captures predictive uncertainty using an auxiliary network.
We show in controlled experiments that we effectively capture the diverse specific notions of uncertainty through this meta-objective.
For diabetic retinopathy, we see upto 3.4%/3.3% accuracy and AUC gains over SOTA in selective classification.
arXiv Detail & Related papers (2022-12-12T15:45:23Z) - CPPF++: Uncertainty-Aware Sim2Real Object Pose Estimation by Vote Aggregation [67.12857074801731]
We introduce a novel method, CPPF++, designed for sim-to-real pose estimation.
To address the challenge posed by vote collision, we propose a novel approach that involves modeling the voting uncertainty.
We incorporate several innovative modules, including noisy pair filtering, online alignment optimization, and a feature ensemble.
arXiv Detail & Related papers (2022-11-24T03:27:00Z) - VL4Pose: Active Learning Through Out-Of-Distribution Detection For Pose
Estimation [79.50280069412847]
We introduce VL4Pose, a first principles approach for active learning through out-of-distribution detection.
Our solution involves modelling the pose through a simple parametric Bayesian network trained via maximum likelihood estimation.
We perform qualitative and quantitative experiments on three datasets: MPII, LSP and ICVL, spanning human and hand pose estimation.
arXiv Detail & Related papers (2022-10-12T09:03:55Z) - Leveraging Unlabeled Data to Predict Out-of-Distribution Performance [63.740181251997306]
Real-world machine learning deployments are characterized by mismatches between the source (training) and target (test) distributions.
In this work, we investigate methods for predicting the target domain accuracy using only labeled source data and unlabeled target data.
We propose Average Thresholded Confidence (ATC), a practical method that learns a threshold on the model's confidence, predicting accuracy as the fraction of unlabeled examples.
arXiv Detail & Related papers (2022-01-11T23:01:12Z) - Dynamic Iterative Refinement for Efficient 3D Hand Pose Estimation [87.54604263202941]
We propose a tiny deep neural network of which partial layers are iteratively exploited for refining its previous estimations.
We employ learned gating criteria to decide whether to exit from the weight-sharing loop, allowing per-sample adaptation in our model.
Our method consistently outperforms state-of-the-art 2D/3D hand pose estimation approaches in terms of both accuracy and efficiency for widely used benchmarks.
arXiv Detail & Related papers (2021-11-11T23:31:34Z) - Fast Uncertainty Quantification for Deep Object Pose Estimation [91.09217713805337]
Deep learning-based object pose estimators are often unreliable and overconfident.
In this work, we propose a simple, efficient, and plug-and-play UQ method for 6-DoF object pose estimation.
arXiv Detail & Related papers (2020-11-16T06:51:55Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.