SpyroPose: SE(3) Pyramids for Object Pose Distribution Estimation
- URL: http://arxiv.org/abs/2303.05308v2
- Date: Wed, 3 Jan 2024 11:12:33 GMT
- Title: SpyroPose: SE(3) Pyramids for Object Pose Distribution Estimation
- Authors: Rasmus Laurvig Haugaard, Frederik Hagelskj{\ae}r, Thorbj{\o}rn
Mosekj{\ae}r Iversen
- Abstract summary: We propose a novel method for pose distribution estimation on SE(3).
We use a hierarchical grid, a pyramid, which enables efficient importance sampling during training and sparse evaluation of the pyramid at inference.
Our method outperforms state-of-the-art methods on SO(3), and to the best of our knowledge, we provide the first quantitative results on pose distribution estimation on SE(3).
- Score: 4.8342038441006805
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Object pose estimation is a core computer vision problem and often an
essential component in robotics. Pose estimation is usually approached by
seeking the single best estimate of an object's pose, but this approach is
ill-suited for tasks involving visual ambiguity. In such cases it is desirable
to estimate the uncertainty as a pose distribution to allow downstream tasks to
make informed decisions. Pose distributions can have arbitrary complexity which
motivates estimating unparameterized distributions, however, until now they
have only been used for orientation estimation on SO(3) due to the difficulty
in training on and normalizing over SE(3). We propose a novel method for pose
distribution estimation on SE(3). We use a hierarchical grid, a pyramid, which
enables efficient importance sampling during training and sparse evaluation of
the pyramid at inference, allowing real time 6D pose distribution estimation.
Our method outperforms state-of-the-art methods on SO(3), and to the best of
our knowledge, we provide the first quantitative results on pose distribution
estimation on SE(3). Code will be available at spyropose.github.io
Related papers
- ADen: Adaptive Density Representations for Sparse-view Camera Pose Estimation [17.097170273209333]
Recovering camera poses from a set of images is a foundational task in 3D computer vision.
Recent data-driven approaches aim to directly output camera poses, either through regressing the 6DoF camera poses or formulating rotation as a probability distribution.
We propose ADen to unify the two frameworks by employing a generator and a discriminator.
arXiv Detail & Related papers (2024-08-16T22:45:46Z) - DVMNet: Computing Relative Pose for Unseen Objects Beyond Hypotheses [59.51874686414509]
Current approaches approximate the continuous pose representation with a large number of discrete pose hypotheses.
We present a Deep Voxel Matching Network (DVMNet) that eliminates the need for pose hypotheses and computes the relative object pose in a single pass.
Our method delivers more accurate relative pose estimates for novel objects at a lower computational cost compared to state-of-the-art methods.
arXiv Detail & Related papers (2024-03-20T15:41:32Z) - PoseMatcher: One-shot 6D Object Pose Estimation by Deep Feature Matching [51.142988196855484]
We propose PoseMatcher, an accurate model free one-shot object pose estimator.
We create a new training pipeline for object to image matching based on a three-view system.
To enable PoseMatcher to attend to distinct input modalities, an image and a pointcloud, we introduce IO-Layer.
arXiv Detail & Related papers (2023-04-03T21:14:59Z) - DiffPose: Multi-hypothesis Human Pose Estimation using Diffusion models [5.908471365011943]
We propose emphDiffPose, a conditional diffusion model that predicts multiple hypotheses for a given input image.
We show that DiffPose slightly improves upon the state of the art for multi-hypothesis pose estimation for simple poses and outperforms it by a large margin for highly ambiguous poses.
arXiv Detail & Related papers (2022-11-29T18:55:13Z) - LocPoseNet: Robust Location Prior for Unseen Object Pose Estimation [69.70498875887611]
LocPoseNet is able to robustly learn location prior for unseen objects.
Our method outperforms existing works by a large margin on LINEMOD and GenMOP.
arXiv Detail & Related papers (2022-11-29T15:21:34Z) - VL4Pose: Active Learning Through Out-Of-Distribution Detection For Pose
Estimation [79.50280069412847]
We introduce VL4Pose, a first principles approach for active learning through out-of-distribution detection.
Our solution involves modelling the pose through a simple parametric Bayesian network trained via maximum likelihood estimation.
We perform qualitative and quantitative experiments on three datasets: MPII, LSP and ICVL, spanning human and hand pose estimation.
arXiv Detail & Related papers (2022-10-12T09:03:55Z) - Ki-Pode: Keypoint-based Implicit Pose Distribution Estimation of Rigid
Objects [1.209625228546081]
We propose a novel pose distribution estimation method.
An implicit formulation of the probability distribution over object pose is derived from an intermediary representation of an object as a set of keypoints.
The method has been evaluated on the task of rotation distribution estimation on the YCB-V and T-LESS datasets.
arXiv Detail & Related papers (2022-09-20T11:59:05Z) - Unseen Object 6D Pose Estimation: A Benchmark and Baselines [62.8809734237213]
We propose a new task that enables and facilitates algorithms to estimate the 6D pose estimation of novel objects during testing.
We collect a dataset with both real and synthetic images and up to 48 unseen objects in the test set.
By training an end-to-end 3D correspondences network, our method finds corresponding points between an unseen object and a partial view RGBD image accurately and efficiently.
arXiv Detail & Related papers (2022-06-23T16:29:53Z) - CPPF: Towards Robust Category-Level 9D Pose Estimation in the Wild [45.93626858034774]
Category-level PPF voting method to achieve accurate, robust and generalizable 9D pose estimation in the wild.
A novel coarse-to-fine voting algorithm is proposed to eliminate noisy point pair samples and generate final predictions from the population.
Our method is on par with current state of the arts with real-world training data.
arXiv Detail & Related papers (2022-03-07T01:36:22Z) - Implicit-PDF: Non-Parametric Representation of Probability Distributions
on the Rotation Manifold [47.31074799708132]
We introduce a method to estimate arbitrary, non-parametric distributions on SO(3).
Our key idea is to represent the distributions implicitly, with a neural network that estimates the probability given the input image and a candidate pose.
We achieve state-of-the-art performance on Pascal3D+ and ModelNet10-SO(3) benchmarks.
arXiv Detail & Related papers (2021-06-10T17:57:23Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.