Deep Keypoint-Based Camera Pose Estimation with Geometric Constraints
- URL: http://arxiv.org/abs/2007.15122v1
- Date: Wed, 29 Jul 2020 21:41:31 GMT
- Title: Deep Keypoint-Based Camera Pose Estimation with Geometric Constraints
- Authors: You-Yi Jau, Rui Zhu, Hao Su, Manmohan Chandraker
- Abstract summary: Estimating relative camera poses from consecutive frames is a fundamental problem in visual odometry.
We propose an end-to-end trainable framework consisting of learnable modules for detection, feature extraction, matching and outlier rejection.
- Score: 80.60538408386016
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Estimating relative camera poses from consecutive frames is a fundamental
problem in visual odometry (VO) and simultaneous localization and mapping
(SLAM), where classic methods consisting of hand-crafted features and
sampling-based outlier rejection have been a dominant choice for over a decade.
Although multiple works propose to replace these modules with learning-based
counterparts, most have not yet been as accurate, robust and generalizable as
conventional methods. In this paper, we design an end-to-end trainable
framework consisting of learnable modules for detection, feature extraction,
matching and outlier rejection, while directly optimizing for the geometric
pose objective. We show both quantitatively and qualitatively that pose
estimation performance may be achieved on par with the classic pipeline.
Moreover, we are able to show by end-to-end training, the key components of the
pipeline could be significantly improved, which leads to better
generalizability to unseen datasets compared to existing learning-based
methods.
Related papers
- FoundationPose: Unified 6D Pose Estimation and Tracking of Novel Objects [55.77542145604758]
FoundationPose is a unified foundation model for 6D object pose estimation and tracking.
Our approach can be instantly applied at test-time to a novel object without fine-tuning.
arXiv Detail & Related papers (2023-12-13T18:28:09Z) - RelPose: Predicting Probabilistic Relative Rotation for Single Objects
in the Wild [73.1276968007689]
We describe a data-driven method for inferring the camera viewpoints given multiple images of an arbitrary object.
We show that our approach outperforms state-of-the-art SfM and SLAM methods given sparse images on both seen and unseen categories.
arXiv Detail & Related papers (2022-08-11T17:59:59Z) - Semantic keypoint-based pose estimation from single RGB frames [64.80395521735463]
We present an approach to estimating the continuous 6-DoF pose of an object from a single RGB image.
The approach combines semantic keypoints predicted by a convolutional network (convnet) with a deformable shape model.
We show that our approach can accurately recover the 6-DoF object pose for both instance- and class-based scenarios.
arXiv Detail & Related papers (2022-04-12T15:03:51Z) - Deep Shells: Unsupervised Shape Correspondence with Optimal Transport [52.646396621449]
We propose a novel unsupervised learning approach to 3D shape correspondence.
We show that the proposed method significantly improves over the state-of-the-art on multiple datasets.
arXiv Detail & Related papers (2020-10-28T22:24:07Z) - REDE: End-to-end Object 6D Pose Robust Estimation Using Differentiable
Outliers Elimination [15.736699709454857]
We propose REDE, a novel end-to-end object pose estimator using RGB-D data.
We also propose a differentiable outliers elimination method that regresses the candidate result and the confidence simultaneously.
The experimental results on three benchmark datasets show that REDE slightly outperforms the state-of-the-art approaches.
arXiv Detail & Related papers (2020-10-24T06:45:39Z) - Towards Better Generalization: Joint Depth-Pose Learning without PoseNet [36.414471128890284]
We tackle the essential problem of scale inconsistency for self-supervised joint depth-pose learning.
Most existing methods assume that a consistent scale of depth and pose can be learned across all input samples.
We propose a novel system that explicitly disentangles scale from the network estimation.
arXiv Detail & Related papers (2020-04-03T00:28:09Z) - Image Matching across Wide Baselines: From Paper to Practice [80.9424750998559]
We introduce a comprehensive benchmark for local features and robust estimation algorithms.
Our pipeline's modular structure allows easy integration, configuration, and combination of different methods.
We show that with proper settings, classical solutions may still outperform the perceived state of the art.
arXiv Detail & Related papers (2020-03-03T15:20:57Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.