Instant Visual Odometry Initialization for Mobile AR
- URL: http://arxiv.org/abs/2107.14659v1
- Date: Fri, 30 Jul 2021 14:25:40 GMT
- Title: Instant Visual Odometry Initialization for Mobile AR
- Authors: Alejo Concha, Michael Burri, Jes\'us Briales, Christian Forster and
Luc Oth
- Abstract summary: We present a 6-DoF monocular visual odometry that initializes instantly and without motion parallax.
Our main contribution is a pose estimator that decouples estimating the 5-DoF relative rotation and translation direction.
Our solution is either used as a full odometry or as a preSLAM component of any supported SLAM system.
- Score: 5.497296425129818
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Mobile AR applications benefit from fast initialization to display
world-locked effects instantly. However, standard visual odometry or SLAM
algorithms require motion parallax to initialize (see Figure 1) and, therefore,
suffer from delayed initialization. In this paper, we present a 6-DoF monocular
visual odometry that initializes instantly and without motion parallax. Our
main contribution is a pose estimator that decouples estimating the 5-DoF
relative rotation and translation direction from the 1-DoF translation
magnitude. While scale is not observable in a monocular vision-only setting, it
is still paramount to estimate a consistent scale over the whole trajectory
(even if not physically accurate) to avoid AR effects moving erroneously along
depth. In our approach, we leverage the fact that depth errors are not
perceivable to the user during rotation-only motion. However, as the user
starts translating the device, depth becomes perceivable and so does the
capability to estimate consistent scale. Our proposed algorithm naturally
transitions between these two modes. We perform extensive validations of our
contributions with both a publicly available dataset and synthetic data. We
show that the proposed pose estimator outperforms the classical approaches for
6-DoF pose estimation used in the literature in low-parallax configurations. We
release a dataset for the relative pose problem using real data to facilitate
the comparison with future solutions for the relative pose problem. Our
solution is either used as a full odometry or as a preSLAM component of any
supported SLAM system (ARKit, ARCore) in world-locked AR effects on platforms
such as Instagram and Facebook.
Related papers
- Gravity-aligned Rotation Averaging with Circular Regression [53.81374943525774]
We introduce a principled approach that integrates gravity direction into the rotation averaging phase of global pipelines.
We achieve state-of-the-art accuracy on four large-scale datasets.
arXiv Detail & Related papers (2024-10-16T17:37:43Z) - RD-VIO: Robust Visual-Inertial Odometry for Mobile Augmented Reality in
Dynamic Environments [55.864869961717424]
It is typically challenging for visual or visual-inertial odometry systems to handle the problems of dynamic scenes and pure rotation.
We design a novel visual-inertial odometry (VIO) system called RD-VIO to handle both of these problems.
arXiv Detail & Related papers (2023-10-23T16:30:39Z) - RGB-based Category-level Object Pose Estimation via Decoupled Metric
Scale Recovery [72.13154206106259]
We propose a novel pipeline that decouples the 6D pose and size estimation to mitigate the influence of imperfect scales on rigid transformations.
Specifically, we leverage a pre-trained monocular estimator to extract local geometric information.
A separate branch is designed to directly recover the metric scale of the object based on category-level statistics.
arXiv Detail & Related papers (2023-09-19T02:20:26Z) - Vanishing Point Estimation in Uncalibrated Images with Prior Gravity
Direction [82.72686460985297]
We tackle the problem of estimating a Manhattan frame.
We derive two new 2-line solvers, one of which does not suffer from singularities affecting existing solvers.
We also design a new non-minimal method, running on an arbitrary number of lines, to boost the performance in local optimization.
arXiv Detail & Related papers (2023-08-21T13:03:25Z) - EDI: ESKF-based Disjoint Initialization for Visual-Inertial SLAM Systems [9.937997167972743]
We propose a novel approach for fast, accurate, and robust visual-inertial initialization.
Our method achieves an average scale error of 5.8% in less than 3 seconds.
arXiv Detail & Related papers (2023-08-04T19:06:58Z) - Learned Monocular Depth Priors in Visual-Inertial Initialization [4.99761983273316]
Visual-inertial odometry (VIO) is the pose estimation backbone for most AR/VR and autonomous robotic systems today.
We propose to circumvent the limitations of classical visual-inertial structure-from-motion (SfM)
We leverage learned monocular depth images (mono-depth) to constrain the relative depth of features, and upgrade the mono-depth to metric scale by jointly optimizing for its scale and shift.
arXiv Detail & Related papers (2022-04-20T00:30:04Z) - DM-VIO: Delayed Marginalization Visual-Inertial Odometry [62.746533939737446]
We present DM-VIO, a visual-inertial system based on delayed marginalization and pose graph bundle adjustment.
We evaluate our system on the EuRoC, TUM-VI, and 4Seasons datasets, which comprise flying drone, large-scale handheld, and automotive scenarios.
arXiv Detail & Related papers (2022-01-11T18:30:37Z) - Accurate and Robust Scale Recovery for Monocular Visual Odometry Based
on Plane Geometry [7.169216737580712]
We develop a lightweight scale recovery framework leveraging an accurate and robust estimation of the ground plane.
Experiments on the KITTI dataset show that the proposed framework can achieve state-of-theart accuracy in terms of translation errors.
Due to the light-weight design, our framework also demonstrates a high frequency of 20Hz on the dataset.
arXiv Detail & Related papers (2021-01-15T07:21:24Z) - Pushing the Envelope of Rotation Averaging for Visual SLAM [69.7375052440794]
We propose a novel optimization backbone for visual SLAM systems.
We leverage averaging to improve the accuracy, efficiency and robustness of conventional monocular SLAM systems.
Our approach can exhibit up to 10x faster with comparable accuracy against the state-art on public benchmarks.
arXiv Detail & Related papers (2020-11-02T18:02:26Z) - Monocular Rotational Odometry with Incremental Rotation Averaging and
Loop Closure [35.467052373502575]
Estimating absolute camera orientations is essential for attitude estimation tasks.
We devise a fast algorithm to accurately estimate camera orientations with 2D-2D feature matches alone.
Underpinning our system is a new incremental rotation averaging method for fast and constant time iterative updating.
arXiv Detail & Related papers (2020-10-05T09:19:06Z) - Online Initialization and Extrinsic Spatial-Temporal Calibration for
Monocular Visual-Inertial Odometry [19.955414423860788]
This paper presents an online method for bootstrapping the optimization-based monocular visual-inertial odometry (VIO)
The method can online calibrate the relative transformation (spatial) and time offsets (temporal) among camera and IMU, as well as estimate the initial values of metric scale, velocity, gravity, gyroscope bias, and accelerometer bias.
Experimental results on public datasets show that the initial values and the parameters, as well as the sensor poses, can be accurately estimated by the proposed method.
arXiv Detail & Related papers (2020-04-12T03:13:08Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.