Related papers: Instant Visual Odometry Initialization for Mobile AR

URL: http://arxiv.org/abs/2107.14659v1
Date: Fri, 30 Jul 2021 14:25:40 GMT
Title: Instant Visual Odometry Initialization for Mobile AR
Authors: Alejo Concha, Michael Burri, Jesús Briales, Christian Forster and Luc Oth
Abstract summary: We present a 6-DoF monocular visual odometry that initializes instantly and without motion parallax. Our main contribution is a pose estimator that decouples estimating the 5-DoF relative rotation and translation direction from the 1-DoF translation magnitude. Our solution is either used as a full odometry or as a preSLAM component of any supported SLAM system.
Score: 5.497296425129818
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Mobile AR applications benefit from fast initialization to display world-locked effects instantly. However, standard visual odometry or SLAM algorithms require motion parallax to initialize (see Figure 1) and, therefore, suffer from delayed initialization. In this paper, we present a 6-DoF monocular visual odometry that initializes instantly and without motion parallax. Our main contribution is a pose estimator that decouples estimating the 5-DoF relative rotation and translation direction from the 1-DoF translation magnitude. While scale is not observable in a monocular vision-only setting, it is still paramount to estimate a consistent scale over the whole trajectory (even if not physically accurate) to avoid AR effects moving erroneously along depth. In our approach, we leverage the fact that depth errors are not perceivable to the user during rotation-only motion. However, as the user starts translating the device, depth becomes perceivable and so does the capability to estimate consistent scale. Our proposed algorithm naturally transitions between these two modes. We perform extensive validations of our contributions with both a publicly available dataset and synthetic data. We show that the proposed pose estimator outperforms the classical approaches for 6-DoF pose estimation used in the literature in low-parallax configurations. We release a dataset for the relative pose problem using real data to facilitate the comparison with future solutions for the relative pose problem. Our solution is either used as a full odometry or as a preSLAM component of any supported SLAM system (ARKit, ARCore) in world-locked AR effects on platforms such as Instagram and Facebook.
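The decoupling described in the abstract can be illustrated with a small sketch. This is not the authors' estimator; it only shows, under simple pinhole assumptions, how a relative pose splits into a rotation and unit translation direction (5 DoF) plus a translation magnitude (1 DoF), and why depth has no effect on the projected image point when the motion is rotation-only. The function names (rotation_y, split_translation, project) are hypothetical and chosen for illustration.

```python
import numpy as np


def rotation_y(angle_rad):
    """Rotation matrix for a rotation of angle_rad about the camera y axis."""
    c, s = np.cos(angle_rad), np.sin(angle_rad)
    return np.array([[c, 0.0, s],
                     [0.0, 1.0, 0.0],
                     [-s, 0.0, c]])


def split_translation(t, eps=1e-12):
    """Split a translation into a unit direction (2 DoF) and a magnitude (1 DoF).

    For a (near) zero translation the direction is undefined, so a zero
    vector and zero magnitude are returned.
    """
    t = np.asarray(t, dtype=float)
    magnitude = float(np.linalg.norm(t))
    if magnitude < eps:
        return np.zeros(3), 0.0
    return t / magnitude, magnitude


def project(R, direction, magnitude, bearing, depth):
    """Project a point seen in camera 1 into camera 2.

    The point lies at `depth` along the unit `bearing` vector in camera 1.
    Returns normalized image coordinates (x/z, y/z) in camera 2.
    """
    p1 = depth * np.asarray(bearing, dtype=float)
    p2 = R @ p1 + magnitude * np.asarray(direction, dtype=float)
    return p2[:2] / p2[2]


# Illustrative setup: a small rotation and one observed bearing vector.
R = rotation_y(0.1)
bearing = np.array([0.1, -0.2, 1.0])
bearing /= np.linalg.norm(bearing)
direction, magnitude = split_translation([0.5, 0.0, 0.0])

# Rotation-only motion (magnitude 0): the projection ignores depth.
near_rot = project(R, direction, 0.0, bearing, depth=1.0)
far_rot = project(R, direction, 0.0, bearing, depth=10.0)

# Translating motion: the projection now depends on depth.
near_trans = project(R, direction, magnitude, bearing, depth=1.0)
far_trans = project(R, direction, magnitude, bearing, depth=10.0)
```

In this sketch, near_rot and far_rot coincide while near_trans and far_trans differ, which matches the abstract's observation that depth errors are not perceivable during rotation-only motion and become perceivable once the device translates.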

Related papers

Full-DoF Egomotion Estimation for Event Cameras Using Geometric Solvers [24.889607741245246]
We propose several solvers to estimate both rotational and translational velocities within a unified framework. We demonstrate the possibility of recovering full-DoF egomotion parameters for both angular and linear velocities without requiring extra sensor measurements or motion priors.
arXiv Detail & Related papers (2025-03-05T09:39:51Z)
RSAR: Restricted State Angle Resolver and Rotated SAR Benchmark [61.987291551925516]
We introduce the Unit Cycle Resolver (UCR), which incorporates a unit circle constraint loss to improve angle prediction accuracy. Our approach can effectively improve the performance of existing state-of-the-art weakly supervised methods. With the aid of UCR, we further annotate and introduce RSAR, the largest multi-class rotated SAR object detection dataset to date.
arXiv Detail & Related papers (2025-01-08T11:41:47Z)
RoMeO: Robust Metric Visual Odometry [11.381243799745729]
Visual odometry (VO) aims to estimate camera poses from visual inputs -- a fundamental building block for many applications such as VR/AR and robotics. Existing approaches lack robustness under this challenging scenario and fail to generalize to unseen data (especially outdoors). We propose Robust Metric Visual Odometry (RoMeO), a novel method that resolves these issues by leveraging priors from pre-trained depth models.
arXiv Detail & Related papers (2024-12-16T08:08:35Z)
Gravity-aligned Rotation Averaging with Circular Regression [53.81374943525774]
We introduce a principled approach that integrates gravity direction into the rotation averaging phase of global pipelines. We achieve state-of-the-art accuracy on four large-scale datasets.
arXiv Detail & Related papers (2024-10-16T17:37:43Z)
DVMNet++: Rethinking Relative Pose Estimation for Unseen Objects [59.51874686414509]
Existing approaches typically predict 3D translation utilizing the ground-truth object bounding box and approximate 3D rotation with a large number of discrete hypotheses. We present a Deep Voxel Matching Network (DVMNet++) that computes the relative object pose in a single pass. Our approach delivers more accurate relative pose estimates for novel objects at a lower computational cost compared to state-of-the-art methods.
arXiv Detail & Related papers (2024-03-20T15:41:32Z)
RD-VIO: Robust Visual-Inertial Odometry for Mobile Augmented Reality in Dynamic Environments [55.864869961717424]
It is typically challenging for visual or visual-inertial odometry systems to handle the problems of dynamic scenes and pure rotation. We design a novel visual-inertial odometry (VIO) system called RD-VIO to handle both of these problems.
arXiv Detail & Related papers (2023-10-23T16:30:39Z)
RGB-based Category-level Object Pose Estimation via Decoupled Metric Scale Recovery [72.13154206106259]
We propose a novel pipeline that decouples the 6D pose and size estimation to mitigate the influence of imperfect scales on rigid transformations. Specifically, we leverage a pre-trained monocular estimator to extract local geometric information. A separate branch is designed to directly recover the metric scale of the object based on category-level statistics.
arXiv Detail & Related papers (2023-09-19T02:20:26Z)
Vanishing Point Estimation in Uncalibrated Images with Prior Gravity Direction [82.72686460985297]
We tackle the problem of estimating a Manhattan frame. We derive two new 2-line solvers, one of which does not suffer from singularities affecting existing solvers. We also design a new non-minimal method, running on an arbitrary number of lines, to boost the performance in local optimization.
arXiv Detail & Related papers (2023-08-21T13:03:25Z)
EDI: ESKF-based Disjoint Initialization for Visual-Inertial SLAM Systems [9.937997167972743]
We propose a novel approach for fast, accurate, and robust visual-inertial initialization. Our method achieves an average scale error of 5.8% in less than 3 seconds.
arXiv Detail & Related papers (2023-08-04T19:06:58Z)
Learned Monocular Depth Priors in Visual-Inertial Initialization [4.99761983273316]
Visual-inertial odometry (VIO) is the pose estimation backbone for most AR/VR and autonomous robotic systems today. We propose to circumvent the limitations of classical visual-inertial structure-from-motion (SfM). We leverage learned monocular depth images (mono-depth) to constrain the relative depth of features, and upgrade the mono-depth to metric scale by jointly optimizing for its scale and shift.
arXiv Detail & Related papers (2022-04-20T00:30:04Z)
DM-VIO: Delayed Marginalization Visual-Inertial Odometry [62.746533939737446]
We present DM-VIO, a visual-inertial system based on delayed marginalization and pose graph bundle adjustment. We evaluate our system on the EuRoC, TUM-VI, and 4Seasons datasets, which comprise flying drone, large-scale handheld, and automotive scenarios.
arXiv Detail & Related papers (2022-01-11T18:30:37Z)
Accurate and Robust Scale Recovery for Monocular Visual Odometry Based on Plane Geometry [7.169216737580712]
We develop a lightweight scale recovery framework leveraging an accurate and robust estimation of the ground plane. Experiments on the KITTI dataset show that the proposed framework can achieve state-of-the-art accuracy in terms of translation errors. Due to the lightweight design, our framework also demonstrates a high frequency of 20Hz on the dataset.
arXiv Detail & Related papers (2021-01-15T07:21:24Z)
Pushing the Envelope of Rotation Averaging for Visual SLAM [69.7375052440794]
We propose a novel optimization backbone for visual SLAM systems. We leverage averaging to improve the accuracy, efficiency and robustness of conventional monocular SLAM systems. Our approach can run up to 10x faster with accuracy comparable to the state of the art on public benchmarks.
arXiv Detail & Related papers (2020-11-02T18:02:26Z)
Monocular Rotational Odometry with Incremental Rotation Averaging and Loop Closure [35.467052373502575]
Estimating absolute camera orientations is essential for attitude estimation tasks. We devise a fast algorithm to accurately estimate camera orientations with 2D-2D feature matches alone. Underpinning our system is a new incremental rotation averaging method for fast and constant time iterative updating.
arXiv Detail & Related papers (2020-10-05T09:19:06Z)
Online Initialization and Extrinsic Spatial-Temporal Calibration for Monocular Visual-Inertial Odometry [19.955414423860788]
This paper presents an online method for bootstrapping the optimization-based monocular visual-inertial odometry (VIO). The method can calibrate online the relative transformation (spatial) and time offsets (temporal) between camera and IMU, as well as estimate the initial values of metric scale, velocity, gravity, gyroscope bias, and accelerometer bias. Experimental results on public datasets show that the initial values and the parameters, as well as the sensor poses, can be accurately estimated by the proposed method.
arXiv Detail & Related papers (2020-04-12T03:13:08Z)

This list is automatically generated from the titles and abstracts of the papers on this site.