Accurate and Robust Scale Recovery for Monocular Visual Odometry Based
on Plane Geometry
- URL: http://arxiv.org/abs/2101.05995v1
- Date: Fri, 15 Jan 2021 07:21:24 GMT
- Title: Accurate and Robust Scale Recovery for Monocular Visual Odometry Based
on Plane Geometry
- Authors: Rui Tian, Yunzhou Zhang, Delong Zhu, Shiwen Liang, Sonya Coleman,
Dermot Kerr
- Abstract summary: We develop a lightweight scale recovery framework leveraging an accurate and robust estimation of the ground plane.
Experiments on the KITTI dataset show that the proposed framework can achieve state-of-theart accuracy in terms of translation errors.
Due to the light-weight design, our framework also demonstrates a high frequency of 20Hz on the dataset.
- Score: 7.169216737580712
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Scale ambiguity is a fundamental problem in monocular visual odometry.
Typical solutions include loop closure detection and environment information
mining. For applications like self-driving cars, loop closure is not always
available, hence mining prior knowledge from the environment becomes a more
promising approach. In this paper, with the assumption of a constant height of
the camera above the ground, we develop a light-weight scale recovery framework
leveraging an accurate and robust estimation of the ground plane. The framework
includes a ground point extraction algorithm for selecting high-quality points
on the ground plane, and a ground point aggregation algorithm for joining the
extracted ground points in a local sliding window. Based on the aggregated
data, the scale is finally recovered by solving a least-squares problem using a
RANSAC-based optimizer. Sufficient data and robust optimizer enable a highly
accurate scale recovery. Experiments on the KITTI dataset show that the
proposed framework can achieve state-of-the-art accuracy in terms of
translation errors, while maintaining competitive performance on the rotation
error. Due to the light-weight design, our framework also demonstrates a high
frequency of 20Hz on the dataset.
Related papers
- Mode-GS: Monocular Depth Guided Anchored 3D Gaussian Splatting for Robust Ground-View Scene Rendering [47.879695094904015]
We present a novelview rendering algorithm, Mode-GS, for ground-robot trajectory datasets.
Our approach is based on using anchored Gaussian splats, which are designed to overcome the limitations of existing 3D Gaussian splatting algorithms.
Our method results in improved rendering performance, based on PSNR, SSIM, and LPIPS metrics, in ground scenes with free trajectory patterns.
arXiv Detail & Related papers (2024-10-06T23:01:57Z) - OPUS: Occupancy Prediction Using a Sparse Set [64.60854562502523]
We present a framework to simultaneously predict occupied locations and classes using a set of learnable queries.
OPUS incorporates a suite of non-trivial strategies to enhance model performance.
Our lightest model achieves superior RayIoU on the Occ3D-nuScenes dataset at near 2x FPS, while our heaviest model surpasses previous best results by 6.1 RayIoU.
arXiv Detail & Related papers (2024-09-14T07:44:22Z) - Robust Second-order LiDAR Bundle Adjustment Algorithm Using Mean Squared Group Metric [5.153195958837083]
We propose a novel mean square group metric (MSGM) to build the optimization objective in the LiDAR BA algorithm.
By integrating a robust kernel function, the metrics involved in the BA algorithm are reweighted, and thus enhancing the robustness of the solution process.
arXiv Detail & Related papers (2024-09-03T12:53:39Z) - Arbitrary-Scale Point Cloud Upsampling by Voxel-Based Network with
Latent Geometric-Consistent Learning [52.825441454264585]
We propose an arbitrary-scale Point cloud Upsampling framework using Voxel-based Network (textbfPU-VoxelNet)
Thanks to the completeness and regularity inherited from the voxel representation, voxel-based networks are capable of providing predefined grid space to approximate 3D surface.
A density-guided grid resampling method is developed to generate high-fidelity points while effectively avoiding sampling outliers.
arXiv Detail & Related papers (2024-03-08T07:31:14Z) - Vanishing Point Estimation in Uncalibrated Images with Prior Gravity
Direction [82.72686460985297]
We tackle the problem of estimating a Manhattan frame.
We derive two new 2-line solvers, one of which does not suffer from singularities affecting existing solvers.
We also design a new non-minimal method, running on an arbitrary number of lines, to boost the performance in local optimization.
arXiv Detail & Related papers (2023-08-21T13:03:25Z) - Real-Time Simultaneous Localization and Mapping with LiDAR intensity [9.374695605941627]
We propose a novel real-time LiDAR intensity image-based simultaneous localization and mapping method.
Our method can run in real time with high accuracy and works well with illumination changes, low-texture, and unstructured environments.
arXiv Detail & Related papers (2023-01-23T03:59:48Z) - Visual SLAM with Graph-Cut Optimized Multi-Plane Reconstruction [11.215334675788952]
This paper presents a semantic planar SLAM system that improves pose estimation and mapping using cues from an instance planar segmentation network.
While the mainstream approaches are using RGB-D sensors, employing a monocular camera with such a system still faces challenges such as robust data association and precise geometric model fitting.
arXiv Detail & Related papers (2021-08-09T18:16:08Z) - GPO: Global Plane Optimization for Fast and Accurate Monocular SLAM
Initialization [22.847353792031488]
The algorithm starts by homography estimation in a sliding window.
The proposed method fully exploits the plane information from multiple frames and avoids the ambiguities in homography decomposition.
Experimental results show that our method outperforms the fine-tuned baselines in both accuracy and real-time.
arXiv Detail & Related papers (2020-04-25T03:57:50Z) - Refined Plane Segmentation for Cuboid-Shaped Objects by Leveraging Edge
Detection [63.942632088208505]
We propose a post-processing algorithm to align the segmented plane masks with edges detected in the image.
This allows us to increase the accuracy of state-of-the-art approaches, while limiting ourselves to cuboid-shaped objects.
arXiv Detail & Related papers (2020-03-28T18:51:43Z) - On the Arbitrary-Oriented Object Detection: Classification based
Approaches Revisited [94.5455251250471]
We first show that the boundary problem suffered in existing dominant regression-based rotation detectors, is caused by angular periodicity or corner ordering.
We transform the angular prediction task from a regression problem to a classification one.
For the resulting circularly distributed angle classification problem, we first devise a Circular Smooth Label technique to handle the periodicity of angle and increase the error tolerance to adjacent angles.
arXiv Detail & Related papers (2020-03-12T03:23:54Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.