Related papers: Online Adaptation of Monocular Depth Prediction with Visual SLAM

Online Adaptation of Monocular Depth Prediction with Visual SLAM

URL: http://arxiv.org/abs/2111.04096v1
Date: Sun, 7 Nov 2021 14:20:35 GMT
Title: Online Adaptation of Monocular Depth Prediction with Visual SLAM
Authors: Shing Yan Loo, Moein Shakeri, Sai Hong Tang, Syamsiah Mashohor, Hong Zhang
Abstract summary: The ability of accurate depth prediction by a CNN is a major challenge for its wide use in practical visual SLAM applications. We propose a novel online adaptation framework consisting of two complementary processes to fine-tune the depth prediction. Experimental results on both benchmark datasets and a real robot in our own experimental environments show that our proposed method improves the SLAM reconstruction accuracy.
Score: 8.478040209440868
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: The ability of accurate depth prediction by a CNN is a major challenge for its wide use in practical visual SLAM applications, such as enhanced camera tracking and dense mapping. This paper is set out to answer the following question: Can we tune a depth prediction CNN with the help of a visual SLAM algorithm even if the CNN is not trained for the current operating environment in order to benefit the SLAM performance? To this end, we propose a novel online adaptation framework consisting of two complementary processes: a SLAM algorithm that is used to generate keyframes to fine-tune the depth prediction and another algorithm that uses the online adapted depth to improve map quality. Once the potential noisy map points are removed, we perform global photometric bundle adjustment (BA) to improve the overall SLAM performance. Experimental results on both benchmark datasets and a real robot in our own experimental environments show that our proposed method improves the SLAM reconstruction accuracy. We demonstrate the use of regularization in the training loss as an effective means to prevent catastrophic forgetting. In addition, we compare our online adaptation framework against the state-of-the-art pre-trained depth prediction CNNs to show that our online adapted depth prediction CNN outperforms the depth prediction CNNs that have been trained on a large collection of datasets.

Related papers

UncLe-SLAM: Uncertainty Learning for Dense Neural SLAM [60.575435353047304]
We present an uncertainty learning framework for dense neural simultaneous localization and mapping (SLAM) We propose an online framework for sensor uncertainty estimation that can be trained in a self-supervised manner from only 2D input data.
arXiv Detail & Related papers (2023-06-19T16:26:25Z)
CNN-Augmented Visual-Inertial SLAM with Planar Constraints [26.024485121674328]
We present a robust visual-inertial SLAM system that combines the benefits of Convolutional Neural Networks (CNNs) and planar constraints. We use a CNN to predict the depth map and the corresponding uncertainty map for each image. We also present a fast plane detection method that detects horizontal planes via one-point RANSAC and vertical planes via two-point RANSAC.
arXiv Detail & Related papers (2022-05-05T21:49:57Z)
3DVNet: Multi-View Depth Prediction and Volumetric Refinement [68.68537312256144]
3DVNet is a novel multi-view stereo (MVS) depth-prediction method. Our key idea is the use of a 3D scene-modeling network that iteratively updates a set of coarse depth predictions. We show that our method exceeds state-of-the-art accuracy in both depth prediction and 3D reconstruction metrics.
arXiv Detail & Related papers (2021-12-01T00:52:42Z)
Application of 2-D Convolutional Neural Networks for Damage Detection in Steel Frame Structures [0.0]
We present an application of 2-D convolutional neural networks (2-D CNNs) designed to perform both feature extraction and classification stages. The method uses a network of lighted CNNs instead of deep and takes raw acceleration signals as input.
arXiv Detail & Related papers (2021-10-29T16:29:31Z)
A Front-End for Dense Monocular SLAM using a Learned Outlier Mask Prior [11.468537169201083]
Recent achievements in depth prediction from a single RGB image have powered the new research area of combining convolutional neural networks (CNNs) with classical simultaneous localization and mapping (SLAM) algorithms. Most of the current CNN-SLAM approaches have only taken advantage of the depth prediction but not yet other products from a CNN. We devise a dense CNN-assisted SLAM front-end that is implementable with sparse and evaluate it on both indoor and outdoor datasets.
arXiv Detail & Related papers (2021-04-01T15:43:28Z)
BreakingBED -- Breaking Binary and Efficient Deep Neural Networks by Adversarial Attacks [65.2021953284622]
We study robustness of CNNs against white-box and black-box adversarial attacks. Results are shown for distilled CNNs, agent-based state-of-the-art pruned models, and binarized neural networks.
arXiv Detail & Related papers (2021-03-14T20:43:19Z)
RIFLE: Backpropagation in Depth for Deep Transfer Learning through Re-Initializing the Fully-connected LayEr [60.07531696857743]
Fine-tuning the deep convolution neural network(CNN) using a pre-trained model helps transfer knowledge learned from larger datasets to the target task. We propose RIFLE - a strategy that deepens backpropagation in transfer learning settings. RIFLE brings meaningful updates to the weights of deep CNN layers and improves low-level feature learning.
arXiv Detail & Related papers (2020-07-07T11:27:43Z)
Cascaded Deep Video Deblurring Using Temporal Sharpness Prior [88.98348546566675]
The proposed algorithm mainly consists of optical flow estimation from intermediate latent frames and latent frame restoration steps. It first develops a deep CNN model to estimate optical flow from intermediate latent frames and then restores the latent frames based on the estimated optical flow. We show that exploring the domain knowledge of video deblurring is able to make the deep CNN model more compact and efficient.
arXiv Detail & Related papers (2020-04-06T09:13:49Z)
What Deep CNNs Benefit from Global Covariance Pooling: An Optimization Perspective [102.37204254403038]
We make an attempt to understand what deep CNNs benefit from GCP in a viewpoint of optimization. We show that GCP can make the optimization landscape more smooth and the gradients more predictive. We conduct extensive experiments using various deep CNN models on diversified tasks, and the results provide strong support to our findings.
arXiv Detail & Related papers (2020-03-25T07:00:45Z)

This list is automatically generated from the titles and abstracts of the papers in this site.

This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.