A Front-End for Dense Monocular SLAM using a Learned Outlier Mask Prior
- URL: http://arxiv.org/abs/2104.00562v1
- Date: Thu, 1 Apr 2021 15:43:28 GMT
- Title: A Front-End for Dense Monocular SLAM using a Learned Outlier Mask Prior
- Authors: Yihao Zhang and John J. Leonard
- Abstract summary: Recent achievements in depth prediction from a single RGB image have powered the new research area of combining convolutional neural networks (CNNs) with classical simultaneous localization and mapping (SLAM) algorithms.
Most of the current CNN-SLAM approaches have only taken advantage of the depth prediction but not yet other products from a CNN.
We devise a dense CNN-assisted SLAM front-end that is implementable with sparse and evaluate it on both indoor and outdoor datasets.
- Score: 11.468537169201083
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Recent achievements in depth prediction from a single RGB image have powered
the new research area of combining convolutional neural networks (CNNs) with
classical simultaneous localization and mapping (SLAM) algorithms. The depth
prediction from a CNN provides a reasonable initial point in the optimization
process in the traditional SLAM algorithms, while the SLAM algorithms further
improve the CNN prediction online. However, most of the current CNN-SLAM
approaches have only taken advantage of the depth prediction but not yet other
products from a CNN. In this work, we explore the use of the outlier mask, a
by-product from unsupervised learning of depth from video, as a prior in a
classical probability model for depth estimate fusion to step up the
outlier-resistant tracking performance of a SLAM front-end. On the other hand,
some of the previous CNN-SLAM work builds on feature-based sparse SLAM methods,
wasting the per-pixel dense prediction from a CNN. In contrast to these sparse
methods, we devise a dense CNN-assisted SLAM front-end that is implementable
with TensorFlow and evaluate it on both indoor and outdoor datasets.
Related papers
- Training Convolutional Neural Networks with the Forward-Forward
algorithm [1.74440662023704]
Forward Forward (FF) algorithm has up to now only been used in fully connected networks.
We show how the FF paradigm can be extended to CNNs.
Our FF-trained CNN, featuring a novel spatially-extended labeling technique, achieves a classification accuracy of 99.16% on the MNIST hand-written digits dataset.
arXiv Detail & Related papers (2023-12-22T18:56:35Z) - Point-SLAM: Dense Neural Point Cloud-based SLAM [61.96492935210654]
We propose a dense neural simultaneous localization and mapping (SLAM) approach for monocular RGBD input.
We demonstrate that both tracking and mapping can be performed with the same point-based neural scene representation.
arXiv Detail & Related papers (2023-04-09T16:48:26Z) - Scalable computation of prediction intervals for neural networks via
matrix sketching [79.44177623781043]
Existing algorithms for uncertainty estimation require modifying the model architecture and training procedure.
This work proposes a new algorithm that can be applied to a given trained neural network and produces approximate prediction intervals.
arXiv Detail & Related papers (2022-05-06T13:18:31Z) - 3DVNet: Multi-View Depth Prediction and Volumetric Refinement [68.68537312256144]
3DVNet is a novel multi-view stereo (MVS) depth-prediction method.
Our key idea is the use of a 3D scene-modeling network that iteratively updates a set of coarse depth predictions.
We show that our method exceeds state-of-the-art accuracy in both depth prediction and 3D reconstruction metrics.
arXiv Detail & Related papers (2021-12-01T00:52:42Z) - Online Adaptation of Monocular Depth Prediction with Visual SLAM [8.478040209440868]
The ability of accurate depth prediction by a CNN is a major challenge for its wide use in practical visual SLAM applications.
We propose a novel online adaptation framework consisting of two complementary processes to fine-tune the depth prediction.
Experimental results on both benchmark datasets and a real robot in our own experimental environments show that our proposed method improves the SLAM reconstruction accuracy.
arXiv Detail & Related papers (2021-11-07T14:20:35Z) - New SAR target recognition based on YOLO and very deep multi-canonical
correlation analysis [0.1503974529275767]
This paper proposes a robust feature extraction method for SAR image target classification by adaptively fusing effective features from different CNN layers.
Experiments on the MSTAR dataset demonstrate that the proposed method outperforms the state-of-the-art methods.
arXiv Detail & Related papers (2021-10-28T18:10:26Z) - Chroma Intra Prediction with attention-based CNN architectures [15.50693711359313]
This paper proposes a new neural network architecture for cross-component intra-prediction.
The network uses a novel attention module to model spatial relations between reference and predicted samples.
arXiv Detail & Related papers (2020-06-27T12:11:17Z) - Cascaded Deep Video Deblurring Using Temporal Sharpness Prior [88.98348546566675]
The proposed algorithm mainly consists of optical flow estimation from intermediate latent frames and latent frame restoration steps.
It first develops a deep CNN model to estimate optical flow from intermediate latent frames and then restores the latent frames based on the estimated optical flow.
We show that exploring the domain knowledge of video deblurring is able to make the deep CNN model more compact and efficient.
arXiv Detail & Related papers (2020-04-06T09:13:49Z) - MSE-Optimal Neural Network Initialization via Layer Fusion [68.72356718879428]
Deep neural networks achieve state-of-the-art performance for a range of classification and inference tasks.
The use of gradient combined nonvolutionity renders learning susceptible to novel problems.
We propose fusing neighboring layers of deeper networks that are trained with random variables.
arXiv Detail & Related papers (2020-01-28T18:25:15Z) - Approximation and Non-parametric Estimation of ResNet-type Convolutional
Neural Networks [52.972605601174955]
We show a ResNet-type CNN can attain the minimax optimal error rates in important function classes.
We derive approximation and estimation error rates of the aformentioned type of CNNs for the Barron and H"older classes.
arXiv Detail & Related papers (2019-03-24T19:42:39Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.