CMRNet++: Map and Camera Agnostic Monocular Visual Localization in LiDAR
Maps
- URL: http://arxiv.org/abs/2004.13795v2
- Date: Fri, 22 May 2020 09:00:25 GMT
- Title: CMRNet++: Map and Camera Agnostic Monocular Visual Localization in LiDAR
Maps
- Authors: Daniele Cattaneo, Domenico Giorgio Sorrenti, Abhinav Valada
- Abstract summary: CMRNet++ is a more robust model that generalizes to new places effectively and is also independent of the camera parameters.
We demonstrate the ability of a deep learning approach to accurately localize without any retraining or fine-tuning in a completely new environment.
- Score: 10.578312278413199
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Localization is a critically essential and crucial enabler of autonomous
robots. While deep learning has made significant strides in many computer
vision tasks, it is still yet to make a sizeable impact on improving
capabilities of metric visual localization. One of the major hindrances has
been the inability of existing Convolutional Neural Network (CNN)-based pose
regression methods to generalize to previously unseen places. Our recently
introduced CMRNet effectively addresses this limitation by enabling map
independent monocular localization in LiDAR-maps. In this paper, we now take it
a step further by introducing CMRNet++, which is a significantly more robust
model that not only generalizes to new places effectively, but is also
independent of the camera parameters. We enable this capability by combining
deep learning with geometric techniques, and by moving the metric reasoning
outside the learning process. In this way, the weights of the network are not
tied to a specific camera. Extensive evaluations of CMRNet++ on three
challenging autonomous driving datasets, i.e., KITTI, Argoverse, and Lyft5,
show that CMRNet++ outperforms CMRNet as well as other baselines by a large
margin. More importantly, for the first-time, we demonstrate the ability of a
deep learning approach to accurately localize without any retraining or
fine-tuning in a completely new environment and independent of the camera
parameters.
Related papers
- SODAWideNet -- Salient Object Detection with an Attention augmented Wide
Encoder Decoder network without ImageNet pre-training [3.66237529322911]
We explore developing a neural network from scratch directly trained on Salient Object Detection without ImageNet pre-training.
We propose SODAWideNet, an encoder-decoder-style network for Salient Object Detection.
Two variants, SODAWideNet-S (3.03M) and SODAWideNet (9.03M), achieve competitive performance against state-of-the-art models on five datasets.
arXiv Detail & Related papers (2023-11-08T16:53:44Z) - UnLoc: A Universal Localization Method for Autonomous Vehicles using
LiDAR, Radar and/or Camera Input [51.150605800173366]
UnLoc is a novel unified neural modeling approach for localization with multi-sensor input in all weather conditions.
Our method is extensively evaluated on Oxford Radar RobotCar, ApolloSouthBay and Perth-WA datasets.
arXiv Detail & Related papers (2023-07-03T04:10:55Z) - CondenseNeXt: An Ultra-Efficient Deep Neural Network for Embedded
Systems [0.0]
A Convolutional Neural Network (CNN) is a class of Deep Neural Network (DNN) widely used in the analysis of visual images captured by an image sensor.
In this paper, we propose a neoteric variant of deep convolutional neural network architecture to ameliorate the performance of existing CNN architectures for real-time inference on embedded systems.
arXiv Detail & Related papers (2021-12-01T18:20:52Z) - StereoSpike: Depth Learning with a Spiking Neural Network [0.0]
We present an end-to-end neuromorphic approach to depth estimation.
We use a Spiking Neural Network (SNN) with a slightly modified U-Net-like encoder-decoder architecture, that we named StereoSpike.
We demonstrate that this architecture generalizes very well, even better than its non-spiking counterparts.
arXiv Detail & Related papers (2021-09-28T14:11:36Z) - Markov Localisation using Heatmap Regression and Deep Convolutional
Odometry [59.33322623437816]
We present a novel CNN-based localisation approach that can leverage modern deep learning hardware.
We create a hybrid CNN that can perform image-based localisation and odometry-based likelihood propagation within a single neural network.
arXiv Detail & Related papers (2021-06-01T10:28:49Z) - PCLs: Geometry-aware Neural Reconstruction of 3D Pose with Perspective
Crop Layers [111.55817466296402]
We introduce Perspective Crop Layers (PCLs) - a form of perspective crop of the region of interest based on the camera geometry.
PCLs deterministically remove the location-dependent perspective effects while leaving end-to-end training and the number of parameters of the underlying neural network.
PCL offers an easy way to improve the accuracy of existing 3D reconstruction networks by making them geometry aware.
arXiv Detail & Related papers (2020-11-27T08:48:43Z) - Unsupervised Metric Relocalization Using Transform Consistency Loss [66.19479868638925]
Training networks to perform metric relocalization traditionally requires accurate image correspondences.
We propose a self-supervised solution, which exploits a key insight: localizing a query image within a map should yield the same absolute pose, regardless of the reference image used for registration.
We evaluate our framework on synthetic and real-world data, showing our approach outperforms other supervised methods when a limited amount of ground-truth information is available.
arXiv Detail & Related papers (2020-11-01T19:24:27Z) - SynDistNet: Self-Supervised Monocular Fisheye Camera Distance Estimation
Synergized with Semantic Segmentation for Autonomous Driving [37.50089104051591]
State-of-the-art self-supervised learning approaches for monocular depth estimation usually suffer from scale ambiguity.
This paper introduces a novel multi-task learning strategy to improve self-supervised monocular distance estimation on fisheye and pinhole camera images.
arXiv Detail & Related papers (2020-08-10T10:52:47Z) - When Residual Learning Meets Dense Aggregation: Rethinking the
Aggregation of Deep Neural Networks [57.0502745301132]
We propose Micro-Dense Nets, a novel architecture with global residual learning and local micro-dense aggregations.
Our micro-dense block can be integrated with neural architecture search based models to boost their performance.
arXiv Detail & Related papers (2020-04-19T08:34:52Z) - Deep Adaptive Inference Networks for Single Image Super-Resolution [72.7304455761067]
Single image super-resolution (SISR) has witnessed tremendous progress in recent years owing to the deployment of deep convolutional neural networks (CNNs)
In this paper, we take a step forward to address this issue by leveraging the adaptive inference networks for deep SISR (AdaDSR)
Our AdaDSR involves an SISR model as backbone and a lightweight adapter module which takes image features and resource constraint as input and predicts a map of local network depth.
arXiv Detail & Related papers (2020-04-08T10:08:20Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.