Lifelong-MonoDepth: Lifelong Learning for Multi-Domain Monocular Metric
Depth Estimation
- URL: http://arxiv.org/abs/2303.05050v3
- Date: Fri, 13 Oct 2023 02:44:40 GMT
- Title: Lifelong-MonoDepth: Lifelong Learning for Multi-Domain Monocular Metric
Depth Estimation
- Authors: Junjie Hu, Chenyou Fan, Liguang Zhou, Qing Gao, Honghai Liu, Tin Lun
Lam
- Abstract summary: Lifelong learning approaches potentially offer significant cost savings in terms of model training, data storage, and collection.
The quality of RGB images and depth maps is sensor-dependent, and depth maps in the real world exhibit domain-specific characteristics, leading to variations in depth ranges.
These challenges limit existing methods to lifelong learning scenarios with small domain gaps and relative depth map estimation.
- Score: 24.74888757777775
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: With the rapid advancements in autonomous driving and robot navigation, there
is a growing demand for lifelong learning models capable of estimating metric
(absolute) depth. Lifelong learning approaches potentially offer significant
cost savings in terms of model training, data storage, and collection. However,
the quality of RGB images and depth maps is sensor-dependent, and depth maps in
the real world exhibit domain-specific characteristics, leading to variations
in depth ranges. These challenges limit existing methods to lifelong learning
scenarios with small domain gaps and relative depth map estimation. To
facilitate lifelong metric depth learning, we identify three crucial technical
challenges that require attention: i) developing a model capable of addressing
the depth scale variation through scale-aware depth learning, ii) devising an
effective learning strategy to handle significant domain gaps, and iii)
creating an automated solution for domain-aware depth inference in practical
applications. Based on the aforementioned considerations, in this paper, we
present i) a lightweight multi-head framework that effectively tackles the
depth scale imbalance, ii) an uncertainty-aware lifelong learning solution that
adeptly handles significant domain gaps, and iii) an online domain-specific
predictor selection method for real-time inference. Through extensive numerical
studies, we show that the proposed method can achieve good efficiency,
stability, and plasticity, leading the benchmarks by 8% to 15%.
Related papers
- Underwater Object Detection in the Era of Artificial Intelligence: Current, Challenge, and Future [119.88454942558485]
Underwater object detection (UOD) aims to identify and localise objects in underwater images or videos.
In recent years, artificial intelligence (AI) based methods, especially deep learning methods, have shown promising performance in UOD.
arXiv Detail & Related papers (2024-10-08T00:25:33Z) - Hyperspectral Image Analysis in Single-Modal and Multimodal setting
using Deep Learning Techniques [1.2328446298523066]
Hyperspectral imaging provides precise classification for land use and cover due to its exceptional spectral resolution.
However, the challenges of high dimensionality and limited spatial resolution hinder its effectiveness.
This study addresses these challenges by employing deep learning techniques to efficiently process, extract features, and classify data in an integrated manner.
arXiv Detail & Related papers (2024-03-03T15:47:43Z) - Depth-discriminative Metric Learning for Monocular 3D Object Detection [14.554132525651868]
We introduce a novel metric learning scheme that encourages the model to extract depth-discriminative features regardless of the visual attributes.
Our method consistently improves the performance of various baselines by 23.51% and 5.78% on average.
arXiv Detail & Related papers (2024-01-02T07:34:09Z) - ADU-Depth: Attention-based Distillation with Uncertainty Modeling for
Depth Estimation [11.92011909884167]
We introduce spatial cues by training a teacher network that leverages left-right image pairs as inputs.
We apply both attention-adapted feature distillation and focal-depth-adapted response distillation in the training stage.
Our experiments on the real depth estimation datasets KITTI and DrivingStereo demonstrate the effectiveness of the proposed method.
arXiv Detail & Related papers (2023-09-26T08:12:37Z) - A Threefold Review on Deep Semantic Segmentation: Efficiency-oriented,
Temporal and Depth-aware design [77.34726150561087]
We conduct a survey on the most relevant and recent advances in Deep Semantic in the context of vision for autonomous vehicles.
Our main objective is to provide a comprehensive discussion on the main methods, advantages, limitations, results and challenges faced from each perspective.
arXiv Detail & Related papers (2023-03-08T01:29:55Z) - Unsupervised Domain Adaptation for Monocular 3D Object Detection via
Self-Training [57.25828870799331]
We propose STMono3D, a new self-teaching framework for unsupervised domain adaptation on Mono3D.
We develop a teacher-student paradigm to generate adaptive pseudo labels on the target domain.
STMono3D achieves remarkable performance on all evaluated datasets and even surpasses fully supervised results on the KITTI 3D object detection dataset.
arXiv Detail & Related papers (2022-04-25T12:23:07Z) - SelfTune: Metrically Scaled Monocular Depth Estimation through
Self-Supervised Learning [53.78813049373321]
We propose a self-supervised learning method for the pre-trained supervised monocular depth networks to enable metrically scaled depth estimation.
Our approach is useful for various applications such as mobile robot navigation and is applicable to diverse environments.
arXiv Detail & Related papers (2022-03-10T12:28:42Z) - Geometry Uncertainty Projection Network for Monocular 3D Object
Detection [138.24798140338095]
We propose a Geometry Uncertainty Projection Network (GUP Net) to tackle the error amplification problem at both inference and training stages.
Specifically, a GUP module is proposed to obtains the geometry-guided uncertainty of the inferred depth.
At the training stage, we propose a Hierarchical Task Learning strategy to reduce the instability caused by error amplification.
arXiv Detail & Related papers (2021-07-29T06:59:07Z) - Approaches, Challenges, and Applications for Deep Visual Odometry:
Toward to Complicated and Emerging Areas [6.1102842961275226]
Visual odometry (VO) is a prevalent way to deal with the relative localization problem.
Deep learning-based methods can automatically learn effective and robust representations.
This paper aims to gain a deep insight on how deep learning can profit and optimize the VO systems.
arXiv Detail & Related papers (2020-09-06T08:25:23Z) - Meta-Gradient Reinforcement Learning with an Objective Discovered Online [54.15180335046361]
We propose an algorithm based on meta-gradient descent that discovers its own objective, flexibly parameterised by a deep neural network.
Because the objective is discovered online, it can adapt to changes over time.
On the Atari Learning Environment, the meta-gradient algorithm adapts over time to learn with greater efficiency.
arXiv Detail & Related papers (2020-07-16T16:17:09Z) - SPCNet:Spatial Preserve and Content-aware Network for Human Pose
Estimation [3.2540745519652434]
We propose a novel Spatial Preserve and Content-aware Network(SPCNet), which includes two effective modules: Dilated Hourglass Module(DHM) and Selective Information Module(SIM)
In particular, we exceed previous methods and achieve the state-of-the-art performance on three benchmark datasets.
arXiv Detail & Related papers (2020-04-13T09:14:00Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.