A Review of Visual Odometry Methods and Its Applications for Autonomous
Driving
- URL: http://arxiv.org/abs/2009.09193v1
- Date: Sat, 19 Sep 2020 09:13:27 GMT
- Title: A Review of Visual Odometry Methods and Its Applications for Autonomous
Driving
- Authors: Kai Li Lim and Thomas Br\"aunl
- Abstract summary: This paper presents a review of methods that are pertinent to visual odometry with an emphasis on autonomous driving.
Discussions are drawn to outline the problems faced in the current state of research, and to summarise the works reviewed.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The research into autonomous driving applications has observed an increase in
computer vision-based approaches in recent years. In attempts to develop
exclusive vision-based systems, visual odometry is often considered as a key
element to achieve motion estimation and self-localisation, in place of wheel
odometry or inertial measurements. This paper presents a recent review to
methods that are pertinent to visual odometry with an emphasis on autonomous
driving. This review covers visual odometry in their monocular, stereoscopic
and visual-inertial form, individually presenting them with analyses related to
their applications. Discussions are drawn to outline the problems faced in the
current state of research, and to summarise the works reviewed. This paper
concludes with future work suggestions to aid prospective developments in
visual odometry.
Related papers
- A Survey on All-in-One Image Restoration: Taxonomy, Evaluation and Future Trends [67.43992456058541]
Image restoration (IR) refers to the process of improving visual quality of images while removing degradation, such as noise, blur, weather effects, and so on.
Traditional IR methods typically target specific types of degradation, which limits their effectiveness in real-world scenarios with complex distortions.
The all-in-one image restoration (AiOIR) paradigm has emerged, offering a unified framework that adeptly addresses multiple degradation types.
arXiv Detail & Related papers (2024-10-19T11:11:09Z) - The Roles of Contextual Semantic Relevance Metrics in Human Visual Processing [27.152245569974678]
This study introduces the metrics of contextual semantic relevance.
We evaluate semantic relationships between target objects and their surroundings from both vision-based and language-based perspectives.
We employ state-of-the-art deep learning techniques to compute these metrics and analyze their impacts on fixation measures on human visual processing.
arXiv Detail & Related papers (2024-10-13T17:05:47Z) - CodedVO: Coded Visual Odometry [11.33375308762075]
We present CodedVO, a novel monocular visual odometry method that overcomes the scale ambiguity problem.
We demonstrate our method in diverse indoor environments and demonstrate its robustness and adaptability.
arXiv Detail & Related papers (2024-07-25T17:54:58Z) - Automatic Discovery of Visual Circuits [66.99553804855931]
We explore scalable methods for extracting the subgraph of a vision model's computational graph that underlies recognition of a specific visual concept.
We find that our approach extracts circuits that causally affect model output, and that editing these circuits can defend large pretrained models from adversarial attacks.
arXiv Detail & Related papers (2024-04-22T17:00:57Z) - Transformer-based model for monocular visual odometry: a video
understanding approach [0.9790236766474201]
We deal with the monocular visual odometry as a video understanding task to estimate the 6-F camera's pose.
We contribute by presenting the TS-DoVO model based on on-temporal self-attention mechanisms to extract features from clips and estimate the motions in an end-to-end manner.
Our approach achieved competitive state-of-the-art performance compared with geometry-based and deep learning-based methods on the KITTI visual odometry dataset.
arXiv Detail & Related papers (2023-05-10T13:11:23Z) - A Survey on Computer Vision based Human Analysis in the COVID-19 Era [58.79053747159797]
The emergence of COVID-19 has had a global and profound impact, not only on society as a whole, but also on the lives of individuals.
Various prevention measures were introduced around the world to limit the transmission of the disease, including face masks, mandates for social distancing and regular disinfection in public spaces, and the use of screening applications.
These developments triggered the need for novel and improved computer vision techniques capable of (i) providing support to the prevention measures through an automated analysis of visual data, on the one hand, and (ii) facilitating normal operation of existing vision-based services, such as biometric authentication
arXiv Detail & Related papers (2022-11-07T17:20:39Z) - Dense Prediction Transformer for Scale Estimation in Monocular Visual
Odometry [0.0]
This paper contributes by showing an application of the dense prediction transformer model for scale estimation in monocular visual odometry systems.
Experimental results show that the scale drift problem of monocular systems can be reduced through the accurate estimation of the depth map.
arXiv Detail & Related papers (2022-10-04T16:29:21Z) - Eye Gaze Estimation Model Analysis [2.4366811507669124]
We discuss various model types for eye gaze estimation and present the results from predicting gaze direction using eye landmarks in unconstrained settings.
In unconstrained real-world settings, feature-based and model-based methods are outperformed by recent appearance-based methods due to factors like illumination changes and other visual artifacts.
arXiv Detail & Related papers (2022-07-28T20:40:03Z) - Deep Learning to See: Towards New Foundations of Computer Vision [88.69805848302266]
This book criticizes the supposed scientific progress in the field of computer vision.
It proposes the investigation of vision within the framework of information-based laws of nature.
arXiv Detail & Related papers (2022-06-30T15:20:36Z) - Automatic Gaze Analysis: A Survey of DeepLearning based Approaches [61.32686939754183]
Eye gaze analysis is an important research problem in the field of computer vision and Human-Computer Interaction.
There are several open questions including what are the important cues to interpret gaze direction in an unconstrained environment.
We review the progress across a range of gaze analysis tasks and applications to shed light on these fundamental questions.
arXiv Detail & Related papers (2021-08-12T00:30:39Z) - Survey on Visual Sentiment Analysis [87.20223213370004]
This paper reviews pertinent publications and tries to present an exhaustive overview of the field of Visual Sentiment Analysis.
The paper also describes principles of design of general Visual Sentiment Analysis systems from three main points of view.
A formalization of the problem is discussed, considering different levels of granularity, as well as the components that can affect the sentiment toward an image in different ways.
arXiv Detail & Related papers (2020-04-24T10:15:22Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.