Unstructured Road Vanishing Point Detection Using the Convolutional
Neural Network and Heatmap Regression
- URL: http://arxiv.org/abs/2006.04691v1
- Date: Mon, 8 Jun 2020 15:44:37 GMT
- Title: Unstructured Road Vanishing Point Detection Using the Convolutional
Neural Network and Heatmap Regression
- Authors: Yin-Bo Liu, Ming Zeng, Qing-Hao Meng
- Abstract summary: We propose a novel solution combining the convolutional neural network (CNN) and heatmap regression to detect unstructured road VP.
The proposed algorithm firstly adopts a lightweight backbone, i.e., depthwise convolution modified HRNet, to extract hierarchical features of the unstructured road image.
Three advanced strategies, i.e., multi-scale supervised learning, heatmap super-resolution, and coordinate regression techniques are utilized to achieve fast and high-precision unstructured road VP detection.
- Score: 3.8170259685864165
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Unstructured road vanishing point (VP) detection is a challenging problem,
especially in the field of autonomous driving. In this paper, we proposed a
novel solution combining the convolutional neural network (CNN) and heatmap
regression to detect unstructured road VP. The proposed algorithm firstly
adopts a lightweight backbone, i.e., depthwise convolution modified HRNet, to
extract hierarchical features of the unstructured road image. Then, three
advanced strategies, i.e., multi-scale supervised learning, heatmap
super-resolution, and coordinate regression techniques are utilized to achieve
fast and high-precision unstructured road VP detection. The empirical results
on Kong's dataset show that our proposed approach enjoys the highest detection
accuracy compared with state-of-the-art methods under various conditions in
real-time, achieving the highest speed of 33 fps.
Related papers
- UHRNet: A Deep Learning-Based Method for Accurate 3D Reconstruction from
a Single Fringe-Pattern [3.5401671460123576]
We propose using a U shaped High resolution Network (UHRNet) to improve method's accuracy.
The network uses UNet encoding and decoding structure as backbone, with Multi-Level convolution Block and High resolution Fusion Block applied.
Our experimental results show that our proposed method can increase the accuracy of 3D reconstruction from a single fringe pattern.
arXiv Detail & Related papers (2023-04-23T08:39:05Z) - Spatial-Temporal Map Vehicle Trajectory Detection Using Dynamic Mode
Decomposition and Res-UNet+ Neural Networks [0.0]
This paper presents a machine-learning-enhanced longitudinal scanline method to extract vehicle trajectories from high-angle traffic cameras.
The Dynamic Mode Decomposition (DMD) method is applied to extract vehicle strands by decomposing the Spatial-Temporal Map (STMap) into the sparse foreground and low-rank background.
A deep neural network named Res-UNet+ was designed for the semantic segmentation task by adapting two prevalent deep learning architectures.
arXiv Detail & Related papers (2022-01-13T00:49:24Z) - Robust lEarned Shrinkage-Thresholding (REST): Robust unrolling for
sparse recover [87.28082715343896]
We consider deep neural networks for solving inverse problems that are robust to forward model mis-specifications.
We design a new robust deep neural network architecture by applying algorithm unfolding techniques to a robust version of the underlying recovery problem.
The proposed REST network is shown to outperform state-of-the-art model-based and data-driven algorithms in both compressive sensing and radar imaging problems.
arXiv Detail & Related papers (2021-10-20T06:15:45Z) - Adaptive Anomaly Detection for Internet of Things in Hierarchical Edge
Computing: A Contextual-Bandit Approach [81.5261621619557]
We propose an adaptive anomaly detection scheme with hierarchical edge computing (HEC)
We first construct multiple anomaly detection DNN models with increasing complexity, and associate each of them to a corresponding HEC layer.
Then, we design an adaptive model selection scheme that is formulated as a contextual-bandit problem and solved by using a reinforcement learning policy network.
arXiv Detail & Related papers (2021-08-09T08:45:47Z) - Trilevel Neural Architecture Search for Efficient Single Image
Super-Resolution [127.92235484598811]
This paper proposes a trilevel neural architecture search (NAS) method for efficient single image super-resolution (SR)
For modeling the discrete search space, we apply a new continuous relaxation on the discrete search spaces to build a hierarchical mixture of network-path, cell-operations, and kernel-width.
An efficient search algorithm is proposed to perform optimization in a hierarchical supernet manner.
arXiv Detail & Related papers (2021-01-17T12:19:49Z) - NOMA in UAV-aided cellular offloading: A machine learning approach [59.32570888309133]
A novel framework is proposed for cellular offloading with the aid of multiple unmanned aerial vehicles (UAVs)
Non-orthogonal multiple access (NOMA) technique is employed at each UAV to further improve the spectrum efficiency of the wireless network.
A mutual deep Q-network (MDQN) algorithm is proposed to jointly determine the optimal 3D trajectory and power allocation of UAVs.
arXiv Detail & Related papers (2020-10-18T17:38:48Z) - A Deep-Unfolded Reference-Based RPCA Network For Video
Foreground-Background Separation [86.35434065681925]
This paper proposes a new deep-unfolding-based network design for the problem of Robust Principal Component Analysis (RPCA)
Unlike existing designs, our approach focuses on modeling the temporal correlation between the sparse representations of consecutive video frames.
Experimentation using the moving MNIST dataset shows that the proposed network outperforms a recently proposed state-of-the-art RPCA network in the task of video foreground-background separation.
arXiv Detail & Related papers (2020-10-02T11:40:09Z) - Deep-3DAligner: Unsupervised 3D Point Set Registration Network With
Optimizable Latent Vector [15.900382629390297]
We propose to develop a novel model that integrates the optimization to learning, aiming to address the technical challenges in 3D registration.
In addition to the deep transformation decoding network, our framework introduce an optimizable deep underlineSpatial underlineCorrelation underlineRepresentation.
arXiv Detail & Related papers (2020-09-29T22:44:38Z) - Risk-Averse MPC via Visual-Inertial Input and Recurrent Networks for
Online Collision Avoidance [95.86944752753564]
We propose an online path planning architecture that extends the model predictive control (MPC) formulation to consider future location uncertainties.
Our algorithm combines an object detection pipeline with a recurrent neural network (RNN) which infers the covariance of state estimates.
The robustness of our methods is validated on complex quadruped robot dynamics and can be generally applied to most robotic platforms.
arXiv Detail & Related papers (2020-07-28T07:34:30Z) - Learning Robust Feature Representations for Scene Text Detection [0.0]
We present a network architecture derived from the loss to maximize conditional log-likelihood.
By extending the layer of latent variables to multiple layers, the network is able to learn robust features on scale.
In experiments, the proposed algorithm significantly outperforms state-of-the-art methods in terms of both recall and precision.
arXiv Detail & Related papers (2020-05-26T01:06:47Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.