End-to-End Deep Learning of Lane Detection and Path Prediction for
Real-Time Autonomous Driving
- URL: http://arxiv.org/abs/2102.04738v1
- Date: Tue, 9 Feb 2021 10:04:39 GMT
- Title: End-to-End Deep Learning of Lane Detection and Path Prediction for
Real-Time Autonomous Driving
- Authors: Der-Hau Lee and Jinn-Liang Liu
- Abstract summary: We propose an end-to-end three-task convolutional neural network (3TCNN) for lane detection and road recognition.
Based on 3TCNN, we then propose lateral offset and path prediction (PP) algorithms to form an integrated model (3TCNN-PP)
We also develop a CNN-PP simulator that can be used to train a CNN by real or artificial traffic images, test it by artificial images, quantify its dynamic errors, and visualize its qualitative performance.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: We propose an end-to-end three-task convolutional neural network (3TCNN)
having two regression branches of bounding boxes and Hu moments and one
classification branch of object masks for lane detection and road recognition.
The Hu-moment regressor performs lane localization and road guidance using
local and global Hu moments of segmented lane objects, respectively. Based on
3TCNN, we then propose lateral offset and path prediction (PP) algorithms to
form an integrated model (3TCNN-PP) that can predict driving path with dynamic
estimation of lane centerline and path curvature for real-time autonomous
driving. We also develop a CNN-PP simulator that can be used to train a CNN by
real or artificial traffic images, test it by artificial images, quantify its
dynamic errors, and visualize its qualitative performance. Simulation results
show that 3TCNN-PP is comparable to related CNNs and better than a previous
CNN-PP, respectively. The code, annotated data, and simulation videos of this
work can be found on our website for further research on NN-PP algorithms of
autonomous driving.
Related papers
- 3DPyranet Features Fusion for Spatio-temporal Feature Learning [2.327279581393927]
3D pyramidal neural pyramid called 3DPyraNet and a discriminative approach for classifier-temporal feature learning called 3DPyraNet-F are proposed.
3DPyraNet-F extract the features maps of the highest layer of the learned network, fuse them in a single vector, and provide it as input in a way to a linear-SVM.
Results are reported with 3DPyraNet in real-world environments, especially in the presence of camera induced motion.
arXiv Detail & Related papers (2025-04-26T17:32:37Z) - Imitation Learning for Autonomous Driving: Insights from Real-World Testing [2.526146573337397]
This work focuses on the design of a deep learning-based autonomous driving system deployed and tested on the real-world MIT Racecar.
The Deep Neural Network (DNN) translates raw image inputs into real-time steering commands in an end-to-end learning fashion.
arXiv Detail & Related papers (2025-04-26T08:21:12Z) - Graph-Based Spatial-Temporal Convolutional Network for Vehicle
Trajectory Prediction in Autonomous Driving [2.6774008509841005]
This paper proposes a graph-based spatial-temporal convolutional network ( GSTCN) to predict future trajectory distributions of all neighbor vehicles.
The spatial-temporal features are encoded and decoded by a gated recurrent unit (GRU) network to generate future trajectory distributions.
Our network is evaluated on two real-world freeway trajectory datasets: I-80 and US-101 in the Next Generation Simulation (NGSIM)
arXiv Detail & Related papers (2021-09-27T02:20:38Z) - Graph Attention Layer Evolves Semantic Segmentation for Road Pothole
Detection: A Benchmark and Algorithms [34.80667966432126]
Existing road pothole detection approaches can be classified as computer vision-based or machine learning-based.
The latter approaches generally address road pothole detection using convolutional neural networks (CNNs) in an end-to-end manner.
We propose a novel CNN layer, referred to as graph attention layer (GAL), which can be easily deployed in any existing CNN to optimize image feature representations for semantic segmentation.
arXiv Detail & Related papers (2021-09-06T19:44:50Z) - A Driving Behavior Recognition Model with Bi-LSTM and Multi-Scale CNN [59.57221522897815]
We propose a neural network model based on trajectories information for driving behavior recognition.
We evaluate the proposed model on the public BLVD dataset, achieving a satisfying performance.
arXiv Detail & Related papers (2021-03-01T06:47:29Z) - Driving Style Representation in Convolutional Recurrent Neural Network
Model of Driver Identification [8.007800530105191]
We present a deep-neural-network architecture, we term D-CRNN, for building high-fidelity representations for driving style.
Using CNN, we capture semantic patterns of driver behavior from trajectories.
We then find temporal dependencies between these semantic patterns using RNN to encode driving style.
arXiv Detail & Related papers (2021-02-11T04:33:43Z) - Incorporating Kinematic Wave Theory into a Deep Learning Method for
High-Resolution Traffic Speed Estimation [3.0969191504482243]
We propose a kinematic wave based Deep Convolutional Neural Network (Deep CNN) to estimate high resolution traffic speed dynamics from sparse probe vehicle trajectories.
We introduce two key approaches that allow us to incorporate kinematic wave theory principles to improve the robustness of existing learning-based estimation methods.
arXiv Detail & Related papers (2021-02-04T21:51:25Z) - Fine-Grained Vehicle Perception via 3D Part-Guided Visual Data
Augmentation [77.60050239225086]
We propose an effective training data generation process by fitting a 3D car model with dynamic parts to vehicles in real images.
Our approach is fully automatic without any human interaction.
We present a multi-task network for VUS parsing and a multi-stream network for VHI parsing.
arXiv Detail & Related papers (2020-12-15T03:03:38Z) - Learning Hybrid Representations for Automatic 3D Vessel Centerline
Extraction [57.74609918453932]
Automatic blood vessel extraction from 3D medical images is crucial for vascular disease diagnoses.
Existing methods may suffer from discontinuities of extracted vessels when segmenting such thin tubular structures from 3D images.
We argue that preserving the continuity of extracted vessels requires to take into account the global geometry.
We propose a hybrid representation learning approach to address this challenge.
arXiv Detail & Related papers (2020-12-14T05:22:49Z) - A Lane-Changing Prediction Method Based on Temporal Convolution Network [36.84793673877468]
Lane-changing is an important driving behavior and unreasonable lane changes can result in potentially dangerous traffic collisions.
This study proposes a temporal convolutional network (TCN) to predict the long-term lane-changing trajectory and behavior.
arXiv Detail & Related papers (2020-11-01T07:33:10Z) - LGNN: A Context-aware Line Segment Detector [53.424521592941936]
We present a novel real-time line segment detection scheme called Line Graph Neural Network (LGNN)
Our LGNN employs a deep convolutional neural network (DCNN) for proposing line segment directly, with a graph neural network (GNN) module for reasoning their connectivities.
Compared with the state-of-the-art, LGNN achieves near real-time performance without compromising accuracy.
arXiv Detail & Related papers (2020-08-13T13:23:18Z) - Risk-Averse MPC via Visual-Inertial Input and Recurrent Networks for
Online Collision Avoidance [95.86944752753564]
We propose an online path planning architecture that extends the model predictive control (MPC) formulation to consider future location uncertainties.
Our algorithm combines an object detection pipeline with a recurrent neural network (RNN) which infers the covariance of state estimates.
The robustness of our methods is validated on complex quadruped robot dynamics and can be generally applied to most robotic platforms.
arXiv Detail & Related papers (2020-07-28T07:34:30Z) - PerMO: Perceiving More at Once from a Single Image for Autonomous
Driving [76.35684439949094]
We present a novel approach to detect, segment, and reconstruct complete textured 3D models of vehicles from a single image.
Our approach combines the strengths of deep learning and the elegance of traditional techniques.
We have integrated these algorithms with an autonomous driving system.
arXiv Detail & Related papers (2020-07-16T05:02:45Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.