A Follow-the-Leader Strategy using Hierarchical Deep Neural Networks
with Grouped Convolutions
- URL: http://arxiv.org/abs/2011.07948v4
- Date: Wed, 28 Apr 2021 18:43:50 GMT
- Title: A Follow-the-Leader Strategy using Hierarchical Deep Neural Networks
with Grouped Convolutions
- Authors: Jose Solomon and Francois Charette
- Abstract summary: The task of following-the-leader is implemented using a hierarchical Deep Neural Network (DNN) end-to-end driving model.
The models are trained on the Intelligence Processing Unit (IPU) to leverage its fine-grain compute capabilities.
A recording of the vehicle tracking a pedestrian has been produced and is available on the web.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The task of following-the-leader is implemented using a hierarchical Deep
Neural Network (DNN) end-to-end driving model to match the direction and speed
of a target pedestrian. The model uses a classifier DNN to determine if the
pedestrian is within the field of view of the camera sensor. If the pedestrian
is present, the image stream from the camera is fed to a regression DNN which
simultaneously adjusts the autonomous vehicle's steering and throttle to keep
cadence with the pedestrian. If the pedestrian is not visible, the vehicle uses
a straightforward exploratory search strategy to reacquire the tracking
objective. The classifier and regression DNNs incorporate grouped convolutions
to boost model performance as well as to significantly reduce parameter count
and compute latency. The models are trained on the Intelligence Processing Unit
(IPU) to leverage its fine-grain compute capabilities in order to minimize
time-to-train. The results indicate very robust tracking behavior on the part
of the autonomous vehicle in terms of its steering and throttle profiles, while
requiring minimal data collection to produce. The throughput in terms of
processing training samples has been boosted by the use of the IPU in
conjunction with grouped convolutions by a factor ~3.5 for training of the
classifier and a factor of ~7 for the regression network. A recording of the
vehicle tracking a pedestrian has been produced and is available on the web.
This is a preprint of an article published in SN Computer Science. The final
authenticated version is available online at:
https://doi.org/https://doi.org/10.1007/s42979-021-00572-1.
Related papers
- Exploring Dynamic Transformer for Efficient Object Tracking [58.120191254379854]
We propose DyTrack, a dynamic transformer framework for efficient tracking.
DyTrack automatically learns to configure proper reasoning routes for various inputs, gaining better utilization of the available computational budget.
Experiments on multiple benchmarks demonstrate that DyTrack achieves promising speed-precision trade-offs with only a single model.
arXiv Detail & Related papers (2024-03-26T12:31:58Z) - Autonomous Slalom Maneuver Based on Expert Drivers' Behavior Using
Convolutional Neural Network [0.39146761527401425]
This paper presents a method to mimic drivers' behavior using a convolutional neural network (CNN)
CNN model computes the desired steering wheel angle and sends it to an adaptive PD controller.
In some trials, the presented method performed a smoother maneuver compared to the expert drivers.
arXiv Detail & Related papers (2023-01-18T10:47:43Z) - Cross-Camera Trajectories Help Person Retrieval in a Camera Network [124.65912458467643]
Existing methods often rely on purely visual matching or consider temporal constraints but ignore the spatial information of the camera network.
We propose a pedestrian retrieval framework based on cross-camera generation, which integrates both temporal and spatial information.
To verify the effectiveness of our method, we construct the first cross-camera pedestrian trajectory dataset.
arXiv Detail & Related papers (2022-04-27T13:10:48Z) - Monocular Vision-based Prediction of Cut-in Maneuvers with LSTM Networks [0.0]
This study proposes a method to predict potentially dangerous cut-in maneuvers happening in the ego lane.
We follow a computer vision-based approach that only employs a single in-vehicle RGB camera.
Our algorithm consists of a CNN-based vehicle detection and tracking step and an LSTM-based maneuver classification step.
arXiv Detail & Related papers (2022-03-21T02:30:36Z) - Pedestrian Detection: Domain Generalization, CNNs, Transformers and
Beyond [82.37430109152383]
We show that, current pedestrian detectors poorly handle even small domain shifts in cross-dataset evaluation.
We attribute the limited generalization to two main factors, the method and the current sources of data.
We propose a progressive fine-tuning strategy which improves generalization.
arXiv Detail & Related papers (2022-01-10T06:00:26Z) - Automatic Extraction of Road Networks from Satellite Images by using
Adaptive Structural Deep Belief Network [0.0]
Our model is applied to an automatic recognition method of road network system, called RoadTracer.
RoadTracer can generate a road map on the ground surface from aerial photograph data.
In order to improve the accuracy and the calculation time, our Adaptive DBN was implemented on the RoadTracer instead of the CNN.
arXiv Detail & Related papers (2021-10-25T07:06:10Z) - Real Time Monocular Vehicle Velocity Estimation using Synthetic Data [78.85123603488664]
We look at the problem of estimating the velocity of road vehicles from a camera mounted on a moving car.
We propose a two-step approach where first an off-the-shelf tracker is used to extract vehicle bounding boxes and then a small neural network is used to regress the vehicle velocity.
arXiv Detail & Related papers (2021-09-16T13:10:27Z) - ANNETTE: Accurate Neural Network Execution Time Estimation with Stacked
Models [56.21470608621633]
We propose a time estimation framework to decouple the architectural search from the target hardware.
The proposed methodology extracts a set of models from micro- kernel and multi-layer benchmarks and generates a stacked model for mapping and network execution time estimation.
We compare estimation accuracy and fidelity of the generated mixed models, statistical models with the roofline model, and a refined roofline model for evaluation.
arXiv Detail & Related papers (2021-05-07T11:39:05Z) - A Driving Behavior Recognition Model with Bi-LSTM and Multi-Scale CNN [59.57221522897815]
We propose a neural network model based on trajectories information for driving behavior recognition.
We evaluate the proposed model on the public BLVD dataset, achieving a satisfying performance.
arXiv Detail & Related papers (2021-03-01T06:47:29Z) - Spatio-Temporal Look-Ahead Trajectory Prediction using Memory Neural
Network [6.065344547161387]
This paper attempts to solve the problem of Spatio-temporal look-ahead trajectory prediction using a novel recurrent neural network called the Memory Neuron Network.
The proposed model is computationally less intensive and has a simple architecture as compared to other deep learning models that utilize LSTMs and GRUs.
arXiv Detail & Related papers (2021-02-24T05:02:19Z) - An Improved Deep Convolutional Neural Network-Based Autonomous Road
Inspection Scheme Using Unmanned Aerial Vehicles [12.618653234201089]
This work is an improved convolutional neural network (CNN) model and its implementation for the detection of road cracks, potholes, and yellow lane in the road.
The purpose of yellow lane detection and tracking is to realize autonomous navigation of unmanned aerial vehicle (UAV) by following yellow lane while detecting and reporting the road cracks and potholes to the server through WIFI or 5G medium.
arXiv Detail & Related papers (2020-08-14T04:35:10Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.