Learning Pedestrian Actions to Ensure Safe Autonomous Driving
- URL: http://arxiv.org/abs/2305.13051v1
- Date: Mon, 22 May 2023 14:03:38 GMT
- Title: Learning Pedestrian Actions to Ensure Safe Autonomous Driving
- Authors: Jia Huang, Alvika Gautam, Srikanth Saripalli
- Abstract summary: It is critical for Autonomous Vehicles to have the ability to predict pedestrians' short-term and immediate actions in real-time.
In this work, a novel multi-task sequence to sequence Transformer encoders-decoders (TF-ed) architecture is proposed for pedestrian action and trajectory prediction.
The proposed approach is compared against an existing LSTM encoders decoders (LSTM-ed) architecture for action and trajectory prediction.
- Score: 12.440017892152417
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: To ensure safe autonomous driving in urban environments with complex
vehicle-pedestrian interactions, it is critical for Autonomous Vehicles (AVs)
to have the ability to predict pedestrians' short-term and immediate actions in
real-time. In recent years, various methods have been developed to study
estimating pedestrian behaviors for autonomous driving scenarios, but there is
a lack of clear definitions for pedestrian behaviors. In this work, the
literature gaps are investigated and a taxonomy is presented for pedestrian
behavior characterization. Further, a novel multi-task sequence to sequence
Transformer encoders-decoders (TF-ed) architecture is proposed for pedestrian
action and trajectory prediction using only ego vehicle camera observations as
inputs. The proposed approach is compared against an existing LSTM encoders
decoders (LSTM-ed) architecture for action and trajectory prediction. The
performance of both models is evaluated on the publicly available Joint
Attention Autonomous Driving (JAAD) dataset, CARLA simulation data as well as
real-time self-driving shuttle data collected on university campus. Evaluation
results illustrate that the proposed method reaches an accuracy of 81% on
action prediction task on JAAD testing data and outperforms the LSTM-ed by
7.4%, while LSTM counterpart performs much better on trajectory prediction task
for a prediction sequence length of 25 frames.
Related papers
- Planning with Adaptive World Models for Autonomous Driving [50.4439896514353]
Motion planners (MPs) are crucial for safe navigation in complex urban environments.
nuPlan, a recently released MP benchmark, addresses this limitation by augmenting real-world driving logs with closed-loop simulation logic.
We present AdaptiveDriver, a model-predictive control (MPC) based planner that unrolls different world models conditioned on BehaviorNet's predictions.
arXiv Detail & Related papers (2024-06-15T18:53:45Z) - Uncertainty-Aware DRL for Autonomous Vehicle Crowd Navigation in Shared Space [3.487370856323828]
This work introduces an integrated prediction and planning approach that incorporates the uncertainties of predicted pedestrian states in the training of a model-free DRL algorithm.
A novel reward function encourages the AV to respect pedestrians' personal space, decrease speed during close approaches, and minimize the collision probability with their predicted paths.
Results show a 40% decrease in collision rate and a 15% increase in minimum distance to pedestrians compared to the state of the art model that does not account for prediction uncertainty.
arXiv Detail & Related papers (2024-05-22T20:09:21Z) - AMP: Autoregressive Motion Prediction Revisited with Next Token Prediction for Autonomous Driving [59.94343412438211]
We introduce the GPT style next token motion prediction into motion prediction.
Different from language data which is composed of homogeneous units -words, the elements in the driving scene could have complex spatial-temporal and semantic relations.
We propose to adopt three factorized attention modules with different neighbors for information aggregation and different position encoding styles to capture their relations.
arXiv Detail & Related papers (2024-03-20T06:22:37Z) - Probabilistic Prediction of Longitudinal Trajectory Considering Driving
Heterogeneity with Interpretability [12.929047288003213]
This study proposes a trajectory prediction framework that combines Mixture Density Networks (MDN) and considers the driving heterogeneity to provide probabilistic and personalized predictions.
The proposed framework is tested based on a wide-range vehicle trajectory dataset.
arXiv Detail & Related papers (2023-12-19T12:56:56Z) - BAT: Behavior-Aware Human-Like Trajectory Prediction for Autonomous
Driving [24.123577277806135]
We pioneer a novel behavior-aware trajectory prediction model (BAT)
Our model consists of behavior-aware, interaction-aware, priority-aware, and position-aware modules.
We evaluate BAT's performance across the Next Generation Simulation (NGSIM), Highway Drone (HighD), Roundabout Drone (RounD), and Macao Connected Autonomous Driving (MoCAD) datasets.
arXiv Detail & Related papers (2023-12-11T13:27:51Z) - Pedestrian Trajectory Prediction via Spatial Interaction Transformer
Network [7.150832716115448]
In traffic scenes, when encountering with oncoming people, pedestrians may make sudden turns or stop immediately.
To predict such unpredictable trajectories, we can gain insights into the interaction between pedestrians.
We present a novel generative method named Spatial Interaction Transformer (SIT), which learns the correlation of pedestrian trajectories through attention mechanisms.
arXiv Detail & Related papers (2021-12-13T13:08:04Z) - IntentNet: Learning to Predict Intention from Raw Sensor Data [86.74403297781039]
In this paper, we develop a one-stage detector and forecaster that exploits both 3D point clouds produced by a LiDAR sensor as well as dynamic maps of the environment.
Our multi-task model achieves better accuracy than the respective separate modules while saving computation, which is critical to reducing reaction time in self-driving applications.
arXiv Detail & Related papers (2021-01-20T00:31:52Z) - Pedestrian Behavior Prediction for Automated Driving: Requirements,
Metrics, and Relevant Features [1.1888947789336193]
We analyze the requirements on pedestrian behavior prediction for automated driving via a system-level approach.
Based on human driving behavior we derive appropriate reaction patterns of an automated vehicle.
We present a pedestrian prediction model based on a Variational Conditional Auto-Encoder which incorporates multiple contextual cues.
arXiv Detail & Related papers (2020-12-15T16:52:49Z) - Detecting 32 Pedestrian Attributes for Autonomous Vehicles [103.87351701138554]
In this paper, we address the problem of jointly detecting pedestrians and recognizing 32 pedestrian attributes.
We introduce a Multi-Task Learning (MTL) model relying on a composite field framework, which achieves both goals in an efficient way.
We show competitive detection and attribute recognition results, as well as a more stable MTL training.
arXiv Detail & Related papers (2020-12-04T15:10:12Z) - SMART: Simultaneous Multi-Agent Recurrent Trajectory Prediction [72.37440317774556]
We propose advances that address two key challenges in future trajectory prediction.
multimodality in both training data and predictions and constant time inference regardless of number of agents.
arXiv Detail & Related papers (2020-07-26T08:17:10Z) - Action Sequence Predictions of Vehicles in Urban Environments using Map
and Social Context [152.0714518512966]
This work studies the problem of predicting the sequence of future actions for surround vehicles in real-world driving scenarios.
The first contribution is an automatic method to convert the trajectories recorded in real-world driving scenarios to action sequences with the help of HD maps.
The second contribution lies in applying the method to the well-known traffic agent tracking and prediction dataset Argoverse, resulting in 228,000 action sequences.
The third contribution is to propose a novel action sequence prediction method by integrating past positions and velocities of the traffic agents, map information and social context into a single end-to-end trainable neural network
arXiv Detail & Related papers (2020-04-29T14:59:58Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.