Trajectory Prediction for Autonomous Driving based on Multi-Head
Attention with Joint Agent-Map Representation
- URL: http://arxiv.org/abs/2005.02545v3
- Date: Wed, 2 Sep 2020 22:41:33 GMT
- Title: Trajectory Prediction for Autonomous Driving based on Multi-Head
Attention with Joint Agent-Map Representation
- Authors: Kaouther Messaoud, Nachiket Deo, Mohan M. Trivedi, Fawzi Nashashibi
- Abstract summary: Future trajectories of agents can be inferred using two important cues: the locations and past motion of agents, and the static scene structure.
We propose a novel approach applying multi-head attention by considering a joint representation of the static scene and surrounding agents.
Our model achieves results on the nuScenes prediction benchmark and generates diverse future trajectories compliant with scene structure and agent configuration.
- Score: 8.203012391711932
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Predicting the trajectories of surrounding agents is an essential ability for
autonomous vehicles navigating through complex traffic scenes. The future
trajectories of agents can be inferred using two important cues: the locations
and past motion of agents, and the static scene structure. Due to the high
variability in scene structure and agent configurations, prior work has
employed the attention mechanism, applied separately to the scene and agent
configuration to learn the most salient parts of both cues. However, the two
cues are tightly linked. The agent configuration can inform what part of the
scene is most relevant to prediction. The static scene in turn can help
determine the relative influence of agents on each other's motion. Moreover,
the distribution of future trajectories is multimodal, with modes corresponding
to the agent's intent. The agent's intent also informs what part of the scene
and agent configuration is relevant to prediction. We thus propose a novel
approach applying multi-head attention by considering a joint representation of
the static scene and surrounding agents. We use each attention head to generate
a distinct future trajectory to address multimodality of future trajectories.
Our model achieves state of the art results on the nuScenes prediction
benchmark and generates diverse future trajectories compliant with scene
structure and agent configuration.
Related papers
- SIMMF: Semantics-aware Interactive Multiagent Motion Forecasting for
Autonomous Vehicle Driving [2.7195102129095003]
We propose a semantic-aware Interactive Multiagent Motion Forecasting (SIMMF) method to capture semantics along with spatial information.
Specifically, we achieve this by implementing a semantic-aware selection of relevant agents from the scene and passing them through an attention mechanism.
Our results show that the proposed approach outperforms state-of-the-art baselines and provides more accurate and scene-consistent predictions.
arXiv Detail & Related papers (2023-06-26T17:54:24Z) - Scene Transformer: A unified multi-task model for behavior prediction
and planning [42.758178896204036]
We formulate a model for predicting the behavior of all agents jointly in real-world driving environments.
Inspired by recent language modeling approaches, we use a masking strategy as the query to our model.
We evaluate our approach on autonomous driving datasets for behavior prediction, and achieve state-of-the-art performance.
arXiv Detail & Related papers (2021-06-15T20:20:44Z) - Latent Variable Nested Set Transformers & AutoBots [25.194344543085005]
We propose a theoretical framework for this problem setting based on autoregressively modelling sequences of nested sets.
We present a new model architecture which employs multi-head self-attention blocks over sets of sets that serve as a form of social attention between the elements of the sets at every timestep.
We validate the Nested Set Transformer for autonomous driving settings which we refer to as ("AutoBot"), where we model the trajectory of an ego-agent based on the sequential observations of key attributes of multiple agents in a scene.
arXiv Detail & Related papers (2021-02-19T18:53:26Z) - Instance-Aware Predictive Navigation in Multi-Agent Environments [93.15055834395304]
We propose an Instance-Aware Predictive Control (IPC) approach, which forecasts interactions between agents as well as future scene structures.
We adopt a novel multi-instance event prediction module to estimate the possible interaction among agents in the ego-centric view.
We design a sequential action sampling strategy to better leverage predicted states on both scene-level and instance-level.
arXiv Detail & Related papers (2021-01-14T22:21:25Z) - TNT: Target-driveN Trajectory Prediction [76.21200047185494]
We develop a target-driven trajectory prediction framework for moving agents.
We benchmark it on trajectory prediction of vehicles and pedestrians.
We outperform state-of-the-art on Argoverse Forecasting, INTERACTION, Stanford Drone and an in-house Pedestrian-at-Intersection dataset.
arXiv Detail & Related papers (2020-08-19T06:52:46Z) - Implicit Latent Variable Model for Scene-Consistent Motion Forecasting [78.74510891099395]
In this paper, we aim to learn scene-consistent motion forecasts of complex urban traffic directly from sensor data.
We model the scene as an interaction graph and employ powerful graph neural networks to learn a distributed latent representation of the scene.
arXiv Detail & Related papers (2020-07-23T14:31:25Z) - Traffic Agent Trajectory Prediction Using Social Convolution and
Attention Mechanism [57.68557165836806]
We propose a model to predict the trajectories of target agents around an autonomous vehicle.
We encode the target agent history trajectories as an attention mask and construct a social map to encode the interactive relationship between the target agent and its surrounding agents.
To verify the effectiveness of our method, we widely compare with several methods on a public dataset, achieving a 20% error decrease.
arXiv Detail & Related papers (2020-07-06T03:48:08Z) - Robust Trajectory Forecasting for Multiple Intelligent Agents in Dynamic
Scene [11.91073327154494]
We present a novel method for robust trajectory forecasting of multiple agents in dynamic scenes.
The proposed method outperforms the state-of-the-art prediction methods in terms of prediction accuracy.
arXiv Detail & Related papers (2020-05-27T02:32:55Z) - A Spatial-Temporal Attentive Network with Spatial Continuity for
Trajectory Prediction [74.00750936752418]
We propose a novel model named spatial-temporal attentive network with spatial continuity (STAN-SC)
First, spatial-temporal attention mechanism is presented to explore the most useful and important information.
Second, we conduct a joint feature sequence based on the sequence and instant state information to make the generative trajectories keep spatial continuity.
arXiv Detail & Related papers (2020-03-13T04:35:50Z) - Diverse and Admissible Trajectory Forecasting through Multimodal Context
Understanding [46.52703817997932]
Multi-agent trajectory forecasting in autonomous driving requires an agent to accurately anticipate the behaviors of the surrounding vehicles and pedestrians.
We propose a model that synthesizes multiple input signals from the multimodal world.
We show a significant performance improvement over previous state-of-the-art methods.
arXiv Detail & Related papers (2020-03-06T13:59:39Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.