Traffic Agent Trajectory Prediction Using Social Convolution and
Attention Mechanism
- URL: http://arxiv.org/abs/2007.02515v1
- Date: Mon, 6 Jul 2020 03:48:08 GMT
- Title: Traffic Agent Trajectory Prediction Using Social Convolution and
Attention Mechanism
- Authors: Tao Yang, Zhixiong Nan, He Zhang, Shitao Chen and Nanning Zheng
- Abstract summary: We propose a model to predict the trajectories of target agents around an autonomous vehicle.
We encode the target agent history trajectories as an attention mask and construct a social map to encode the interactive relationship between the target agent and its surrounding agents.
To verify the effectiveness of our method, we widely compare with several methods on a public dataset, achieving a 20% error decrease.
- Score: 57.68557165836806
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The trajectory prediction is significant for the decision-making of
autonomous driving vehicles. In this paper, we propose a model to predict the
trajectories of target agents around an autonomous vehicle. The main idea of
our method is considering the history trajectories of the target agent and the
influence of surrounding agents on the target agent. To this end, we encode the
target agent history trajectories as an attention mask and construct a social
map to encode the interactive relationship between the target agent and its
surrounding agents. Given a trajectory sequence, the LSTM networks are firstly
utilized to extract the features for all agents, based on which the attention
mask and social map are formed. Then, the attention mask and social map are
fused to get the fusion feature map, which is processed by the social
convolution to obtain a fusion feature representation. Finally, this fusion
feature is taken as the input of a variable-length LSTM to predict the
trajectory of the target agent. We note that the variable-length LSTM enables
our model to handle the case that the number of agents in the sensing scope is
highly dynamic in traffic scenes. To verify the effectiveness of our method, we
widely compare with several methods on a public dataset, achieving a 20% error
decrease. In addition, the model satisfies the real-time requirement with the
32 fps.
Related papers
- Spatial and social situation-aware transformer-based trajectory prediction of autonomous systems [2.498836880652668]
Anticipating the behavior of an agent in a given situation is required to adequately react to it in time.
Deep learning-based models has become the dominant approach to motion prediction recently.
For longer prediction horizons, the deviation of the predicted trajectory from the ground truth is lower compared to a spatially and socially agnostic model.
arXiv Detail & Related papers (2024-06-04T20:36:16Z) - SocialFormer: Social Interaction Modeling with Edge-enhanced Heterogeneous Graph Transformers for Trajectory Prediction [3.733790302392792]
SocialFormer is an agent interaction-aware trajectory prediction method.
We present a temporal encoder based on gated recurrent units (GRU) to model the temporal social behavior of agent movements.
We evaluate SocialFormer for the trajectory prediction task on the popular nuScenes benchmark and achieve state-of-the-art performance.
arXiv Detail & Related papers (2024-05-06T19:47:23Z) - Trajeglish: Traffic Modeling as Next-Token Prediction [67.28197954427638]
A longstanding challenge for self-driving development is simulating dynamic driving scenarios seeded from recorded driving logs.
We apply tools from discrete sequence modeling to model how vehicles, pedestrians and cyclists interact in driving scenarios.
Our model tops the Sim Agents Benchmark, surpassing prior work along the realism meta metric by 3.3% and along the interaction metric by 9.9%.
arXiv Detail & Related papers (2023-12-07T18:53:27Z) - JRDB-Traj: A Dataset and Benchmark for Trajectory Forecasting in Crowds [79.00975648564483]
Trajectory forecasting models, employed in fields such as robotics, autonomous vehicles, and navigation, face challenges in real-world scenarios.
This dataset provides comprehensive data, including the locations of all agents, scene images, and point clouds, all from the robot's perspective.
The objective is to predict the future positions of agents relative to the robot using raw sensory input data.
arXiv Detail & Related papers (2023-11-05T18:59:31Z) - SIMMF: Semantics-aware Interactive Multiagent Motion Forecasting for
Autonomous Vehicle Driving [2.7195102129095003]
We propose a semantic-aware Interactive Multiagent Motion Forecasting (SIMMF) method to capture semantics along with spatial information.
Specifically, we achieve this by implementing a semantic-aware selection of relevant agents from the scene and passing them through an attention mechanism.
Our results show that the proposed approach outperforms state-of-the-art baselines and provides more accurate and scene-consistent predictions.
arXiv Detail & Related papers (2023-06-26T17:54:24Z) - Traj-MAE: Masked Autoencoders for Trajectory Prediction [69.7885837428344]
Trajectory prediction has been a crucial task in building a reliable autonomous driving system by anticipating possible dangers.
We propose an efficient masked autoencoder for trajectory prediction (Traj-MAE) that better represents the complicated behaviors of agents in the driving environment.
Our experimental results in both multi-agent and single-agent settings demonstrate that Traj-MAE achieves competitive results with state-of-the-art methods.
arXiv Detail & Related papers (2023-03-12T16:23:27Z) - GoRela: Go Relative for Viewpoint-Invariant Motion Forecasting [121.42898228997538]
We propose an efficient shared encoding for all agents and the map without sacrificing accuracy or generalization.
We leverage pair-wise relative positional encodings to represent geometric relationships between the agents and the map elements in a heterogeneous spatial graph.
Our decoder is also viewpoint agnostic, predicting agent goals on the lane graph to enable diverse and context-aware multimodal prediction.
arXiv Detail & Related papers (2022-11-04T16:10:50Z) - Decoder Fusion RNN: Context and Interaction Aware Decoders for
Trajectory Prediction [53.473846742702854]
We propose a recurrent, attention-based approach for motion forecasting.
Decoder Fusion RNN (DF-RNN) is composed of a recurrent behavior encoder, an inter-agent multi-headed attention module, and a context-aware decoder.
We demonstrate the efficacy of our method by testing it on the Argoverse motion forecasting dataset and show its state-of-the-art performance on the public benchmark.
arXiv Detail & Related papers (2021-08-12T15:53:37Z) - Spatial-Channel Transformer Network for Trajectory Prediction on the
Traffic Scenes [2.7955111755177695]
We present a Spatial-Channel Transformer Network for trajectory prediction with attention functions.
A channel-wise module is inserted to measure the social interaction between agents.
We find that the network achieves promising results on real-world trajectory prediction datasets on the traffic scenes.
arXiv Detail & Related papers (2021-01-27T15:03:42Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.