Exploring Dynamic Context for Multi-path Trajectory Prediction
- URL: http://arxiv.org/abs/2010.16267v3
- Date: Wed, 24 Mar 2021 10:28:47 GMT
- Title: Exploring Dynamic Context for Multi-path Trajectory Prediction
- Authors: Hao Cheng, Wentong Liao, Xuejiao Tang, Michael Ying Yang, Monika
Sester, Bodo Rosenhahn
- Abstract summary: We propose a novel framework, named Dynamic Context Network (DCENet)
In our framework, the spatial context between agents is explored by using self-attention architectures.
A set of future trajectories for each agent is predicted conditioned on the learned spatial-temporal context.
- Score: 33.66335553588001
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: To accurately predict future positions of different agents in traffic
scenarios is crucial for safely deploying intelligent autonomous systems in the
real-world environment. However, it remains a challenge due to the behavior of
a target agent being affected by other agents dynamically and there being more
than one socially possible paths the agent could take. In this paper, we
propose a novel framework, named Dynamic Context Encoder Network (DCENet). In
our framework, first, the spatial context between agents is explored by using
self-attention architectures. Then, the two-stream encoders are trained to
learn temporal context between steps by taking the respective observed
trajectories and the extracted dynamic spatial context as input. The
spatial-temporal context is encoded into a latent space using a Conditional
Variational Auto-Encoder (CVAE) module. Finally, a set of future trajectories
for each agent is predicted conditioned on the learned spatial-temporal context
by sampling from the latent space, repeatedly. DCENet is evaluated on one of
the most popular challenging benchmarks for trajectory forecasting Trajnet and
reports a new state-of-the-art performance. It also demonstrates superior
performance evaluated on the benchmark inD for mixed traffic at intersections.
A series of ablation studies is conducted to validate the effectiveness of each
proposed module. Our code is available at https://github.com/wtliao/DCENet.
Related papers
- Deciphering Movement: Unified Trajectory Generation Model for Multi-Agent [53.637837706712794]
We propose a Unified Trajectory Generation model, UniTraj, that processes arbitrary trajectories as masked inputs.
Specifically, we introduce a Ghost Spatial Masking (GSM) module embedded within a Transformer encoder for spatial feature extraction.
We benchmark three practical sports game datasets, Basketball-U, Football-U, and Soccer-U, for evaluation.
arXiv Detail & Related papers (2024-05-27T22:15:23Z) - AMP: Autoregressive Motion Prediction Revisited with Next Token Prediction for Autonomous Driving [59.94343412438211]
We introduce the GPT style next token motion prediction into motion prediction.
Different from language data which is composed of homogeneous units -words, the elements in the driving scene could have complex spatial-temporal and semantic relations.
We propose to adopt three factorized attention modules with different neighbors for information aggregation and different position encoding styles to capture their relations.
arXiv Detail & Related papers (2024-03-20T06:22:37Z) - Interactive Autonomous Navigation with Internal State Inference and
Interactivity Estimation [58.21683603243387]
We propose three auxiliary tasks with relational-temporal reasoning and integrate them into the standard Deep Learning framework.
These auxiliary tasks provide additional supervision signals to infer the behavior patterns other interactive agents.
Our approach achieves robust and state-of-the-art performance in terms of standard evaluation metrics.
arXiv Detail & Related papers (2023-11-27T18:57:42Z) - Context-Aware Timewise VAEs for Real-Time Vehicle Trajectory Prediction [4.640835690336652]
We present ContextVAE, a context-aware approach for multi-modal vehicle trajectory prediction.
Our approach takes into account both the social features exhibited by agents on the scene and the physical environment constraints.
In all tested datasets, ContextVAE models are fast to train and provide high-quality multi-modal predictions in real-time.
arXiv Detail & Related papers (2023-02-21T18:42:24Z) - Exploring Attention GAN for Vehicle Motion Prediction [2.887073662645855]
We study the influence of attention in generative models for motion prediction, considering both physical and social context.
We validate our method using the Argoverse Motion Forecasting Benchmark 1.1, achieving competitive unimodal results.
arXiv Detail & Related papers (2022-09-26T13:18:32Z) - TNT: Target-driveN Trajectory Prediction [76.21200047185494]
We develop a target-driven trajectory prediction framework for moving agents.
We benchmark it on trajectory prediction of vehicles and pedestrians.
We outperform state-of-the-art on Argoverse Forecasting, INTERACTION, Stanford Drone and an in-house Pedestrian-at-Intersection dataset.
arXiv Detail & Related papers (2020-08-19T06:52:46Z) - SMART: Simultaneous Multi-Agent Recurrent Trajectory Prediction [72.37440317774556]
We propose advances that address two key challenges in future trajectory prediction.
multimodality in both training data and predictions and constant time inference regardless of number of agents.
arXiv Detail & Related papers (2020-07-26T08:17:10Z) - Robust Trajectory Forecasting for Multiple Intelligent Agents in Dynamic
Scene [11.91073327154494]
We present a novel method for robust trajectory forecasting of multiple agents in dynamic scenes.
The proposed method outperforms the state-of-the-art prediction methods in terms of prediction accuracy.
arXiv Detail & Related papers (2020-05-27T02:32:55Z) - SSP: Single Shot Future Trajectory Prediction [26.18589883075203]
We propose a robust solution to future trajectory forecast, which can be practically applicable to autonomous agents in highly crowded environments.
First, we use composite fields to predict future locations of all road agents in a single-shot, which results in a constant time.
Second, interactions between agents are modeled as non-local, response enabling spatial relationships between different locations to be captured temporally.
Third, the semantic context of the scene are modeled and take into account the environmental constraints that potentially influence the future motion.
arXiv Detail & Related papers (2020-04-13T09:56:38Z) - A Spatial-Temporal Attentive Network with Spatial Continuity for
Trajectory Prediction [74.00750936752418]
We propose a novel model named spatial-temporal attentive network with spatial continuity (STAN-SC)
First, spatial-temporal attention mechanism is presented to explore the most useful and important information.
Second, we conduct a joint feature sequence based on the sequence and instant state information to make the generative trajectories keep spatial continuity.
arXiv Detail & Related papers (2020-03-13T04:35:50Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.