Dynamic Scenario Representation Learning for Motion Forecasting with
Heterogeneous Graph Convolutional Recurrent Networks
- URL: http://arxiv.org/abs/2303.04364v1
- Date: Wed, 8 Mar 2023 04:10:04 GMT
- Title: Dynamic Scenario Representation Learning for Motion Forecasting with
Heterogeneous Graph Convolutional Recurrent Networks
- Authors: Xing Gao, Xiaogang Jia, Yikang Li, and Hongkai Xiong
- Abstract summary: We resort to dynamic heterogeneous graphs to model the evolving scenario.
We design a novel heterogeneous graphal recurrent network, aggregating diverse interaction information.
With a motion forecasting decoder, our model predicts realistic and multi-modal future trajectories of agents.
- Score: 25.383615554172778
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Due to the complex and changing interactions in dynamic scenarios, motion
forecasting is a challenging problem in autonomous driving. Most existing works
exploit static road graphs to characterize scenarios and are limited in
modeling evolving spatio-temporal dependencies in dynamic scenarios. In this
paper, we resort to dynamic heterogeneous graphs to model the scenario. Various
scenario components including vehicles (agents) and lanes, multi-type
interactions, and their changes over time are jointly encoded. Furthermore, we
design a novel heterogeneous graph convolutional recurrent network, aggregating
diverse interaction information and capturing their evolution, to learn to
exploit intrinsic spatio-temporal dependencies in dynamic graphs and obtain
effective representations of dynamic scenarios. Finally, with a motion
forecasting decoder, our model predicts realistic and multi-modal future
trajectories of agents and outperforms state-of-the-art published works on
several motion forecasting benchmarks.
Related papers
- UnityGraph: Unified Learning of Spatio-temporal features for Multi-person Motion Prediction [13.052342503276936]
Multi-person motion prediction is a complex emerging field with significant real-world applications.
We propose a novel graph structure UnityGraph, which treats multi-temporal features as a whole, enhancing model coherence and couplings-temporal features.
Our method achieves the state-of-the-art performance, confirming its effectiveness and innovative design.
arXiv Detail & Related papers (2024-11-06T08:05:36Z) - Multi-View Neural Differential Equations for Continuous-Time Stream Data in Long-Term Traffic Forecasting [10.70370586311912]
We propose a new NDE architecture called Multi-View Neural Differential Equations.
Our model captures current states, delayed states, and trends in different state variables (views) by learning latent multiple representations.
Our proposed method outperforms the state-of-the-art and achieves robustness with noisy or missing inputs.
arXiv Detail & Related papers (2024-08-12T18:49:02Z) - Disentangled Neural Relational Inference for Interpretable Motion
Prediction [38.40799770648501]
We develop a variational auto-encoder framework that integrates graph-based representations and timesequence models.
Our model infers dynamic interaction graphs augmented with interpretable edge features that characterize the interactions.
We validate our approach through extensive experiments on both simulated and real-world datasets.
arXiv Detail & Related papers (2024-01-07T22:49:24Z) - TimeGraphs: Graph-based Temporal Reasoning [64.18083371645956]
TimeGraphs is a novel approach that characterizes dynamic interactions as a hierarchical temporal graph.
Our approach models the interactions using a compact graph-based representation, enabling adaptive reasoning across diverse time scales.
We evaluate TimeGraphs on multiple datasets with complex, dynamic agent interactions, including a football simulator, the Resistance game, and the MOMA human activity dataset.
arXiv Detail & Related papers (2024-01-06T06:26:49Z) - Trajeglish: Traffic Modeling as Next-Token Prediction [67.28197954427638]
A longstanding challenge for self-driving development is simulating dynamic driving scenarios seeded from recorded driving logs.
We apply tools from discrete sequence modeling to model how vehicles, pedestrians and cyclists interact in driving scenarios.
Our model tops the Sim Agents Benchmark, surpassing prior work along the realism meta metric by 3.3% and along the interaction metric by 9.9%.
arXiv Detail & Related papers (2023-12-07T18:53:27Z) - MoCo-Flow: Neural Motion Consensus Flow for Dynamic Humans in Stationary
Monocular Cameras [98.40768911788854]
We introduce MoCo-Flow, a representation that models the dynamic scene using a 4D continuous time-variant function.
At the heart of our work lies a novel optimization formulation, which is constrained by a motion consensus regularization on the motion flow.
We extensively evaluate MoCo-Flow on several datasets that contain human motions of varying complexity.
arXiv Detail & Related papers (2021-06-08T16:03:50Z) - TCL: Transformer-based Dynamic Graph Modelling via Contrastive Learning [87.38675639186405]
We propose a novel graph neural network approach, called TCL, which deals with the dynamically-evolving graph in a continuous-time fashion.
To the best of our knowledge, this is the first attempt to apply contrastive learning to representation learning on dynamic graphs.
arXiv Detail & Related papers (2021-05-17T15:33:25Z) - Implicit Latent Variable Model for Scene-Consistent Motion Forecasting [78.74510891099395]
In this paper, we aim to learn scene-consistent motion forecasts of complex urban traffic directly from sensor data.
We model the scene as an interaction graph and employ powerful graph neural networks to learn a distributed latent representation of the scene.
arXiv Detail & Related papers (2020-07-23T14:31:25Z) - EvolveGraph: Multi-Agent Trajectory Prediction with Dynamic Relational
Reasoning [41.42230144157259]
We propose a generic trajectory forecasting framework with explicit relational structure recognition and prediction via latent interaction graphs.
Considering the uncertainty of future behaviors, the model is designed to provide multi-modal prediction hypotheses.
We introduce a double-stage training pipeline which not only improves training efficiency and accelerates convergence, but also enhances model performance.
arXiv Detail & Related papers (2020-03-31T02:49:23Z) - Trajectron++: Dynamically-Feasible Trajectory Forecasting With
Heterogeneous Data [37.176411554794214]
Reasoning about human motion is an important prerequisite to safe and socially-aware robotic navigation.
We present Trajectron++, a modular, graph-structured recurrent model that forecasts the trajectories of a general number of diverse agents.
We demonstrate its performance on several challenging real-world trajectory forecasting datasets.
arXiv Detail & Related papers (2020-01-09T16:47:17Z) - Spatial-Temporal Transformer Networks for Traffic Flow Forecasting [74.76852538940746]
We propose a novel paradigm of Spatial-Temporal Transformer Networks (STTNs) to improve the accuracy of long-term traffic forecasting.
Specifically, we present a new variant of graph neural networks, named spatial transformer, by dynamically modeling directed spatial dependencies.
The proposed model enables fast and scalable training over a long range spatial-temporal dependencies.
arXiv Detail & Related papers (2020-01-09T10:21:04Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.