Graph-Based Interaction-Aware Multimodal 2D Vehicle Trajectory
Prediction using Diffusion Graph Convolutional Networks
- URL: http://arxiv.org/abs/2309.01981v1
- Date: Tue, 5 Sep 2023 06:28:13 GMT
- Title: Graph-Based Interaction-Aware Multimodal 2D Vehicle Trajectory
Prediction using Diffusion Graph Convolutional Networks
- Authors: Keshu Wu, Yang Zhou, Haotian Shi, Xiaopeng Li, Bin Ran
- Abstract summary: This study presents the Graph-based Interaction-aware Multi-modal Trajectory Prediction framework.
Within this framework, vehicles' motions are conceptualized as nodes in a time-varying graph, and the traffic interactions are represented by a dynamic adjacency matrix.
We employ a driving intention-specific feature fusion, enabling the adaptive integration of historical and future embeddings.
- Score: 17.989423104706397
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Predicting vehicle trajectories is crucial for ensuring automated vehicle
operation efficiency and safety, particularly on congested multi-lane highways.
In such dynamic environments, a vehicle's motion is determined by its
historical behaviors as well as interactions with surrounding vehicles. These
intricate interactions arise from unpredictable motion patterns, leading to a
wide range of driving behaviors that warrant in-depth investigation. This study
presents the Graph-based Interaction-aware Multi-modal Trajectory Prediction
(GIMTP) framework, designed to probabilistically predict future vehicle
trajectories by effectively capturing these interactions. Within this
framework, vehicles' motions are conceptualized as nodes in a time-varying
graph, and the traffic interactions are represented by a dynamic adjacency
matrix. To holistically capture both spatial and temporal dependencies embedded
in this dynamic adjacency matrix, the methodology incorporates the Diffusion
Graph Convolutional Network (DGCN), thereby providing a graph embedding of both
historical states and future states. Furthermore, we employ a driving
intention-specific feature fusion, enabling the adaptive integration of
historical and future embeddings for enhanced intention recognition and
trajectory prediction. This model gives two-dimensional predictions for each
mode of longitudinal and lateral driving behaviors and offers probabilistic
future paths with corresponding probabilities, addressing the challenges of
complex vehicle interactions and multi-modality of driving behaviors.
Validation using real-world trajectory datasets demonstrates the efficiency and
potential.
Related papers
- Hypergraph-based Motion Generation with Multi-modal Interaction Relational Reasoning [13.294396870431399]
Real-world driving environments are characterized by dynamic and diverse interactions among vehicles.
This research introduces an integrated framework for autonomous vehicles (AVs) motion prediction.
arXiv Detail & Related papers (2024-09-18T03:30:38Z) - Interactive Autonomous Navigation with Internal State Inference and
Interactivity Estimation [58.21683603243387]
We propose three auxiliary tasks with relational-temporal reasoning and integrate them into the standard Deep Learning framework.
These auxiliary tasks provide additional supervision signals to infer the behavior patterns other interactive agents.
Our approach achieves robust and state-of-the-art performance in terms of standard evaluation metrics.
arXiv Detail & Related papers (2023-11-27T18:57:42Z) - D2-TPred: Discontinuous Dependency for Trajectory Prediction under
Traffic Lights [68.76631399516823]
We present a trajectory prediction approach with respect to traffic lights, D2-TPred, using a spatial dynamic interaction graph (SDG) and a behavior dependency graph (BDG)
Our experimental results show that our model achieves more than 20.45% and 20.78% in terms of ADE and FDE, respectively, on VTP-TL.
arXiv Detail & Related papers (2022-07-21T10:19:07Z) - RSG-Net: Towards Rich Sematic Relationship Prediction for Intelligent
Vehicle in Complex Environments [72.04891523115535]
We propose RSG-Net (Road Scene Graph Net): a graph convolutional network designed to predict potential semantic relationships from object proposals.
The experimental results indicate that this network, trained on Road Scene Graph dataset, could efficiently predict potential semantic relationships among objects around the ego-vehicle.
arXiv Detail & Related papers (2022-07-16T12:40:17Z) - Multi-Modal Fusion Transformer for End-to-End Autonomous Driving [59.60483620730437]
We propose TransFuser, a novel Multi-Modal Fusion Transformer, to integrate image and LiDAR representations using attention.
Our approach achieves state-of-the-art driving performance while reducing collisions by 76% compared to geometry-based fusion.
arXiv Detail & Related papers (2021-04-19T11:48:13Z) - Multi-modal Trajectory Prediction for Autonomous Driving with Semantic
Map and Dynamic Graph Attention Network [12.791191495432829]
There are several challenges in trajectory prediction in real-world traffic scenarios.
Inspired by people's natural habit of navigating traffic with attention to their goals and surroundings, this paper presents a unique graph attention network.
The network is designed to model the dynamic social interactions among agents and conform to traffic rules with a semantic map.
arXiv Detail & Related papers (2021-03-30T11:53:12Z) - Interaction-Based Trajectory Prediction Over a Hybrid Traffic Graph [4.574413934477815]
We propose to use a hybrid graph whose nodes represent both the traffic actors as well as the static and dynamic traffic elements present in the scene.
The different modes of temporal interaction (e.g., stopping and going) among actors and traffic elements are explicitly modeled by graph edges.
We show that our proposed model, TrafficGraphNet, achieves state-of-the-art trajectory prediction accuracy while maintaining a high level of interpretability.
arXiv Detail & Related papers (2020-09-27T18:20:03Z) - Implicit Latent Variable Model for Scene-Consistent Motion Forecasting [78.74510891099395]
In this paper, we aim to learn scene-consistent motion forecasts of complex urban traffic directly from sensor data.
We model the scene as an interaction graph and employ powerful graph neural networks to learn a distributed latent representation of the scene.
arXiv Detail & Related papers (2020-07-23T14:31:25Z) - Probabilistic Crowd GAN: Multimodal Pedestrian Trajectory Prediction
using a Graph Vehicle-Pedestrian Attention Network [12.070251470948772]
We show how Probabilistic Crowd GAN can output probabilistic multimodal predictions.
We also propose the use of Graph Vehicle-Pedestrian Attention Network (GVAT), which models social interactions.
We demonstrate improvements on the existing state of the art methods for trajectory prediction and illustrate how the true multimodal and uncertain nature of crowd interactions can be directly modelled.
arXiv Detail & Related papers (2020-06-23T11:25:16Z) - The Importance of Prior Knowledge in Precise Multimodal Prediction [71.74884391209955]
Roads have well defined geometries, topologies, and traffic rules.
In this paper we propose to incorporate structured priors as a loss function.
We demonstrate the effectiveness of our approach on real-world self-driving datasets.
arXiv Detail & Related papers (2020-06-04T03:56:11Z) - Social-WaGDAT: Interaction-aware Trajectory Prediction via Wasserstein
Graph Double-Attention Network [29.289670231364788]
In this paper, we propose a generic generative neural system for multi-agent trajectory prediction.
We also employ an efficient kinematic constraint layer applied to vehicle trajectory prediction.
The proposed system is evaluated on three public benchmark datasets for trajectory prediction.
arXiv Detail & Related papers (2020-02-14T20:11:13Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.