STF: Spatial Temporal Fusion for Trajectory Prediction
- URL: http://arxiv.org/abs/2311.18149v1
- Date: Wed, 29 Nov 2023 23:31:40 GMT
- Title: STF: Spatial Temporal Fusion for Trajectory Prediction
- Authors: Pengqian Han, Partha Roop, Jiamou Liu, Tianzhe Bao, Yifei Wang
- Abstract summary: Trajectory prediction is a challenging task that aims to predict the future trajectory of vehicles or pedestrians over a short time horizon.
The more information the model can capture, the more precise the future trajectory can be predicted.
In this study, we introduce an integrated 3D graph that incorporates both spatial and temporal edges.
- Score: 18.359362362173098
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Trajectory prediction is a challenging task that aims to predict the future
trajectory of vehicles or pedestrians over a short time horizon based on their
historical positions. The main reason is that the trajectory is a kind of
complex data, including spatial and temporal information, which is crucial for
accurate prediction. Intuitively, the more information the model can capture,
the more precise the future trajectory can be predicted. However, previous
works based on deep learning methods processed spatial and temporal information
separately, leading to inadequate spatial information capture, which means they
failed to capture the complete spatial information. Therefore, it is of
significance to capture information more fully and effectively on vehicle
interactions. In this study, we introduced an integrated 3D graph that
incorporates both spatial and temporal edges. Based on this, we proposed the
integrated 3D graph, which considers the cross-time interaction information. In
specific, we design a Spatial-Temporal Fusion (STF) model including Multi-layer
perceptions (MLP) and Graph Attention (GAT) to capture the spatial and temporal
information historical trajectories simultaneously on the 3D graph. Our
experiment on the ApolloScape Trajectory Datasets shows that the proposed STF
outperforms several baseline methods, especially on the long-time-horizon
trajectory prediction.
Related papers
- Future Does Matter: Boosting 3D Object Detection with Temporal Motion Estimation in Point Cloud Sequences [25.74000325019015]
We introduce a novel LiDAR 3D object detection framework, namely LiSTM, to facilitate spatial-temporal feature learning with cross-frame motion forecasting information.
We have conducted experiments on the aggregation and nuScenes datasets to demonstrate that the proposed framework achieves superior 3D detection performance.
arXiv Detail & Related papers (2024-09-06T16:29:04Z) - Deep Learning for Trajectory Data Management and Mining: A Survey and Beyond [58.63558696061679]
Trajectory computing is crucial in various practical applications such as location services, urban traffic, and public safety.
We present a review of development and recent advances in deep learning for trajectory computing (DL4Traj)
Notably, we encapsulate recent advancements in Large Language Models (LLMs) that hold potential to augment trajectory computing.
arXiv Detail & Related papers (2024-03-21T05:57:27Z) - TrajectoryNAS: A Neural Architecture Search for Trajectory Prediction [0.0]
Trajectory prediction is a critical component of autonomous driving systems.
This paper introduces TrajectoryNAS, a pioneering method that focuses on utilizing point cloud data for trajectory prediction.
arXiv Detail & Related papers (2024-03-18T11:48:41Z) - S-T CRF: Spatial-Temporal Conditional Random Field for Human Trajectory
Prediction [11.302568159452889]
Trajectory prediction is of significant importance in computer vision.
This study introduces a novel model, termed the textbfS-Tjectory CRF: textbfSpatial-textbfTemporal textbfConditional textbfRandom textbfField.
arXiv Detail & Related papers (2023-11-30T02:33:01Z) - PDFormer: Propagation Delay-Aware Dynamic Long-Range Transformer for
Traffic Flow Prediction [78.05103666987655]
spatial-temporal Graph Neural Network (GNN) models have emerged as one of the most promising methods to solve this problem.
We propose a novel propagation delay-aware dynamic long-range transFormer, namely PDFormer, for accurate traffic flow prediction.
Our method can not only achieve state-of-the-art performance but also exhibit competitive computational efficiency.
arXiv Detail & Related papers (2023-01-19T08:42:40Z) - ST-P3: End-to-end Vision-based Autonomous Driving via Spatial-Temporal
Feature Learning [132.20119288212376]
We propose a spatial-temporal feature learning scheme towards a set of more representative features for perception, prediction and planning tasks simultaneously.
To the best of our knowledge, we are the first to systematically investigate each part of an interpretable end-to-end vision-based autonomous driving system.
arXiv Detail & Related papers (2022-07-15T16:57:43Z) - PreTraM: Self-Supervised Pre-training via Connecting Trajectory and Map [58.53373202647576]
We propose PreTraM, a self-supervised pre-training scheme for trajectory forecasting.
It consists of two parts: 1) Trajectory-Map Contrastive Learning, where we project trajectories and maps to a shared embedding space with cross-modal contrastive learning, and 2) Map Contrastive Learning, where we enhance map representation with contrastive learning on large quantities of HD-maps.
On top of popular baselines such as AgentFormer and Trajectron++, PreTraM boosts their performance by 5.5% and 6.9% relatively in FDE-10 on the challenging nuScenes dataset.
arXiv Detail & Related papers (2022-04-21T23:01:21Z) - Spatial-Temporal Correlation and Topology Learning for Person
Re-Identification in Videos [78.45050529204701]
We propose a novel framework to pursue discriminative and robust representation by modeling cross-scale spatial-temporal correlation.
CTL utilizes a CNN backbone and a key-points estimator to extract semantic local features from human body.
It explores a context-reinforced topology to construct multi-scale graphs by considering both global contextual information and physical connections of human body.
arXiv Detail & Related papers (2021-04-15T14:32:12Z) - A Spatial-Temporal Attentive Network with Spatial Continuity for
Trajectory Prediction [74.00750936752418]
We propose a novel model named spatial-temporal attentive network with spatial continuity (STAN-SC)
First, spatial-temporal attention mechanism is presented to explore the most useful and important information.
Second, we conduct a joint feature sequence based on the sequence and instant state information to make the generative trajectories keep spatial continuity.
arXiv Detail & Related papers (2020-03-13T04:35:50Z) - A Graph Attention Spatio-temporal Convolutional Network for 3D Human
Pose Estimation in Video [7.647599484103065]
We improve the learning of constraints in human skeleton by modeling local global spatial information via attention mechanisms.
Our approach effectively mitigates depth ambiguity and self-occlusion, generalizes to half upper body estimation, and achieves competitive performance on 2D-to-3D video pose estimation.
arXiv Detail & Related papers (2020-03-11T14:54:40Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.