VectorFlow: Combining Images and Vectors for Traffic Occupancy and Flow
Prediction
- URL: http://arxiv.org/abs/2208.04530v1
- Date: Tue, 9 Aug 2022 03:49:04 GMT
- Title: VectorFlow: Combining Images and Vectors for Traffic Occupancy and Flow
Prediction
- Authors: Xin Huang, Xiaoyu Tian, Junru Gu, Qiao Sun, Hang Zhao
- Abstract summary: We propose a novel occupancy flow fields predictor to produce accurate occupancy and flow predictions.
Our model ranks 3rd place on the Open dataset Occupancy and Flow Prediction Challenge, and achieves the best performance in the occluded occupancy and flow prediction task.
- Score: 18.277777620073685
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: Predicting future behaviors of road agents is a key task in autonomous
driving. While existing models have demonstrated great success in predicting
marginal agent future behaviors, it remains a challenge to efficiently predict
consistent joint behaviors of multiple agents. Recently, the occupancy flow
fields representation was proposed to represent joint future states of road
agents through a combination of occupancy grid and flow, which supports
efficient and consistent joint predictions. In this work, we propose a novel
occupancy flow fields predictor to produce accurate occupancy and flow
predictions, by combining the power of an image encoder that learns features
from a rasterized traffic image and a vector encoder that captures information
of continuous agent trajectories and map states. The two encoded features are
fused by multiple attention modules before generating final predictions. Our
simple but effective model ranks 3rd place on the Waymo Open Dataset Occupancy
and Flow Prediction Challenge, and achieves the best performance in the
occluded occupancy and flow prediction task.
Related papers
- Multi-Agent Trajectory Prediction with Difficulty-Guided Feature Enhancement Network [1.5888246742280365]
Trajectory prediction is crucial for autonomous driving as it aims to forecast future movements of traffic participants.
Traditional methods usually perform holistic inference on trajectories of agents, neglecting the differences in difficulty among agents.
This paper proposes a novel DifficultyGuided Feature Enhancement (DGFNet), which leverages the prediction difficulty differences among agents.
arXiv Detail & Related papers (2024-07-26T07:04:30Z) - FutureNet-LOF: Joint Trajectory Prediction and Lane Occupancy Field Prediction with Future Context Encoding [10.188379670636092]
We propose FutureNet, which explicitly integrates initially predicted trajectories into the future scenario.
We also propose Lane Occupancy Field (LOF), a new representation with lane semantics for motion forecasting in autonomous driving.
Our approach ranks 1st on two large-scale motion forecasting benchmarks: Argoverse 1 and Argoverse 2.
arXiv Detail & Related papers (2024-06-20T15:41:53Z) - HPNet: Dynamic Trajectory Forecasting with Historical Prediction Attention [76.37139809114274]
HPNet is a novel dynamic trajectory forecasting method.
We propose a Historical Prediction Attention module to automatically encode the dynamic relationship between successive predictions.
Our code is available at https://github.com/XiaolongTang23/HPNet.
arXiv Detail & Related papers (2024-04-09T14:42:31Z) - Implicit Occupancy Flow Fields for Perception and Prediction in
Self-Driving [68.95178518732965]
A self-driving vehicle (SDV) must be able to perceive its surroundings and predict the future behavior of other traffic participants.
Existing works either perform object detection followed by trajectory of the detected objects, or predict dense occupancy and flow grids for the whole scene.
This motivates our unified approach to perception and future prediction that implicitly represents occupancy and flow over time with a single neural network.
arXiv Detail & Related papers (2023-08-02T23:39:24Z) - Occupancy Flow Fields for Motion Forecasting in Autonomous Driving [36.64394937525725]
We propose Occupancy Flow Fields, a new representation for motion forecasting of multiple agents.
Our representation is a-temporal grid with each grid cell containing both the probability magnitude of the cell being occupied by any agent, and a two-dimensional flow vector representing the direction of the motion in that cell.
We report experimental results on a large in-house autonomous driving dataset and the INTERACTION dataset, and show that our model outperforms state-of-the-art models.
arXiv Detail & Related papers (2022-03-08T06:26:50Z) - LaPred: Lane-Aware Prediction of Multi-Modal Future Trajectories of
Dynamic Agents [10.869902339190949]
We propose a novel prediction model, referred to as the lane-aware prediction (LaPred) network.
LaPred uses the instance-level lane entities extracted from a semantic map to predict the multi-modal future trajectories.
The experiments conducted on the public nuScenes and Argoverse dataset demonstrate that the proposed LaPred method significantly outperforms the existing prediction models.
arXiv Detail & Related papers (2021-04-01T04:33:36Z) - SMART: Simultaneous Multi-Agent Recurrent Trajectory Prediction [72.37440317774556]
We propose advances that address two key challenges in future trajectory prediction.
multimodality in both training data and predictions and constant time inference regardless of number of agents.
arXiv Detail & Related papers (2020-07-26T08:17:10Z) - Implicit Latent Variable Model for Scene-Consistent Motion Forecasting [78.74510891099395]
In this paper, we aim to learn scene-consistent motion forecasts of complex urban traffic directly from sensor data.
We model the scene as an interaction graph and employ powerful graph neural networks to learn a distributed latent representation of the scene.
arXiv Detail & Related papers (2020-07-23T14:31:25Z) - AMENet: Attentive Maps Encoder Network for Trajectory Prediction [35.22312783822563]
Trajectory prediction is critical for applications of planning safe future movements.
We propose an end-to-end generative model named Attentive Maps Network (AMENet)
AMENet encodes the agent's motion and interaction information for accurate and realistic multi-path trajectory prediction.
arXiv Detail & Related papers (2020-06-15T10:00:07Z) - VectorNet: Encoding HD Maps and Agent Dynamics from Vectorized
Representation [74.56282712099274]
This paper introduces VectorNet, a hierarchical graph neural network that exploits the spatial locality of individual road components represented by vectors.
By operating on the vectorized high definition (HD) maps and agent trajectories, we avoid lossy rendering and computationally intensive ConvNet encoding steps.
We evaluate VectorNet on our in-house behavior prediction benchmark and the recently released Argoverse forecasting dataset.
arXiv Detail & Related papers (2020-05-08T19:07:03Z) - TPNet: Trajectory Proposal Network for Motion Prediction [81.28716372763128]
Trajectory Proposal Network (TPNet) is a novel two-stage motion prediction framework.
TPNet first generates a candidate set of future trajectories as hypothesis proposals, then makes the final predictions by classifying and refining the proposals.
Experiments on four large-scale trajectory prediction datasets, show that TPNet achieves the state-of-the-art results both quantitatively and qualitatively.
arXiv Detail & Related papers (2020-04-26T00:01:49Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.