PTP: Parallelized Tracking and Prediction with Graph Neural Networks and
Diversity Sampling
- URL: http://arxiv.org/abs/2003.07847v2
- Date: Sat, 3 Apr 2021 13:56:15 GMT
- Title: PTP: Parallelized Tracking and Prediction with Graph Neural Networks and
Diversity Sampling
- Authors: Xinshuo Weng and Ye Yuan and Kris Kitani
- Abstract summary: Multi-object tracking (MOT) and trajectory prediction are two critical components in modern 3D perception systems.
We propose a parallelized framework to learn a shared feature representation of agent interaction.
Our method with socially-aware feature learning and diversity sampling achieves new state-of-the-art performance on 3D MOT and trajectory prediction.
- Score: 34.68114553744956
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Multi-object tracking (MOT) and trajectory prediction are two critical
components in modern 3D perception systems that require accurate modeling of
multi-agent interaction. We hypothesize that it is beneficial to unify both
tasks under one framework in order to learn a shared feature representation of
agent interaction. Furthermore, instead of performing tracking and prediction
sequentially which can propagate errors from tracking to prediction, we propose
a parallelized framework to mitigate the issue. Also, our parallel
track-forecast framework incorporates two additional novel computational units.
First, we use a feature interaction technique by introducing Graph Neural
Networks (GNNs) to capture the way in which agents interact with one another.
The GNN is able to improve discriminative feature learning for MOT association
and provide socially-aware contexts for trajectory prediction. Second, we use a
diversity sampling function to improve the quality and diversity of our
forecasted trajectories. The learned sampling function is trained to
efficiently extract a variety of outcomes from a generative trajectory
distribution and helps avoid the problem of generating duplicate trajectory
samples. We evaluate on KITTI and nuScenes datasets showing that our method
with socially-aware feature learning and diversity sampling achieves new
state-of-the-art performance on 3D MOT and trajectory prediction. Project
website is: https://www.xinshuoweng.com/projects/PTP
Related papers
- StreamMOTP: Streaming and Unified Framework for Joint 3D Multi-Object Tracking and Trajectory Prediction [22.29257945966914]
We propose a streaming and unified framework for joint 3D Multi-Object Tracking and trajectory Prediction (StreamMOTP)
We construct the model in a streaming manner and exploit a memory bank to preserve and leverage the long-term latent features for tracked objects more effectively.
We also improve the quality and consistency of predicted trajectories with a dual-stream predictor.
arXiv Detail & Related papers (2024-06-28T11:35:35Z) - GroupNet: Multiscale Hypergraph Neural Networks for Trajectory
Prediction with Relational Reasoning [37.64048110462638]
GroupNet is a multiscale hypergraph neural network that captures both pair-wise and group-wise interactions.
We apply GroupNet into both CVAE-based prediction system and previous state-of-the-art prediction systems.
We show that with GroupNet, the CVAE-based prediction system outperforms state-of-the-art methods.
arXiv Detail & Related papers (2022-04-19T09:36:20Z) - Transforming Model Prediction for Tracking [109.08417327309937]
Transformers capture global relations with little inductive bias, allowing it to learn the prediction of more powerful target models.
We train the proposed tracker end-to-end and validate its performance by conducting comprehensive experiments on multiple tracking datasets.
Our tracker sets a new state of the art on three benchmarks, achieving an AUC of 68.5% on the challenging LaSOT dataset.
arXiv Detail & Related papers (2022-03-21T17:59:40Z) - Adaptive Trajectory Prediction via Transferable GNN [74.09424229172781]
We propose a novel Transferable Graph Neural Network (T-GNN) framework, which jointly conducts trajectory prediction as well as domain alignment in a unified framework.
Specifically, a domain invariant GNN is proposed to explore the structural motion knowledge where the domain specific knowledge is reduced.
An attention-based adaptive knowledge learning module is further proposed to explore fine-grained individual-level feature representation for knowledge transfer.
arXiv Detail & Related papers (2022-03-09T21:08:47Z) - Online Multi-Agent Forecasting with Interpretable Collaborative Graph
Neural Network [65.11999700562869]
We propose a novel collaborative prediction unit (CoPU), which aggregates predictions from multiple collaborative predictors according to a collaborative graph.
Our methods outperform state-of-the-art works on the three tasks by 28.6%, 17.4% and 21.0% on average.
arXiv Detail & Related papers (2021-07-02T08:20:06Z) - Spatio-Temporal Graph Dual-Attention Network for Multi-Agent Prediction
and Tracking [23.608125748229174]
We propose a generic generative neural system for multi-agent trajectory prediction involving heterogeneous agents.
The proposed system is evaluated on three public benchmark datasets for trajectory prediction.
arXiv Detail & Related papers (2021-02-18T02:25:35Z) - End-to-End 3D Multi-Object Tracking and Trajectory Forecasting [34.68114553744956]
We propose a unified solution for 3D MOT and trajectory forecasting.
We employ a feature interaction technique by introducing Graph Neural Networks.
We also use a diversity sampling function to improve the quality and diversity of our forecasted trajectories.
arXiv Detail & Related papers (2020-08-25T16:54:46Z) - SMART: Simultaneous Multi-Agent Recurrent Trajectory Prediction [72.37440317774556]
We propose advances that address two key challenges in future trajectory prediction.
multimodality in both training data and predictions and constant time inference regardless of number of agents.
arXiv Detail & Related papers (2020-07-26T08:17:10Z) - Multi-Task Learning for Dense Prediction Tasks: A Survey [87.66280582034838]
Multi-task learning (MTL) techniques have shown promising results w.r.t. performance, computations and/or memory footprint.
We provide a well-rounded view on state-of-the-art deep learning approaches for MTL in computer vision.
arXiv Detail & Related papers (2020-04-28T09:15:50Z) - Diversity inducing Information Bottleneck in Model Ensembles [73.80615604822435]
In this paper, we target the problem of generating effective ensembles of neural networks by encouraging diversity in prediction.
We explicitly optimize a diversity inducing adversarial loss for learning latent variables and thereby obtain diversity in the output predictions necessary for modeling multi-modal data.
Compared to the most competitive baselines, we show significant improvements in classification accuracy, under a shift in the data distribution.
arXiv Detail & Related papers (2020-03-10T03:10:41Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.