An End-to-End Framework of Road User Detection, Tracking, and Prediction
from Monocular Images
- URL: http://arxiv.org/abs/2308.05026v1
- Date: Wed, 9 Aug 2023 15:46:25 GMT
- Title: An End-to-End Framework of Road User Detection, Tracking, and Prediction
from Monocular Images
- Authors: Hao Cheng, Mengmeng liu, Lin Chen
- Abstract summary: We build an end-to-end framework for detection, tracking, and trajectory prediction called ODTP.
It adopts the state-of-the-art online multi-object tracking model, QD-3DT, for perception and trains the trajectory predictor, DCENet++, directly based on the detection results.
We evaluate the performance of ODTP on the widely used nuScenes dataset for autonomous driving.
- Score: 11.733622044569486
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: Perception that involves multi-object detection and tracking, and trajectory
prediction are two major tasks of autonomous driving. However, they are
currently mostly studied separately, which results in most trajectory
prediction modules being developed based on ground truth trajectories without
taking into account that trajectories extracted from the detection and tracking
modules in real-world scenarios are noisy. These noisy trajectories can have a
significant impact on the performance of the trajectory predictor and can lead
to serious prediction errors. In this paper, we build an end-to-end framework
for detection, tracking, and trajectory prediction called ODTP (Online
Detection, Tracking and Prediction). It adopts the state-of-the-art online
multi-object tracking model, QD-3DT, for perception and trains the trajectory
predictor, DCENet++, directly based on the detection results without purely
relying on ground truth trajectories. We evaluate the performance of ODTP on
the widely used nuScenes dataset for autonomous driving. Extensive experiments
show that ODPT achieves high performance end-to-end trajectory prediction.
DCENet++, with the enhanced dynamic maps, predicts more accurate trajectories
than its base model. It is also more robust when compared with other generative
and deterministic trajectory prediction models trained on noisy detection
results.
Related papers
- JRDB-Traj: A Dataset and Benchmark for Trajectory Forecasting in Crowds [79.00975648564483]
Trajectory forecasting models, employed in fields such as robotics, autonomous vehicles, and navigation, face challenges in real-world scenarios.
This dataset provides comprehensive data, including the locations of all agents, scene images, and point clouds, all from the robot's perspective.
The objective is to predict the future positions of agents relative to the robot using raw sensory input data.
arXiv Detail & Related papers (2023-11-05T18:59:31Z) - PBP: Path-based Trajectory Prediction for Autonomous Driving [3.640190809479174]
Trajectory prediction plays a crucial role in the autonomous driving stack.
Goal-based prediction models have gained traction in recent years for addressing the multimodal nature of future trajectories.
arXiv Detail & Related papers (2023-09-07T14:45:41Z) - Class-Aware Attention for Multimodal Trajectory Prediction [0.7130302992490973]
We present a novel framework for multimodal trajectory prediction in autonomous driving.
Our model is able to run in real-time, achieving a high inference rate of over 300 FPS.
arXiv Detail & Related papers (2022-08-31T18:43:23Z) - Transforming Model Prediction for Tracking [109.08417327309937]
Transformers capture global relations with little inductive bias, allowing it to learn the prediction of more powerful target models.
We train the proposed tracker end-to-end and validate its performance by conducting comprehensive experiments on multiple tracking datasets.
Our tracker sets a new state of the art on three benchmarks, achieving an AUC of 68.5% on the challenging LaSOT dataset.
arXiv Detail & Related papers (2022-03-21T17:59:40Z) - Trajectory Forecasting from Detection with Uncertainty-Aware Motion
Encoding [121.66374635092097]
Trajectories obtained from object detection and tracking are inevitably noisy.
We propose a trajectory predictor directly based on detection results without relying on explicitly formed trajectories.
arXiv Detail & Related papers (2022-02-03T09:09:56Z) - Human Trajectory Prediction via Counterfactual Analysis [87.67252000158601]
Forecasting human trajectories in complex dynamic environments plays a critical role in autonomous vehicles and intelligent robots.
Most existing methods learn to predict future trajectories by behavior clues from history trajectories and interaction clues from environments.
We propose a counterfactual analysis method for human trajectory prediction to investigate the causality between the predicted trajectories and input clues.
arXiv Detail & Related papers (2021-07-29T17:41:34Z) - Spatio-Temporal Graph Dual-Attention Network for Multi-Agent Prediction
and Tracking [23.608125748229174]
We propose a generic generative neural system for multi-agent trajectory prediction involving heterogeneous agents.
The proposed system is evaluated on three public benchmark datasets for trajectory prediction.
arXiv Detail & Related papers (2021-02-18T02:25:35Z) - TNT: Target-driveN Trajectory Prediction [76.21200047185494]
We develop a target-driven trajectory prediction framework for moving agents.
We benchmark it on trajectory prediction of vehicles and pedestrians.
We outperform state-of-the-art on Argoverse Forecasting, INTERACTION, Stanford Drone and an in-house Pedestrian-at-Intersection dataset.
arXiv Detail & Related papers (2020-08-19T06:52:46Z) - BiTraP: Bi-directional Pedestrian Trajectory Prediction with Multi-modal
Goal Estimation [28.10445924083422]
BiTraP is a goal-conditioned bi-directional multi-modal trajectory prediction method based on the CVAE.
BiTraP generalizes to both first-person view (FPV) and bird's-eye view (BEV) scenarios and outperforms state-of-the-art results by 10-50%.
arXiv Detail & Related papers (2020-07-29T02:40:17Z) - PnPNet: End-to-End Perception and Prediction with Tracking in the Loop [82.97006521937101]
We tackle the problem of joint perception and motion forecasting in the context of self-driving vehicles.
We propose Net, an end-to-end model that takes as input sensor data, and outputs at each time step object tracks and their future level.
arXiv Detail & Related papers (2020-05-29T17:57:25Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.