MapTrack: Tracking in the Map
- URL: http://arxiv.org/abs/2402.12968v1
- Date: Tue, 20 Feb 2024 12:35:23 GMT
- Title: MapTrack: Tracking in the Map
- Authors: Fei Wang, Ruohui Zhang, Chenglin Chen, Min Yang, Yun Bai
- Abstract summary: Multi-Object Tracking (MOT) aims to maintain stable and uninterrupted trajectories for each target.
Most state-of-the-art approaches first detect objects in each frame and then implement data association between new detections and existing tracks.
We propose a new framework comprising of three lightweight and plug-and-play algorithms: the probability map, the prediction map, and the covariance adaptive Kalman filter.
- Score: 14.991113420276767
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Multi-Object Tracking (MOT) aims to maintain stable and uninterrupted
trajectories for each target. Most state-of-the-art approaches first detect
objects in each frame and then implement data association between new
detections and existing tracks using motion models and appearance similarities.
Despite achieving satisfactory results, occlusion and crowds can easily lead to
missing and distorted detections, followed by missing and false associations.
In this paper, we first revisit the classic tracker DeepSORT, enhancing its
robustness over crowds and occlusion significantly by placing greater trust in
predictions when detections are unavailable or of low quality in crowded and
occluded scenes. Specifically, we propose a new framework comprising of three
lightweight and plug-and-play algorithms: the probability map, the prediction
map, and the covariance adaptive Kalman filter. The probability map identifies
whether undetected objects have genuinely disappeared from view (e.g., out of
the image or entered a building) or are only temporarily undetected due to
occlusion or other reasons. Trajectories of undetected targets that are still
within the probability map are extended by state estimations directly. The
prediction map determines whether an object is in a crowd, and we prioritize
state estimations over observations when severe deformation of observations
occurs, accomplished through the covariance adaptive Kalman filter. The
proposed method, named MapTrack, achieves state-of-the-art results on popular
multi-object tracking benchmarks such as MOT17 and MOT20. Despite its superior
performance, our method remains simple, online, and real-time. The code will be
open-sourced later.
Related papers
- Real-time Multi-Object Tracking Based on Bi-directional Matching [0.0]
This study offers a bi-directional matching algorithm for multi-object tracking.
A stranded area is used in the matching algorithm to temporarily store the objects that fail to be tracked.
In the MOT17 challenge, the proposed algorithm achieves 63.4% MOTA, 55.3% IDF1, and 20.1 FPS tracking speed.
arXiv Detail & Related papers (2023-03-15T08:38:08Z) - Joint Counting, Detection and Re-Identification for Multi-Object
Tracking [8.89262850257871]
In crowded scenes, joint detection and tracking usually fail to find accurate object associations due to missed or false detections.
We jointly model counting, detection and re-identification in an end-to-end framework, named CountingMOT, tailored for crowded scenes.
The proposed MOT tracker can perform online and real-time tracking, and achieves the state-of-the-art results on public benchmarks MOT16 (MOTA of 79.7), MOT17 (MOTA of 81.3%) and MOT20 (MOTA of 78.9%)
arXiv Detail & Related papers (2022-12-12T12:53:58Z) - ShaSTA: Modeling Shape and Spatio-Temporal Affinities for 3D
Multi-Object Tracking [25.093138732809738]
Multi-object tracking is a cornerstone capability of any robotic system.
Detectors function in a low precision recall regime, ensuring a low number of false-negatives while producing a high rate of false-positives.
We propose a method that learns shape andtemporal affinities between consecutive frames to better distinguish between true-positive and false-positive detections and tracks.
arXiv Detail & Related papers (2022-11-08T00:20:38Z) - Real-time AdaBoost cascade face tracker based on likelihood map and
optical flow [59.17685450892182]
The authors present a novel face tracking approach where optical flow information is incorporated into a modified version of the Viola Jones detection algorithm.
In the original algorithm, detection is static, as information from previous frames is not considered.
The proposed tracker preserves information about the number of classification stages passed by each window.
arXiv Detail & Related papers (2022-10-25T10:15:07Z) - A Two-Block RNN-based Trajectory Prediction from Incomplete Trajectory [14.725386295605666]
We introduce a two-block RNN model that approximates the inference steps of the Bayesian filtering framework.
We show that the proposed model improves the prediction accuracy compared to the three baseline imputation methods.
We also show that our proposed method can achieve better prediction compared to the baselines when there is no miss-detection.
arXiv Detail & Related papers (2022-03-14T13:39:44Z) - Learning to Track with Object Permanence [61.36492084090744]
We introduce an end-to-end trainable approach for joint object detection and tracking.
Our model, trained jointly on synthetic and real data, outperforms the state of the art on KITTI, and MOT17 datasets.
arXiv Detail & Related papers (2021-03-26T04:43:04Z) - Detecting Invisible People [58.49425715635312]
We re-purpose tracking benchmarks and propose new metrics for the task of detecting invisible objects.
We demonstrate that current detection and tracking systems perform dramatically worse on this task.
Second, we build dynamic models that explicitly reason in 3D, making use of observations produced by state-of-the-art monocular depth estimation networks.
arXiv Detail & Related papers (2020-12-15T16:54:45Z) - Tracking-by-Counting: Using Network Flows on Crowd Density Maps for
Tracking Multiple Targets [96.98888948518815]
State-of-the-art multi-object tracking(MOT) methods follow the tracking-by-detection paradigm.
We propose a new MOT paradigm, tracking-by-counting, tailored for crowded scenes.
arXiv Detail & Related papers (2020-07-18T19:51:53Z) - ArTIST: Autoregressive Trajectory Inpainting and Scoring for Tracking [80.02322563402758]
One of the core components in online multiple object tracking (MOT) frameworks is associating new detections with existing tracklets.
We introduce a probabilistic autoregressive generative model to score tracklet proposals by directly measuring the likelihood that a tracklet represents natural motion.
arXiv Detail & Related papers (2020-04-16T06:43:11Z) - Detection in Crowded Scenes: One Proposal, Multiple Predictions [79.28850977968833]
We propose a proposal-based object detector, aiming at detecting highly-overlapped instances in crowded scenes.
The key of our approach is to let each proposal predict a set of correlated instances rather than a single one in previous proposal-based frameworks.
Our detector can obtain 4.9% AP gains on challenging CrowdHuman dataset and 1.0% $textMR-2$ improvements on CityPersons dataset.
arXiv Detail & Related papers (2020-03-20T09:48:53Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.