StrongSORT: Make DeepSORT Great Again
- URL: http://arxiv.org/abs/2202.13514v1
- Date: Mon, 28 Feb 2022 02:37:19 GMT
- Title: StrongSORT: Make DeepSORT Great Again
- Authors: Yunhao Du, Yang Song, Bo Yang, Yanyun Zhao
- Abstract summary: We revisit the classic tracker DeepSORT and upgrade it from various aspects, i.e., detection, embedding and association.
The resulting tracker, called StrongSORT, sets new HOTA and IDF1 records on MOT17 and MOT20.
We present two lightweight and plug-and-play algorithms to further refine the tracking results.
- Score: 19.099510933467148
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Existing Multi-Object Tracking (MOT) methods can be roughly classified as
tracking-by-detection and joint-detection-association paradigms. Although the
latter has elicited more attention and demonstrates comparable performance
relative to the former, we claim that the tracking-by-detection paradigm is
still the optimal solution in terms of tracking accuracy. In this paper, we
revisit the classic tracker DeepSORT and upgrade it from various aspects, i.e.,
detection, embedding and association. The resulting tracker, called StrongSORT,
sets new HOTA and IDF1 records on MOT17 and MOT20. We also present two
lightweight and plug-and-play algorithms to further refine the tracking
results. Firstly, an appearance-free link model (AFLink) is proposed to
associate short tracklets into complete trajectories. To the best of our
knowledge, this is the first global link model without appearance information.
Secondly, we propose Gaussian-smoothed interpolation (GSI) to compensate for
missing detections. Instead of ignoring motion information like linear
interpolation, GSI is based on the Gaussian process regression algorithm and
can achieve more accurate localizations. Moreover, AFLink and GSI can be
plugged into various trackers with a negligible extra computational cost (591.9
and 140.9 Hz, respectively, on MOT17). By integrating StrongSORT with the two
algorithms, the final tracker StrongSORT++ ranks first on MOT17 and MOT20 in
terms of HOTA and IDF1 metrics and surpasses the second-place one by 1.3 - 2.2.
Code will be released soon.
Related papers
- LITE: A Paradigm Shift in Multi-Object Tracking with Efficient ReID Feature Integration [0.3277163122167433]
Lightweight Integrated Tracking-Feature Extraction paradigm is introduced as a novel multi-object tracking (MOT) approach.
It enhances ReID-based trackers by eliminating inference, pre-processing, post-processing, and ReID model training costs.
arXiv Detail & Related papers (2024-09-06T11:05:12Z) - RTAT: A Robust Two-stage Association Tracker for Multi-Object Tracking [12.406076162419877]
We propose a Robust Two-stage Association Tracker, named RTAT.
The first-stage association is performed between tracklets and detections to generate tracklets with high purity.
The second-stage association is performed between tracklets to form complete trajectories.
Our tracker ranks first on the test set of MOT17 and MOT20 benchmarks in most of the main MOT metrics.
arXiv Detail & Related papers (2024-08-14T07:37:24Z) - Temporal Correlation Meets Embedding: Towards a 2nd Generation of JDE-based Real-Time Multi-Object Tracking [52.04679257903805]
Joint Detection and Embedding (JDE) trackers have demonstrated excellent performance in Multi-Object Tracking (MOT) tasks.
Our tracker, named TCBTrack, achieves state-of-the-art performance on multiple public benchmarks.
arXiv Detail & Related papers (2024-07-19T07:48:45Z) - Comparative study of multi-person tracking methods [0.0]
The purpose of this study is to discover the techniques used and to provide useful insights about these algorithms in the tracking pipeline.
We trained our own Pedestrian Detection model using the MOT17Det dataset.
We then present experimental results which shows that Tracktor++ is a better multi-person tracking algorithm than SORT.
arXiv Detail & Related papers (2023-10-07T14:29:57Z) - Joint Spatial-Temporal and Appearance Modeling with Transformer for
Multiple Object Tracking [59.79252390626194]
We propose a novel solution named TransSTAM, which leverages Transformer to model both the appearance features of each object and the spatial-temporal relationships among objects.
The proposed method is evaluated on multiple public benchmarks including MOT16, MOT17, and MOT20, and it achieves a clear performance improvement in both IDF1 and HOTA.
arXiv Detail & Related papers (2022-05-31T01:19:18Z) - Joint Feature Learning and Relation Modeling for Tracking: A One-Stream
Framework [76.70603443624012]
We propose a novel one-stream tracking (OSTrack) framework that unifies feature learning and relation modeling.
In this way, discriminative target-oriented features can be dynamically extracted by mutual guidance.
OSTrack achieves state-of-the-art performance on multiple benchmarks, in particular, it shows impressive results on the one-shot tracking benchmark GOT-10k.
arXiv Detail & Related papers (2022-03-22T18:37:11Z) - Multi-object Tracking with Tracked Object Bounding Box Association [18.539658212171062]
CenterTrack tracking algorithm achieves state-of-the-art tracking performance using a simple detection model and single-frame spatial offsets.
We propose to incorporate a simple tracked object bounding box and overlapping prediction based on the current frame onto the CenterTrack algorithm.
arXiv Detail & Related papers (2021-05-17T14:32:47Z) - Global Correlation Network: End-to-End Joint Multi-Object Detection and
Tracking [2.749204052800622]
We present a novel network to realize joint multi-object detection and tracking in an end-to-end way, called Global Correlation Network (GCNet)
GCNet introduces the global correlation layer for regression of absolute size and coordinates of bounding boxes instead of offsets prediction.
The pipeline of detection and tracking by GCNet is conceptually simple, which does not need non-maximum suppression, data association, and other complicated tracking strategies.
arXiv Detail & Related papers (2021-03-23T13:16:42Z) - Chained-Tracker: Chaining Paired Attentive Regression Results for
End-to-End Joint Multiple-Object Detection and Tracking [102.31092931373232]
We propose a simple online model named Chained-Tracker (CTracker), which naturally integrates all the three subtasks into an end-to-end solution.
The two major novelties: chained structure and paired attentive regression, make CTracker simple, fast and effective.
arXiv Detail & Related papers (2020-07-29T02:38:49Z) - Quasi-Dense Similarity Learning for Multiple Object Tracking [82.93471035675299]
We present Quasi-Dense Similarity Learning, which densely samples hundreds of region proposals on a pair of images for contrastive learning.
We can directly combine this similarity learning with existing detection methods to build Quasi-Dense Tracking (QDTrack)
arXiv Detail & Related papers (2020-06-11T17:57:12Z) - Tracking by Instance Detection: A Meta-Learning Approach [99.66119903655711]
We propose a principled three-step approach to build a high-performance tracker.
We build two trackers, named Retina-MAML and FCOS-MAML, based on two modern detectors RetinaNet and FCOS.
Both trackers run in real-time at 40 FPS.
arXiv Detail & Related papers (2020-04-02T05:55:06Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.