Joint Counting, Detection and Re-Identification for Multi-Object
Tracking
- URL: http://arxiv.org/abs/2212.05861v3
- Date: Mon, 19 Feb 2024 08:51:17 GMT
- Title: Joint Counting, Detection and Re-Identification for Multi-Object
Tracking
- Authors: Weihong Ren, Denglu Wu, Hui Cao, Xi'ai Chen, Zhi Han and Honghai Liu
- Abstract summary: In crowded scenes, joint detection and tracking usually fail to find accurate object associations due to missed or false detections.
We jointly model counting, detection and re-identification in an end-to-end framework, named CountingMOT, tailored for crowded scenes.
The proposed MOT tracker can perform online and real-time tracking, and achieves the state-of-the-art results on public benchmarks MOT16 (MOTA of 79.7), MOT17 (MOTA of 81.3%) and MOT20 (MOTA of 78.9%)
- Score: 8.89262850257871
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The recent trend in 2D multiple object tracking (MOT) is jointly solving
detection and tracking, where object detection and appearance feature (or
motion) are learned simultaneously. Despite competitive performance, in crowded
scenes, joint detection and tracking usually fail to find accurate object
associations due to missed or false detections. In this paper, we jointly model
counting, detection and re-identification in an end-to-end framework, named
CountingMOT, tailored for crowded scenes. By imposing mutual object-count
constraints between detection and counting, the CountingMOT tries to find a
balance between object detection and crowd density map estimation, which can
help it to recover missed detections or reject false detections. Our approach
is an attempt to bridge the gap of object detection, counting, and
re-Identification. This is in contrast to prior MOT methods that either ignore
the crowd density and thus are prone to failure in crowded scenes,or depend on
local correlations to build a graphical relationship for matching targets. The
proposed MOT tracker can perform online and real-time tracking, and achieves
the state-of-the-art results on public benchmarks MOT16 (MOTA of 79.7), MOT17
(MOTA of 81.3%) and MOT20 (MOTA of 78.9%).
Related papers
- Temporal Correlation Meets Embedding: Towards a 2nd Generation of JDE-based Real-Time Multi-Object Tracking [52.04679257903805]
Joint Detection and Embedding (JDE) trackers have demonstrated excellent performance in Multi-Object Tracking (MOT) tasks.
Our tracker, named TCBTrack, achieves state-of-the-art performance on multiple public benchmarks.
arXiv Detail & Related papers (2024-07-19T07:48:45Z) - ADA-Track: End-to-End Multi-Camera 3D Multi-Object Tracking with Alternating Detection and Association [15.161640917854363]
We introduce ADA-Track, a novel end-to-end framework for 3D MOT from multi-view cameras.
We introduce a learnable data association module based on edge-augmented cross-attention.
We integrate this association module into the decoder layer of a DETR-based 3D detector.
arXiv Detail & Related papers (2024-05-14T19:02:33Z) - MapTrack: Tracking in the Map [14.991113420276767]
Multi-Object Tracking (MOT) aims to maintain stable and uninterrupted trajectories for each target.
Most state-of-the-art approaches first detect objects in each frame and then implement data association between new detections and existing tracks.
We propose a new framework comprising of three lightweight and plug-and-play algorithms: the probability map, the prediction map, and the covariance adaptive Kalman filter.
arXiv Detail & Related papers (2024-02-20T12:35:23Z) - SparseTrack: Multi-Object Tracking by Performing Scene Decomposition
based on Pseudo-Depth [84.64121608109087]
We propose a pseudo-depth estimation method for obtaining the relative depth of targets from 2D images.
Secondly, we design a depth cascading matching (DCM) algorithm, which can use the obtained depth information to convert a dense target set into multiple sparse target subsets.
By integrating the pseudo-depth method and the DCM strategy into the data association process, we propose a new tracker, called SparseTrack.
arXiv Detail & Related papers (2023-06-08T14:36:10Z) - QDTrack: Quasi-Dense Similarity Learning for Appearance-Only Multiple
Object Tracking [73.52284039530261]
We present Quasi-Dense Similarity Learning, which densely samples hundreds of object regions on a pair of images for contrastive learning.
We find that the resulting distinctive feature space admits a simple nearest neighbor search at inference time for object association.
We show that our similarity learning scheme is not limited to video data, but can learn effective instance similarity even from static input.
arXiv Detail & Related papers (2022-10-12T15:47:36Z) - mvHOTA: A multi-view higher order tracking accuracy metric to measure
spatial and temporal associations in multi-point detection [1.039718070553655]
Multi-object tracking (MOT) is a challenging task that involves detecting objects in the scene and tracking them across a sequence of frames.
The main evaluation metric to benchmark MOT methods on datasets such as KITTI has recently become the higher order tracking accuracy (HOTA) metric.
We propose a multi-view higher order tracking metric (mvHOTA) to determine the accuracy of multi-point (multi-instance and multi-class) detection.
arXiv Detail & Related papers (2022-06-19T10:31:53Z) - Learning to Track with Object Permanence [61.36492084090744]
We introduce an end-to-end trainable approach for joint object detection and tracking.
Our model, trained jointly on synthetic and real data, outperforms the state of the art on KITTI, and MOT17 datasets.
arXiv Detail & Related papers (2021-03-26T04:43:04Z) - Probabilistic Tracklet Scoring and Inpainting for Multiple Object
Tracking [83.75789829291475]
We introduce a probabilistic autoregressive motion model to score tracklet proposals.
This is achieved by training our model to learn the underlying distribution of natural tracklets.
Our experiments demonstrate the superiority of our approach at tracking objects in challenging sequences.
arXiv Detail & Related papers (2020-12-03T23:59:27Z) - Tracking-by-Counting: Using Network Flows on Crowd Density Maps for
Tracking Multiple Targets [96.98888948518815]
State-of-the-art multi-object tracking(MOT) methods follow the tracking-by-detection paradigm.
We propose a new MOT paradigm, tracking-by-counting, tailored for crowded scenes.
arXiv Detail & Related papers (2020-07-18T19:51:53Z) - Joint Detection and Tracking in Videos with Identification Features [36.55599286568541]
We propose the first joint optimization of detection, tracking and re-identification features for videos.
Our method reaches the state-of-the-art on MOT, it ranks 1st in the UA-DETRAC'18 tracking challenge among online trackers, and 3rd overall.
arXiv Detail & Related papers (2020-05-21T21:06:40Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.