Adaptive Affinity for Associations in Multi-Target Multi-Camera Tracking
- URL: http://arxiv.org/abs/2112.07664v1
- Date: Tue, 14 Dec 2021 18:59:11 GMT
- Title: Adaptive Affinity for Associations in Multi-Target Multi-Camera Tracking
- Authors: Yunzhong Hou, Zhongdao Wang, Shengjin Wang, Liang Zheng
- Abstract summary: We propose a simple yet effective approach to adapt affinity estimations to corresponding matching scopes in MTMCT.
Instead of trying to deal with all appearance changes, we tailor the affinity metric to specialize in ones that might emerge during data associations.
Minimizing the mismatch, the adaptive affinity module brings significant improvements over global re-ID distance.
- Score: 53.668757725179056
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Data associations in multi-target multi-camera tracking (MTMCT) usually
estimate affinity directly from re-identification (re-ID) feature distances.
However, we argue that it might not be the best choice given the difference in
matching scopes between re-ID and MTMCT problems. Re-ID systems focus on global
matching, which retrieves targets from all cameras and all times. In contrast,
data association in tracking is a local matching problem, since its candidates
only come from neighboring locations and time frames. In this paper, we design
experiments to verify such misfit between global re-ID feature distances and
local matching in tracking, and propose a simple yet effective approach to
adapt affinity estimations to corresponding matching scopes in MTMCT. Instead
of trying to deal with all appearance changes, we tailor the affinity metric to
specialize in ones that might emerge during data associations. To this end, we
introduce a new data sampling scheme with temporal windows originally used for
data associations in tracking. Minimizing the mismatch, the adaptive affinity
module brings significant improvements over global re-ID distance, and produces
competitive performance on CityFlow and DukeMTMC datasets.
Related papers
- STCMOT: Spatio-Temporal Cohesion Learning for UAV-Based Multiple Object Tracking [13.269416985959404]
Multiple object tracking (MOT) in Unmanned Aerial Vehicle (UAV) videos is important for diverse applications in computer vision.
We propose a novel Spatio-Temporal Cohesion Multiple Object Tracking framework (STCMOT)
We use historical embedding features to model the representation of ReID and detection features in a sequential order.
Our framework sets a new state-of-the-art performance in MOTA and IDF1 metrics.
arXiv Detail & Related papers (2024-09-17T14:34:18Z) - Local All-Pair Correspondence for Point Tracking [59.76186266230608]
We introduce LocoTrack, a highly accurate and efficient model designed for the task of tracking any point (TAP) across video sequences.
LocoTrack achieves unmatched accuracy on all TAP-Vid benchmarks and operates at a speed almost 6 times faster than the current state-of-the-art.
arXiv Detail & Related papers (2024-07-22T06:49:56Z) - PeFAD: A Parameter-Efficient Federated Framework for Time Series Anomaly Detection [51.20479454379662]
We propose a.
Federated Anomaly Detection framework named PeFAD with the increasing privacy concerns.
We conduct extensive evaluations on four real datasets, where PeFAD outperforms existing state-of-the-art baselines by up to 28.74%.
arXiv Detail & Related papers (2024-06-04T13:51:08Z) - DeconfuseTrack:Dealing with Confusion for Multi-Object Tracking [11.739356936959622]
We propose a simple, versatile, and highly interpretable data association approach called Decomposed Data Association (DDA)
DDA decomposes the traditional association problem into multiple sub-problems using a series of non-learning-based modules.
We also introduce Occlusion-aware Non-Maximum Suppression (ONMS) to retain more occluded detections.
arXiv Detail & Related papers (2024-03-05T08:35:09Z) - Generalizable Metric Network for Cross-domain Person Re-identification [55.71632958027289]
Cross-domain (i.e., domain generalization) scene presents a challenge in Re-ID tasks.
Most existing methods aim to learn domain-invariant or robust features for all domains.
We propose a Generalizable Metric Network (GMN) to explore sample similarity in the sample-pair space.
arXiv Detail & Related papers (2023-06-21T03:05:25Z) - Transformer-based assignment decision network for multiple object
tracking [0.0]
We introduce Transformer-based Assignment Decision Network (TADN) that tackles data association without the need of explicit optimization during inference.
Our proposed approach outperforms the state-of-the-art in most evaluation metrics despite its simple nature as a tracker.
arXiv Detail & Related papers (2022-08-06T19:47:32Z) - Lifelong Unsupervised Domain Adaptive Person Re-identification with
Coordinated Anti-forgetting and Adaptation [127.6168183074427]
We propose a new task, Lifelong Unsupervised Domain Adaptive (LUDA) person ReID.
This is challenging because it requires the model to continuously adapt to unlabeled data of the target environments.
We design an effective scheme for this task, dubbed CLUDA-ReID, where the anti-forgetting is harmoniously coordinated with the adaptation.
arXiv Detail & Related papers (2021-12-13T13:19:45Z) - Multi-Object Tracking and Segmentation with a Space-Time Memory Network [12.043574473965318]
We propose a method for multi-object tracking and segmentation based on a novel memory-based mechanism to associate tracklets.
The proposed tracker, MeNToS, addresses particularly the long-term data association problem.
arXiv Detail & Related papers (2021-10-21T17:13:17Z) - Discriminative Appearance Modeling with Multi-track Pooling for
Real-time Multi-object Tracking [20.66906781151]
In multi-object tracking, the tracker maintains in its memory the appearance and motion information for each object in the scene.
Many approaches model each target in isolation and lack the ability to use all the targets in the scene to jointly update the memory.
We propose a training strategy adapted to multi-track pooling which generates hard tracking episodes online.
arXiv Detail & Related papers (2021-01-28T18:12:39Z) - Joint Visual and Temporal Consistency for Unsupervised Domain Adaptive
Person Re-Identification [64.37745443119942]
This paper jointly enforces visual and temporal consistency in the combination of a local one-hot classification and a global multi-class classification.
Experimental results on three large-scale ReID datasets demonstrate the superiority of proposed method in both unsupervised and unsupervised domain adaptive ReID tasks.
arXiv Detail & Related papers (2020-07-21T14:31:27Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.