SDTracker: Synthetic Data Based Multi-Object Tracking
- URL: http://arxiv.org/abs/2303.14653v1
- Date: Sun, 26 Mar 2023 08:21:22 GMT
- Title: SDTracker: Synthetic Data Based Multi-Object Tracking
- Authors: Yingda Guan, Zhengyang Feng, Huiying Chang, Kuo Du, Tingting Li, Min
Wang
- Abstract summary: We present SDTracker, a method that harnesses the potential of synthetic data for multi-object tracking of real-world scenes.
We use the ImageNet dataset as an auxiliary to randomize the style of synthetic data.
We also adopt the pseudo-labeling method to effectively utilize the unlabeled MOT17 training data.
- Score: 8.43201092674197
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: We present SDTracker, a method that harnesses the potential of synthetic data
for multi-object tracking of real-world scenes in a domain generalization and
semi-supervised fashion. First, we use the ImageNet dataset as an auxiliary to
randomize the style of synthetic data. With out-of-domain data, we further
enforce pyramid consistency loss across different "stylized" images from the
same sample to learn domain invariant features. Second, we adopt the
pseudo-labeling method to effectively utilize the unlabeled MOT17 training
data. To obtain high-quality pseudo-labels, we apply proximal policy
optimization (PPO2) algorithm to search confidence thresholds for each
sequence. When using the unlabeled MOT17 training set, combined with the
pure-motion tracking strategy upgraded via developed post-processing, we
finally reach 61.4 HOTA.
Related papers
- TrajSSL: Trajectory-Enhanced Semi-Supervised 3D Object Detection [59.498894868956306]
Pseudo-labeling approaches to semi-supervised learning adopt a teacher-student framework.
We leverage pre-trained motion-forecasting models to generate object trajectories on pseudo-labeled data.
Our approach improves pseudo-label quality in two distinct manners.
arXiv Detail & Related papers (2024-09-17T05:35:00Z) - Semi-Supervised Image Captioning by Adversarially Propagating Labeled
Data [95.0476489266988]
We present a novel data-efficient semi-supervised framework to improve the generalization of image captioning models.
Our proposed method trains a captioner to learn from a paired data and to progressively associate unpaired data.
Our extensive and comprehensive empirical results both on (1) image-based and (2) dense region-based captioning datasets followed by comprehensive analysis on the scarcely-paired dataset.
arXiv Detail & Related papers (2023-01-26T15:25:43Z) - Robust Target Training for Multi-Source Domain Adaptation [110.77704026569499]
We propose a novel Bi-level Optimization based Robust Target Training (BORT$2$) method for MSDA.
Our proposed method achieves the state of the art performance on three MSDA benchmarks, including the large-scale DomainNet dataset.
arXiv Detail & Related papers (2022-10-04T15:20:01Z) - PieTrack: An MOT solution based on synthetic data training and
self-supervised domain adaptation [17.716808322509667]
PieTrack is developed based on synthetic data without using any pre-trained weights.
By leveraging the proposed multi-scale ensemble inference, we achieved a final HOTA score of 58.7 on the MOT17 testing set, ranked third place in the challenge.
arXiv Detail & Related papers (2022-07-22T20:34:49Z) - Bag of Tricks for Domain Adaptive Multi-Object Tracking [4.084199842578325]
The proposed method was built from pre-existing detector and tracker under the tracking-by-detection paradigm.
The tracker we used is an online tracker that merely links newly received detections with existing tracks.
Our method, SIA_Track, takes the first place on MOT Synth2MOT17 track at BMTT 2022 challenge.
arXiv Detail & Related papers (2022-05-31T08:49:20Z) - Unsupervised Domain Adaptive Learning via Synthetic Data for Person
Re-identification [101.1886788396803]
Person re-identification (re-ID) has gained more and more attention due to its widespread applications in video surveillance.
Unfortunately, the mainstream deep learning methods still need a large quantity of labeled data to train models.
In this paper, we develop a data collector to automatically generate synthetic re-ID samples in a computer game, and construct a data labeler to simultaneously annotate them.
arXiv Detail & Related papers (2021-09-12T15:51:41Z) - Improving Semi-Supervised and Domain-Adaptive Semantic Segmentation with
Self-Supervised Depth Estimation [94.16816278191477]
We present a framework for semi-adaptive and domain-supervised semantic segmentation.
It is enhanced by self-supervised monocular depth estimation trained only on unlabeled image sequences.
We validate the proposed model on the Cityscapes dataset.
arXiv Detail & Related papers (2021-08-28T01:33:38Z) - Semi-Supervised Domain Adaptation with Prototypical Alignment and
Consistency Learning [86.6929930921905]
This paper studies how much it can help address domain shifts if we further have a few target samples labeled.
To explore the full potential of landmarks, we incorporate a prototypical alignment (PA) module which calculates a target prototype for each class from the landmarks.
Specifically, we severely perturb the labeled images, making PA non-trivial to achieve and thus promoting model generalizability.
arXiv Detail & Related papers (2021-04-19T08:46:08Z) - Unsupervised Domain Adaptation for Person Re-Identification through
Source-Guided Pseudo-Labeling [2.449909275410288]
Person Re-Identification (re-ID) aims at retrieving images of the same person taken by different cameras.
Unsupervised Domain Adaptation (UDA) is an interesting research direction for this challenge as it avoids a costly annotation of the target data.
We introduce a framework which relies on a two-branch architecture optimizing classification and triplet loss based metric learning in source and target domains.
arXiv Detail & Related papers (2020-09-20T14:54:42Z) - Synthetic-to-Real Domain Adaptation for Lane Detection [5.811502603310248]
We explore learning from abundant, randomly generated synthetic data, together with unlabeled or partially labeled target domain data.
This poses the challenge of adapting models learned on the unrealistic synthetic domain to real images.
We develop a novel autoencoder-based approach that uses synthetic labels unaligned with particular images for adapting to target domain data.
arXiv Detail & Related papers (2020-07-08T10:54:21Z) - Multi-Domain Learning and Identity Mining for Vehicle Re-Identification [38.35753364518881]
This paper introduces our solution for the Track2 in AI City Challenge 2020 (AICITY20)
The Track2 is a vehicle re-identification task with both the real-world data and synthetic data.
With multiple-model ensemble, our method achieves 0.7322 in the mAP score which yields third place in the competition.
arXiv Detail & Related papers (2020-04-22T13:03:52Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.