Consistent-Teacher: Towards Reducing Inconsistent Pseudo-targets in
Semi-supervised Object Detection
- URL: http://arxiv.org/abs/2209.01589v3
- Date: Tue, 28 Mar 2023 14:15:08 GMT
- Title: Consistent-Teacher: Towards Reducing Inconsistent Pseudo-targets in
Semi-supervised Object Detection
- Authors: Xinjiang Wang, Xingyi Yang, Shilong Zhang, Yijiang Li, Litong Feng,
Shijie Fang, Chengqi Lyu, Kai Chen, Wayne Zhang
- Abstract summary: Pseudo-targets undermine the training of an accurate detector.
It injects noise into the student's training, leading to severe overfitting problems.
We propose a systematic solution, termed ConsistentTeacher, to reduce the inconsistency.
- Score: 28.40887130075552
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: In this study, we dive deep into the inconsistency of pseudo targets in
semi-supervised object detection (SSOD). Our core observation is that the
oscillating pseudo-targets undermine the training of an accurate detector. It
injects noise into the student's training, leading to severe overfitting
problems. Therefore, we propose a systematic solution, termed
ConsistentTeacher, to reduce the inconsistency. First, adaptive anchor
assignment~(ASA) substitutes the static IoU-based strategy, which enables the
student network to be resistant to noisy pseudo-bounding boxes. Then we
calibrate the subtask predictions by designing a 3D feature alignment
module~(FAM-3D). It allows each classification feature to adaptively query the
optimal feature vector for the regression task at arbitrary scales and
locations. Lastly, a Gaussian Mixture Model (GMM) dynamically revises the score
threshold of pseudo-bboxes, which stabilizes the number of ground truths at an
early stage and remedies the unreliable supervision signal during training.
ConsistentTeacher provides strong results on a large range of SSOD evaluations.
It achieves 40.0 mAP with ResNet-50 backbone given only 10% of annotated
MS-COCO data, which surpasses previous baselines using pseudo labels by around
3 mAP. When trained on fully annotated MS-COCO with additional unlabeled data,
the performance further increases to 47.7 mAP. Our code is available at
\url{https://github.com/Adamdad/ConsistentTeacher}.
Related papers
- Gradient-based Sampling for Class Imbalanced Semi-supervised Object Detection [111.0991686509715]
We study the class imbalance problem for semi-supervised object detection (SSOD) under more challenging scenarios.
We propose a simple yet effective gradient-based sampling framework that tackles the class imbalance problem from the perspective of two types of confirmation biases.
Experiments on three proposed sub-tasks, namely MS-COCO, MS-COCO to Object365 and LVIS, suggest that our method outperforms current class imbalanced object detectors by clear margins.
arXiv Detail & Related papers (2024-03-22T11:30:10Z) - Revisiting Domain-Adaptive 3D Object Detection by Reliable, Diverse and
Class-balanced Pseudo-Labeling [38.07637524378327]
Unsupervised domain adaptation (DA) with the aid of pseudo labeling techniques has emerged as a crucial approach for domain-adaptive 3D object detection.
Existing DA methods suffer from a substantial drop in performance when applied to a multi-class training setting.
We propose a novel ReDB framework tailored for learning to detect all classes at once.
arXiv Detail & Related papers (2023-07-16T04:34:11Z) - Hierarchical Supervision and Shuffle Data Augmentation for 3D
Semi-Supervised Object Detection [90.32180043449263]
State-of-the-art 3D object detectors are usually trained on large-scale datasets with high-quality 3D annotations.
A natural remedy is to adopt semi-supervised learning (SSL) by leveraging a limited amount of labeled samples and abundant unlabeled samples.
This paper introduces a novel approach of Hierarchical Supervision and Shuffle Data Augmentation (HSSDA), which is a simple yet effective teacher-student framework.
arXiv Detail & Related papers (2023-04-04T02:09:32Z) - Consistent Diffusion Models: Mitigating Sampling Drift by Learning to be
Consistent [97.64313409741614]
We propose to enforce a emphconsistency property which states that predictions of the model on its own generated data are consistent across time.
We show that our novel training objective yields state-of-the-art results for conditional and unconditional generation in CIFAR-10 and baseline improvements in AFHQ and FFHQ.
arXiv Detail & Related papers (2023-02-17T18:45:04Z) - Efficient Teacher: Semi-Supervised Object Detection for YOLOv5 [2.2290171169275492]
One-stage anchor-based detectors lack the structure to generate high-quality or flexible pseudo labels.
Dense Detector is a baseline model that extends RetinaNet with dense sampling techniques inspired by YOLOv5.
Pseudo Label Assigner makes more refined use of pseudo labels from Dense Detector.
Epoch Adaptor is a method that enables a stable and efficient end-to-end semi-supervised training schedule.
arXiv Detail & Related papers (2023-02-15T10:40:19Z) - DSLA: Dynamic smooth label assignment for efficient anchor-free object
detection [18.043176234010517]
Anchor-free detectors basically formulate object detection as dense classification and regression.
It is common to introduce an individual prediction branch to estimate the quality of localization.
The following inconsistencies are observed when we delve into the practices of classification and quality estimation.
arXiv Detail & Related papers (2022-08-01T12:56:44Z) - Self-Supervised 3D Monocular Object Detection by Recycling Bounding
Boxes [3.3299316770988625]
The paper studies the application of established self-supervised bounding box recycling by labeling random windows as the pretext task.
We demonstrate improvements of between 2-3 % in mAP 3D and 0.9-1.5 % BEV scores using SSL over the baseline scores.
arXiv Detail & Related papers (2022-06-25T21:48:43Z) - ST3D++: Denoised Self-training for Unsupervised Domain Adaptation on 3D
Object Detection [78.71826145162092]
We present a self-training method, named ST3D++, with a holistic pseudo label denoising pipeline for unsupervised domain adaptation on 3D object detection.
We equip the pseudo label generation process with a hybrid quality-aware triplet memory to improve the quality and stability of generated pseudo labels.
In the model training stage, we propose a source data assisted training strategy and a curriculum data augmentation policy.
arXiv Detail & Related papers (2021-08-15T07:49:06Z) - 3DIoUMatch: Leveraging IoU Prediction for Semi-Supervised 3D Object
Detection [76.42897462051067]
3DIoUMatch is a novel semi-supervised method for 3D object detection applicable to both indoor and outdoor scenes.
We leverage a teacher-student mutual learning framework to propagate information from the labeled to the unlabeled train set in the form of pseudo-labels.
Our method consistently improves state-of-the-art methods on both ScanNet and SUN-RGBD benchmarks by significant margins under all label ratios.
arXiv Detail & Related papers (2020-12-08T11:06:26Z) - Learning a Unified Sample Weighting Network for Object Detection [113.98404690619982]
Region sampling or weighting is significantly important to the success of modern region-based object detectors.
We argue that sample weighting should be data-dependent and task-dependent.
We propose a unified sample weighting network to predict a sample's task weights.
arXiv Detail & Related papers (2020-06-11T16:19:16Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.