Revisiting Class Imbalance for End-to-end Semi-Supervised Object
Detection
- URL: http://arxiv.org/abs/2306.02268v1
- Date: Sun, 4 Jun 2023 06:01:53 GMT
- Title: Revisiting Class Imbalance for End-to-end Semi-Supervised Object
Detection
- Authors: Purbayan Kar, Vishal Chudasama, Naoyuki Onoe, Pankaj Wasnik
- Abstract summary: Semi-supervised object detection (SSOD) has made significant progress with the development of pseudo-label-based end-to-end methods.
Many methods face challenges due to class imbalance, which hinders the effectiveness of the pseudo-label generator.
In this paper, we examine the root causes of low-quality pseudo-labels and present novel learning mechanisms to improve the label generation quality.
- Score: 1.6249267147413524
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Semi-supervised object detection (SSOD) has made significant progress with
the development of pseudo-label-based end-to-end methods. However, many of
these methods face challenges due to class imbalance, which hinders the
effectiveness of the pseudo-label generator. Furthermore, in the literature, it
has been observed that low-quality pseudo-labels severely limit the performance
of SSOD. In this paper, we examine the root causes of low-quality pseudo-labels
and present novel learning mechanisms to improve the label generation quality.
To cope with high false-negative and low precision rates, we introduce an
adaptive thresholding mechanism that helps the proposed network to filter out
optimal bounding boxes. We further introduce a Jitter-Bagging module to provide
accurate information on localization to help refine the bounding boxes.
Additionally, two new losses are introduced using the background and foreground
scores predicted by the teacher and student networks to improvise the
pseudo-label recall rate. Furthermore, our method applies strict supervision to
the teacher network by feeding strong & weak augmented data to generate robust
pseudo-labels so that it can detect small and complex objects. Finally, the
extensive experiments show that the proposed network outperforms
state-of-the-art methods on MS-COCO and Pascal VOC datasets and allows the
baseline network to achieve 100% supervised performance with much less (i.e.,
20%) labeled data.
Related papers
- Dual-Decoupling Learning and Metric-Adaptive Thresholding for Semi-Supervised Multi-Label Learning [81.83013974171364]
Semi-supervised multi-label learning (SSMLL) is a powerful framework for leveraging unlabeled data to reduce the expensive cost of collecting precise multi-label annotations.
Unlike semi-supervised learning, one cannot select the most probable label as the pseudo-label in SSMLL due to multiple semantics contained in an instance.
We propose a dual-perspective method to generate high-quality pseudo-labels.
arXiv Detail & Related papers (2024-07-26T09:33:53Z) - Class Prior-Free Positive-Unlabeled Learning with Taylor Variational
Loss for Hyperspectral Remote Sensing Imagery [12.54504113062557]
Positive-unlabeled learning (PU learning) in hyperspectral remote sensing imagery (HSI) is aimed at learning a binary classifier from positive and unlabeled data.
In this paper, a Taylor variational loss is proposed for HSI PU learning, which reduces the weight of the gradient of the unlabeled data.
Experiments on 7 benchmark datasets (21 tasks in total) validate the effectiveness of the proposed method.
arXiv Detail & Related papers (2023-08-29T07:29:30Z) - Efficient Teacher: Semi-Supervised Object Detection for YOLOv5 [2.2290171169275492]
One-stage anchor-based detectors lack the structure to generate high-quality or flexible pseudo labels.
Dense Detector is a baseline model that extends RetinaNet with dense sampling techniques inspired by YOLOv5.
Pseudo Label Assigner makes more refined use of pseudo labels from Dense Detector.
Epoch Adaptor is a method that enables a stable and efficient end-to-end semi-supervised training schedule.
arXiv Detail & Related papers (2023-02-15T10:40:19Z) - A Weakly Supervised Learning Framework for Salient Object Detection via
Hybrid Labels [96.56299163691979]
This paper focuses on a new weakly-supervised salient object detection (SOD) task under hybrid labels.
To address the issues of label noise and quantity imbalance in this task, we design a new pipeline framework with three sophisticated training strategies.
Experiments on five SOD benchmarks show that our method achieves competitive performance against weakly-supervised/unsupervised methods.
arXiv Detail & Related papers (2022-09-07T06:45:39Z) - Towards Reducing Labeling Cost in Deep Object Detection [61.010693873330446]
We propose a unified framework for active learning, that considers both the uncertainty and the robustness of the detector.
Our method is able to pseudo-label the very confident predictions, suppressing a potential distribution drift.
arXiv Detail & Related papers (2021-06-22T16:53:09Z) - Rethinking Pseudo Labels for Semi-Supervised Object Detection [84.697097472401]
We introduce certainty-aware pseudo labels tailored for object detection.
We dynamically adjust the thresholds used to generate pseudo labels and reweight loss functions for each category to alleviate the class imbalance problem.
Our approach improves supervised baselines by up to 10% AP using only 1-10% labeled data from COCO.
arXiv Detail & Related papers (2021-06-01T01:32:03Z) - Boosting Semi-Supervised Face Recognition with Noise Robustness [54.342992887966616]
This paper presents an effective solution to semi-supervised face recognition that is robust to the label noise aroused by the auto-labelling.
We develop a semi-supervised face recognition solution, named Noise Robust Learning-Labelling (NRoLL), which is based on the robust training ability empowered by GN.
arXiv Detail & Related papers (2021-05-10T14:43:11Z) - Semi-supervised Long-tailed Recognition using Alternate Sampling [95.93760490301395]
Main challenges in long-tailed recognition come from the imbalanced data distribution and sample scarcity in its tail classes.
We propose a new recognition setting, namely semi-supervised long-tailed recognition.
We demonstrate significant accuracy improvements over other competitive methods on two datasets.
arXiv Detail & Related papers (2021-05-01T00:43:38Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.