S4OD: Semi-Supervised learning for Single-Stage Object Detection
- URL: http://arxiv.org/abs/2204.04492v1
- Date: Sat, 9 Apr 2022 15:19:37 GMT
- Title: S4OD: Semi-Supervised learning for Single-Stage Object Detection
- Authors: Yueming Zhang, Xingxu Yao, Chao Liu, Feng Chen, Xiaolin Song, Tengfei
Xing, Runbo Hu, Hua Chai, Pengfei Xu, and Guoshan Zhang
- Abstract summary: Single-stage detectors suffer from extreme foreground-background class imbalance, while two-stage detectors do not.
In semi-supervised object detection, two-stage detectors can deliver remarkable performance by only selecting high-quality pseudo labels.
- Score: 17.44592826610368
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Single-stage detectors suffer from extreme foreground-background class
imbalance, while two-stage detectors do not. Therefore, in semi-supervised
object detection, two-stage detectors can deliver remarkable performance by
only selecting high-quality pseudo labels based on classification scores.
However, directly applying this strategy to single-stage detectors would
aggravate the class imbalance with fewer positive samples. Thus, single-stage
detectors have to consider both quality and quantity of pseudo labels
simultaneously. In this paper, we design a dynamic self-adaptive threshold
(DSAT) strategy in classification branch, which can automatically select pseudo
labels to achieve an optimal trade-off between quality and quantity. Besides,
to assess the regression quality of pseudo labels in single-stage detectors, we
propose a module to compute the regression uncertainty of boxes based on
Non-Maximum Suppression. By leveraging only 10% labeled data from COCO, our
method achieves 35.0% AP on anchor-free detector (FCOS) and 32.9% on
anchor-based detector (RetinaNet).
Related papers
- KECOR: Kernel Coding Rate Maximization for Active 3D Object Detection [48.66703222700795]
We resort to a novel kernel strategy to identify the most informative point clouds to acquire labels.
To accommodate both one-stage (i.e., SECOND) and two-stage detectors, we incorporate the classification entropy tangent and well trade-off between detection performance and the total number of bounding boxes selected for annotation.
Our results show that approximately 44% box-level annotation costs and 26% computational time are reduced compared to the state-of-the-art method.
arXiv Detail & Related papers (2023-07-16T04:27:03Z) - Ambiguity-Resistant Semi-Supervised Learning for Dense Object Detection [98.66771688028426]
We propose a Ambiguity-Resistant Semi-supervised Learning (ARSL) for one-stage detectors.
Joint-Confidence Estimation (JCE) is proposed to quantifies the classification and localization quality of pseudo labels.
ARSL effectively mitigates the ambiguities and achieves state-of-the-art SSOD performance on MS COCO and PASCAL VOC.
arXiv Detail & Related papers (2023-03-27T07:46:58Z) - S3: Supervised Self-supervised Learning under Label Noise [53.02249460567745]
In this paper we address the problem of classification in the presence of label noise.
In the heart of our method is a sample selection mechanism that relies on the consistency between the annotated label of a sample and the distribution of the labels in its neighborhood in the feature space.
Our method significantly surpasses previous methods on both CIFARCIFAR100 with artificial noise and real-world noisy datasets such as WebVision and ANIMAL-10N.
arXiv Detail & Related papers (2021-11-22T15:49:20Z) - Rethinking Pseudo Labels for Semi-Supervised Object Detection [84.697097472401]
We introduce certainty-aware pseudo labels tailored for object detection.
We dynamically adjust the thresholds used to generate pseudo labels and reweight loss functions for each category to alleviate the class imbalance problem.
Our approach improves supervised baselines by up to 10% AP using only 1-10% labeled data from COCO.
arXiv Detail & Related papers (2021-06-01T01:32:03Z) - Semi-Anchored Detector for One-Stage Object Detection [24.42034747738174]
A standard one-stage detector is comprised of two tasks: classification and regression.
With ResNet-101 as the backbone, the proposed semi-anchored detector achieves 43.6% mAP on COCO data set.
arXiv Detail & Related papers (2020-09-10T16:57:09Z) - SADet: Learning An Efficient and Accurate Pedestrian Detector [68.66857832440897]
This paper proposes a series of systematic optimization strategies for the detection pipeline of one-stage detector.
It forms a single shot anchor-based detector (SADet) for efficient and accurate pedestrian detection.
Though structurally simple, it presents state-of-the-art result and real-time speed of $20$ FPS for VGA-resolution images.
arXiv Detail & Related papers (2020-07-26T12:32:38Z) - EHSOD: CAM-Guided End-to-end Hybrid-Supervised Object Detection with
Cascade Refinement [53.69674636044927]
We present EHSOD, an end-to-end hybrid-supervised object detection system.
It can be trained in one shot on both fully and weakly-annotated data.
It achieves comparable results on multiple object detection benchmarks with only 30% fully-annotated data.
arXiv Detail & Related papers (2020-02-18T08:04:58Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.