Data-Uncertainty Guided Multi-Phase Learning for Semi-Supervised Object
Detection
- URL: http://arxiv.org/abs/2103.16368v1
- Date: Mon, 29 Mar 2021 09:27:23 GMT
- Title: Data-Uncertainty Guided Multi-Phase Learning for Semi-Supervised Object
Detection
- Authors: Zhenyu Wang, Yali Li, Ye Guo, Lu Fang, Shengjin Wang
- Abstract summary: We propose a data-uncertainty guided multi-phase learning method for semi-supervised object detection.
Our method behaves extraordinarily compared to baseline approaches and outperforms them by a large margin.
- Score: 66.10057490293981
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In this paper, we delve into semi-supervised object detection where unlabeled
images are leveraged to break through the upper bound of fully-supervised
object detection models. Previous semi-supervised methods based on pseudo
labels are severely degenerated by noise and prone to overfit to noisy labels,
thus are deficient in learning different unlabeled knowledge well. To address
this issue, we propose a data-uncertainty guided multi-phase learning method
for semi-supervised object detection. We comprehensively consider divergent
types of unlabeled images according to their difficulty levels, utilize them in
different phases and ensemble models from different phases together to generate
ultimate results. Image uncertainty guided easy data selection and region
uncertainty guided RoI Re-weighting are involved in multi-phase learning and
enable the detector to concentrate on more certain knowledge. Through extensive
experiments on PASCAL VOC and MS COCO, we demonstrate that our method behaves
extraordinarily compared to baseline approaches and outperforms them by a large
margin, more than 3% on VOC and 2% on COCO.
Related papers
- Learning Camouflaged Object Detection from Noisy Pseudo Label [60.9005578956798]
This paper introduces the first weakly semi-supervised Camouflaged Object Detection (COD) method.
It aims for budget-efficient and high-precision camouflaged object segmentation with an extremely limited number of fully labeled images.
We propose a noise correction loss that facilitates the model's learning of correct pixels in the early learning stage.
When using only 20% of fully labeled data, our method shows superior performance over the state-of-the-art methods.
arXiv Detail & Related papers (2024-07-18T04:53:51Z) - Virtual Category Learning: A Semi-Supervised Learning Method for Dense
Prediction with Extremely Limited Labels [63.16824565919966]
This paper proposes to use confusing samples proactively without label correction.
A Virtual Category (VC) is assigned to each confusing sample in such a way that it can safely contribute to the model optimisation.
Our intriguing findings highlight the usage of VC learning in dense vision tasks.
arXiv Detail & Related papers (2023-12-02T16:23:52Z) - Variational Self-Supervised Contrastive Learning Using Beta Divergence [0.0]
We present a contrastive self-supervised learning method which is robust to data noise, grounded in the domain of variational methods.
We demonstrate the effectiveness of the proposed method through rigorous experiments including linear evaluation and fine-tuning scenarios with multi-label datasets in the face understanding domain.
arXiv Detail & Related papers (2023-09-05T17:21:38Z) - Exploring the Boundaries of Semi-Supervised Facial Expression
Recognition: Learning from In-Distribution, Out-of-Distribution, and
Unconstrained Data [19.442685015494316]
We present a study on 11 of the most recent semi-supervised methods, in the context of facial expression recognition (FER)
Our investigation covers semi-supervised learning from in-distribution, out-of-distribution, unconstrained, and very small unlabelled data.
Our results demonstrate that FixMatch consistently achieves better performance on in-distribution unlabelled data, while ReMixMatch stands out among all methods for out-of-distribution, unconstrained, and scarce unlabelled data scenarios.
arXiv Detail & Related papers (2023-06-02T01:40:08Z) - MuRAL: Multi-Scale Region-based Active Learning for Object Detection [20.478741635006116]
We propose a novel approach called Multi-scale Region-based Active Learning (MuRAL) for object detection.
MuRAL identifies informative regions of various scales to reduce annotation costs for well-learned objects.
Our proposed method surpasses all existing coarse-grained and fine-grained baselines on Cityscapes and MS COCO datasets.
arXiv Detail & Related papers (2023-03-29T12:52:27Z) - The Treasure Beneath Multiple Annotations: An Uncertainty-aware Edge
Detector [70.43599299422813]
Existing methods fuse multiple annotations using a simple voting process, ignoring the inherent ambiguity of edges and labeling bias of annotators.
We propose a novel uncertainty-aware edge detector (UAED), which employs uncertainty to investigate the subjectivity and ambiguity of diverse annotations.
UAED achieves superior performance consistently across multiple edge detection benchmarks.
arXiv Detail & Related papers (2023-03-21T13:14:36Z) - Uncertain Facial Expression Recognition via Multi-task Assisted
Correction [43.02119884581332]
We propose a novel method of multi-task assisted correction in addressing uncertain facial expression recognition called MTAC.
Specifically, a confidence estimation block and a weighted regularization module are applied to highlight solid samples and suppress uncertain samples in every batch.
Experiments on RAF-DB, AffectNet, and AffWild2 datasets demonstrate that the MTAC obtains substantial improvements over baselines when facing synthetic and real uncertainties.
arXiv Detail & Related papers (2022-12-14T10:28:08Z) - Combating Noise: Semi-supervised Learning by Region Uncertainty
Quantification [55.23467274564417]
Current methods are easily distracted by noisy regions generated by pseudo labels.
We propose noise-resistant semi-supervised learning by quantifying the region uncertainty.
Experiments on both PASCAL VOC and MS COCO demonstrate the extraordinary performance of our method.
arXiv Detail & Related papers (2021-11-01T13:23:42Z) - Deep Semi-supervised Metric Learning with Dual Alignment for Cervical
Cancer Cell Detection [49.78612417406883]
We propose a novel semi-supervised deep metric learning method for cervical cancer cell detection.
Our model learns an embedding metric space and conducts dual alignment of semantic features on both the proposal and prototype levels.
We construct a large-scale dataset for semi-supervised cervical cancer cell detection for the first time, consisting of 240,860 cervical cell images.
arXiv Detail & Related papers (2021-04-07T17:11:27Z) - Pixel-wise Anomaly Detection in Complex Driving Scenes [30.884375526254836]
We present a pixel-wise anomaly detection framework that uses uncertainty maps to improve anomaly detection.
Our approach works as a general framework around already trained segmentation networks.
Top-2 performance across a range of different anomaly datasets shows the robustness of our approach to handling different anomaly instances.
arXiv Detail & Related papers (2021-03-09T14:26:20Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.