ALWOD: Active Learning for Weakly-Supervised Object Detection
- URL: http://arxiv.org/abs/2309.07914v1
- Date: Thu, 14 Sep 2023 17:59:05 GMT
- Title: ALWOD: Active Learning for Weakly-Supervised Object Detection
- Authors: Yuting Wang, Velibor Ilic, Jiatong Li, Branislav Kisacanin, and
Vladimir Pavlovic
- Abstract summary: We propose ALWOD, a new framework that fuses active learning with weakly and semi-supervised object detection paradigms.
We demonstrate that ALWOD significantly narrows the gap between the ODs trained on few partially labeled but strategically selected image instances and those that rely on the fully-labeled data.
- Score: 14.063031246614488
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Object detection (OD), a crucial vision task, remains challenged by the lack
of large training datasets with precise object localization labels. In this
work, we propose ALWOD, a new framework that addresses this problem by fusing
active learning (AL) with weakly and semi-supervised object detection
paradigms. Because the performance of AL critically depends on the model
initialization, we propose a new auxiliary image generator strategy that
utilizes an extremely small labeled set, coupled with a large weakly tagged set
of images, as a warm-start for AL. We then propose a new AL acquisition
function, another critical factor in AL success, that leverages the
student-teacher OD pair disagreement and uncertainty to effectively propose the
most informative images to annotate. Finally, to complete the AL loop, we
introduce a new labeling task delegated to human annotators, based on selection
and correction of model-proposed detections, which is both rapid and effective
in labeling the informative images. We demonstrate, across several challenging
benchmarks, that ALWOD significantly narrows the gap between the ODs trained on
few partially labeled but strategically selected image instances and those that
rely on the fully-labeled data. Our code is publicly available on
https://github.com/seqam-lab/ALWOD.
Related papers
- Exploiting Unlabeled Data with Multiple Expert Teachers for Open Vocabulary Aerial Object Detection and Its Orientation Adaptation [58.37525311718006]
We put forth a novel formulation of the aerial object detection problem, namely open-vocabulary aerial object detection (OVAD)
We propose CastDet, a CLIP-activated student-teacher detection framework that serves as the first OVAD detector specifically designed for the challenging aerial scenario.
Our framework integrates a robust localization teacher along with several box selection strategies to generate high-quality proposals for novel objects.
arXiv Detail & Related papers (2024-11-04T12:59:13Z) - Boosting Semi-Supervised Object Detection in Remote Sensing Images With
Active Teaching [34.26972464240673]
We propose a novel active learning (AL) method to boost object detection in remote sensing images.
The proposed method incorporates an RoI comparison module (RoICM) to generate high-confidence pseudo-labels for regions of interest.
Our proposed method outperforms state-of-the-art methods for object detection in RSIs.
arXiv Detail & Related papers (2024-02-29T08:52:38Z) - Weakly-supervised Contrastive Learning for Unsupervised Object Discovery [52.696041556640516]
Unsupervised object discovery is promising due to its ability to discover objects in a generic manner.
We design a semantic-guided self-supervised learning model to extract high-level semantic features from images.
We introduce Principal Component Analysis (PCA) to localize object regions.
arXiv Detail & Related papers (2023-07-07T04:03:48Z) - SOOD: Towards Semi-Supervised Oriented Object Detection [57.05141794402972]
This paper proposes a novel Semi-supervised Oriented Object Detection model, termed SOOD, built upon the mainstream pseudo-labeling framework.
Our experiments show that when trained with the two proposed losses, SOOD surpasses the state-of-the-art SSOD methods under various settings on the DOTA-v1.5 benchmark.
arXiv Detail & Related papers (2023-04-10T11:10:42Z) - Identification of Novel Classes for Improving Few-Shot Object Detection [12.013345715187285]
Few-shot object detection (FSOD) methods offer a remedy by realizing robust object detection using only a few training samples per class.
We develop a semi-supervised algorithm to detect and then utilize unlabeled novel objects as positive samples during training to improve FSOD performance.
Our experimental results indicate that our method is effective and outperforms the existing state-of-the-art (SOTA) FSOD methods.
arXiv Detail & Related papers (2023-03-18T14:12:52Z) - DETA: Denoised Task Adaptation for Few-Shot Learning [135.96805271128645]
Test-time task adaptation in few-shot learning aims to adapt a pre-trained task-agnostic model for capturing taskspecific knowledge.
With only a handful of samples available, the adverse effect of either the image noise (a.k.a. X-noise) or the label noise (a.k.a. Y-noise) from support samples can be severely amplified.
We propose DEnoised Task Adaptation (DETA), a first, unified image- and label-denoising framework to existing task adaptation approaches.
arXiv Detail & Related papers (2023-03-11T05:23:20Z) - MUS-CDB: Mixed Uncertainty Sampling with Class Distribution Balancing
for Active Annotation in Aerial Object Detection [40.94800050576902]
Recent aerial object detection models rely on a large amount of labeled training data.
Active learning effectively reduces the data labeling cost by selectively querying the informative and representative unlabelled samples.
We propose a novel active learning method for cost-effective aerial object detection.
arXiv Detail & Related papers (2022-12-06T07:50:00Z) - Activation to Saliency: Forming High-Quality Labels for Unsupervised
Salient Object Detection [54.92703325989853]
We propose a two-stage Activation-to-Saliency (A2S) framework that effectively generates high-quality saliency cues.
No human annotations are involved in our framework during the whole training process.
Our framework reports significant performance compared with existing USOD methods.
arXiv Detail & Related papers (2021-12-07T11:54:06Z) - Discovery-and-Selection: Towards Optimal Multiple Instance Learning for
Weakly Supervised Object Detection [86.86602297364826]
We propose a discoveryand-selection approach fused with multiple instance learning (DS-MIL)
Our proposed DS-MIL approach can consistently improve the baselines, reporting state-of-the-art performance.
arXiv Detail & Related papers (2021-10-18T07:06:57Z) - Dynamic Relevance Learning for Few-Shot Object Detection [6.550840743803705]
We propose a dynamic relevance learning model, which utilizes the relationship between all support images and Region of Interest (RoI) on the query images to construct a dynamic graph convolutional network (GCN)
The proposed model achieves the best overall performance, which shows its effectiveness of learning more generalized features.
arXiv Detail & Related papers (2021-08-04T18:29:42Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.