Boosting Weakly Supervised Object Detection via Learning Bounding Box
Adjusters
- URL: http://arxiv.org/abs/2108.01499v1
- Date: Tue, 3 Aug 2021 13:38:20 GMT
- Title: Boosting Weakly Supervised Object Detection via Learning Bounding Box
Adjusters
- Authors: Bowen Dong and Zitong Huang and Yuelin Guo and Qilong Wang and
Zhenxing Niu and Wangmeng Zuo
- Abstract summary: Weakly-supervised object detection (WSOD) has recently emerged as an appealing topic because it avoids expensive instance-level object annotations.
We advocate a problem setting in which localization performance is improved by leveraging bounding box regression knowledge from a well-annotated auxiliary dataset.
Our method performs favorably against state-of-the-art WSOD methods and knowledge transfer models under a similar problem setting.
- Score: 76.36104006511684
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: Weakly-supervised object detection (WSOD) has recently emerged as an
appealing topic because it avoids expensive instance-level object annotations.
However, the bounding boxes of most existing WSOD methods are mainly determined
by precomputed proposals, which limits precise object localization. In this
paper, we advocate a problem setting in which localization performance is
improved by leveraging bounding box regression knowledge from a well-annotated
auxiliary dataset. First, we use the well-annotated auxiliary dataset to learn a
series of class-agnostic learnable bounding box adjusters (LBBAs) in a
multi-stage training manner. Then, only the LBBAs and a weakly-annotated dataset
with non-overlapping classes are used for training the LBBA-boosted WSOD
detector. As such, our LBBAs are more convenient and economical to deploy in
practice while avoiding leakage of the well-annotated auxiliary dataset. In
particular, we formulate learning bounding box adjusters as a bi-level
optimization problem and propose an EM-like multi-stage training algorithm. A
multi-stage scheme is then presented for LBBA-boosted WSOD, and a masking
strategy is adopted to improve proposal classification. Experimental results
verify the effectiveness of our method, which performs favorably against
state-of-the-art WSOD methods and knowledge transfer models under a similar
problem setting. Code is publicly available at
\url{https://github.com/DongSky/lbba_boosted_wsod}.
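To make the adjuster idea concrete, below is a minimal PyTorch sketch of a single class-agnostic adjuster stage that predicts box deltas for precomputed proposals and decodes them with the standard R-CNN parameterization. The class and function names (`LBBA`, `apply_deltas`), the feature dimension, and the single-stage structure are assumptions for illustration, not the released implementation.

```python
# Minimal sketch of a class-agnostic bounding box adjuster applied to
# precomputed proposals, in the spirit of the LBBAs described above.
# Names and dimensions are illustrative assumptions; see the released code
# at https://github.com/DongSky/lbba_boosted_wsod for the actual method.
import torch
import torch.nn as nn


class LBBA(nn.Module):
    """One adjuster stage: pooled proposal features -> class-agnostic box deltas."""

    def __init__(self, feat_dim: int = 4096):
        super().__init__()
        # Class-agnostic head: 4 outputs (dx, dy, dw, dh) shared by all classes.
        self.delta_head = nn.Sequential(
            nn.Linear(feat_dim, 1024), nn.ReLU(inplace=True), nn.Linear(1024, 4)
        )

    def forward(self, proposal_feats: torch.Tensor, proposals: torch.Tensor) -> torch.Tensor:
        """proposal_feats: (N, feat_dim); proposals: (N, 4) as (x1, y1, x2, y2)."""
        deltas = self.delta_head(proposal_feats)   # (N, 4)
        return apply_deltas(proposals, deltas)     # refined (x1, y1, x2, y2)


def apply_deltas(boxes: torch.Tensor, deltas: torch.Tensor) -> torch.Tensor:
    """Standard R-CNN box-delta decoding: shift the center, rescale width/height."""
    w = boxes[:, 2] - boxes[:, 0]
    h = boxes[:, 3] - boxes[:, 1]
    cx = boxes[:, 0] + 0.5 * w
    cy = boxes[:, 1] + 0.5 * h
    dx, dy, dw, dh = deltas.unbind(dim=1)
    new_cx, new_cy = cx + dx * w, cy + dy * h
    new_w, new_h = w * dw.exp(), h * dh.exp()
    return torch.stack(
        [new_cx - 0.5 * new_w, new_cy - 0.5 * new_h,
         new_cx + 0.5 * new_w, new_cy + 0.5 * new_h], dim=1
    )
```

In the multi-stage scheme described in the abstract, a sequence of such adjusters would presumably be learned on the auxiliary dataset via the bi-level, EM-like training and then kept fixed to progressively refine the proposals selected by the weakly supervised detector.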
Related papers
- Collaborative Feature-Logits Contrastive Learning for Open-Set Semi-Supervised Object Detection [75.02249869573994]
In open-set scenarios, the unlabeled dataset contains both in-distribution (ID) classes and out-of-distribution (OOD) classes.
Applying semi-supervised detectors in such settings can lead to misclassifying OOD classes as ID classes.
We propose a simple yet effective method, termed Collaborative Feature-Logits Detector (CFL-Detector).
arXiv Detail & Related papers (2024-11-20T02:57:35Z) - Data Adaptive Traceback for Vision-Language Foundation Models in Image Classification [34.37262622415682]
We propose a new adaptation framework called Data Adaptive Traceback.
Specifically, we utilize a zero-shot-based method to extract the most downstream task-related subset of the pre-training data.
We adopt a pseudo-label-based semi-supervised technique to reuse the pre-training images and a vision-language contrastive learning method to address the confirmation bias issue in semi-supervised learning.
arXiv Detail & Related papers (2024-07-11T18:01:58Z) - DST-Det: Simple Dynamic Self-Training for Open-Vocabulary Object Detection [72.25697820290502]
This work introduces a straightforward and efficient strategy to identify potential novel classes through zero-shot classification.
We refer to this approach as a self-training strategy, which enhances recall and accuracy for novel classes without requiring extra annotations, datasets, or re-training (a rough sketch of such zero-shot pseudo-labeling appears after this list).
Empirical evaluations on three datasets, including LVIS, V3Det, and COCO, demonstrate significant improvements over the baseline performance.
arXiv Detail & Related papers (2023-10-02T17:52:24Z) - Improving Localization for Semi-Supervised Object Detection [3.5493798890908104]
We introduce an additional classification task for bounding box localization to improve the filtering of predicted bounding boxes.
Our experiments show that our IL-net increases SSOD performance by 1.14% AP in the limited-annotation regime.
arXiv Detail & Related papers (2022-06-21T08:39:38Z) - Open-Set Semi-Supervised Learning for 3D Point Cloud Understanding [62.17020485045456]
It is commonly assumed in semi-supervised learning (SSL) that the unlabeled data are drawn from the same distribution as that of the labeled ones.
We propose to selectively utilize unlabeled data through sample weighting, so that only conducive unlabeled data would be prioritized.
arXiv Detail & Related papers (2022-05-02T16:09:17Z) - Dense Learning based Semi-Supervised Object Detection [46.885301243656045]
Semi-supervised object detection (SSOD) aims to facilitate the training and deployment of object detectors with the help of a large amount of unlabeled data.
In this paper, we propose a DenSe Learning based anchor-free SSOD algorithm.
Experiments are conducted on MS-COCO and PASCAL-VOC, and the results show that our proposed DSL method records new state-of-the-art SSOD performance.
arXiv Detail & Related papers (2022-04-15T02:31:02Z) - Semi-supervised Batch Active Learning via Bilevel Optimization [89.37476066973336]
We formulate our approach as a data summarization problem via bilevel optimization.
We show that our method is highly effective on keyword detection tasks in the regime where only a few labeled samples are available.
arXiv Detail & Related papers (2020-10-19T16:53:24Z) - Pairwise Similarity Knowledge Transfer for Weakly Supervised Object
Localization [53.99850033746663]
We study the problem of learning localization model on target classes with weakly supervised image labels.
In this work, we argue that learning only an objectness function is a weak form of knowledge transfer.
Experiments on the COCO and ILSVRC 2013 detection datasets show that the performance of the localization model improves significantly with the inclusion of the pairwise similarity function.
arXiv Detail & Related papers (2020-03-18T17:53:33Z)
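For the DST-Det entry above, the zero-shot mining of potential novel classes could look roughly like the following sketch. The function name, the score threshold, and the use of cosine similarity against class-name text embeddings are assumptions for illustration only, not that paper's implementation.

```python
# Hypothetical sketch of the zero-shot pseudo-labeling step behind a
# DST-Det-style self-training strategy (names and thresholds are assumptions).
import torch
import torch.nn.functional as F


def mine_novel_pseudo_labels(
    region_feats: torch.Tensor,       # (N, D) embeddings of region proposals
    novel_text_embeds: torch.Tensor,  # (C, D) text embeddings of novel class names
    score_thresh: float = 0.8,
):
    """Return (proposal indices, class indices) for proposals confidently matching a novel class."""
    # Cosine similarity between each proposal and each novel-class text embedding.
    sims = F.normalize(region_feats, dim=1) @ F.normalize(novel_text_embeds, dim=1).t()
    scores, labels = sims.softmax(dim=1).max(dim=1)
    keep = scores > score_thresh
    # Kept proposals would serve as pseudo ground truth for the novel classes
    # during subsequent self-training of the open-vocabulary detector.
    return keep.nonzero(as_tuple=True)[0], labels[keep]
```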
This list is automatically generated from the titles and abstracts of the papers on this site.
This site does not guarantee the quality of this list (including all information) and is not responsible for any consequences of its use.