D2DF2WOD: Learning Object Proposals for Weakly-Supervised Object
Detection via Progressive Domain Adaptation
- URL: http://arxiv.org/abs/2212.01376v1
- Date: Fri, 2 Dec 2022 18:58:03 GMT
- Title: D2DF2WOD: Learning Object Proposals for Weakly-Supervised Object
Detection via Progressive Domain Adaptation
- Authors: Yuting Wang, Ricardo Guerrero, Vladimir Pavlovic
- Abstract summary: D2DF2WOD is a Fully-to-Weakly Supervised Object Detection framework.
It uses synthetic data, annotated with precise object localization, to supplement a natural image target domain.
Our method results in consistently improved object detection and localization compared with state-of-the-art methods.
- Score: 25.41133780678981
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Weakly-supervised object detection (WSOD) models attempt to leverage
image-level annotations in lieu of accurate but costly-to-obtain object
localization labels. This oftentimes leads to substandard object detection and
localization at inference time. To tackle this issue, we propose D2DF2WOD, a
Dual-Domain Fully-to-Weakly Supervised Object Detection framework that
leverages synthetic data, annotated with precise object localization, to
supplement a natural image target domain, where only image-level labels are
available. In its warm-up domain adaptation stage, the model learns a
fully-supervised object detector (FSOD) to improve the precision of the object
proposals in the target domain, and at the same time learns
target-domain-specific and detection-aware proposal features. In its main WSOD
stage, a WSOD model is specifically tuned to the target domain. The feature
extractor and the object proposal generator of the WSOD model are built upon
the fine-tuned FSOD model. We test D2DF2WOD on five dual-domain image
benchmarks. The results show that our method results in consistently improved
object detection and localization compared with state-of-the-art methods.
Related papers
- Cross-Domain Few-Shot Object Detection via Enhanced Open-Set Object Detector [72.05791402494727]
This paper studies the challenging cross-domain few-shot object detection (CD-FSOD)
It aims to develop an accurate object detector for novel domains with minimal labeled examples.
arXiv Detail & Related papers (2024-02-05T15:25:32Z) - Detect, Augment, Compose, and Adapt: Four Steps for Unsupervised Domain
Adaptation in Object Detection [7.064953237013352]
Unsupervised domain adaptation (UDA) plays a crucial role in object detection when adapting a source-trained detector to a target domain without annotated data.
We propose a novel and effective four-step UDA approach that leverages self-supervision and trains source and target data concurrently.
Our approach achieves state-of-the-art performance, improving upon the nearest competitor by more than 2% in terms of mean Average Precision (mAP)
arXiv Detail & Related papers (2023-08-29T14:48:29Z) - Weakly-supervised Contrastive Learning for Unsupervised Object Discovery [52.696041556640516]
Unsupervised object discovery is promising due to its ability to discover objects in a generic manner.
We design a semantic-guided self-supervised learning model to extract high-level semantic features from images.
We introduce Principal Component Analysis (PCA) to localize object regions.
arXiv Detail & Related papers (2023-07-07T04:03:48Z) - STFAR: Improving Object Detection Robustness at Test-Time by
Self-Training with Feature Alignment Regularization [35.16122933158808]
Domain adaptation helps generalizing object detection models to target domain data with distribution shift.
We explore adapting an object detection model at test-time, a.k.a. test-time adaptation (TTAOD)
Our proposed method sets the state-of-the-art on test-time adaptive object detection task.
arXiv Detail & Related papers (2023-03-31T10:04:44Z) - CLIP the Gap: A Single Domain Generalization Approach for Object
Detection [60.20931827772482]
Single Domain Generalization tackles the problem of training a model on a single source domain so that it generalizes to any unseen target domain.
We propose to leverage a pre-trained vision-language model to introduce semantic domain concepts via textual prompts.
We achieve this via a semantic augmentation strategy acting on the features extracted by the detector backbone, as well as a text-based classification loss.
arXiv Detail & Related papers (2023-01-13T12:01:18Z) - Cross Domain Object Detection by Target-Perceived Dual Branch
Distillation [49.68119030818388]
Cross domain object detection is a realistic and challenging task in the wild.
We propose a novel Target-perceived Dual-branch Distillation (TDD) framework.
Our TDD significantly outperforms the state-of-the-art methods on all the benchmarks.
arXiv Detail & Related papers (2022-05-03T03:51:32Z) - An Unsupervised Domain Adaptive Approach for Multimodal 2D Object
Detection in Adverse Weather Conditions [5.217255784808035]
We propose an unsupervised domain adaptation framework to bridge the domain gap between source and target domains.
We use a data augmentation scheme that simulates weather distortions to add domain confusion and prevent overfitting on the source data.
Experiments performed on the DENSE dataset show that our method can substantially alleviate the domain gap.
arXiv Detail & Related papers (2022-03-07T18:10:40Z) - Weakly Supervised Object Localization as Domain Adaption [19.854125742336688]
Weakly supervised object localization (WSOL) focuses on localizing objects only with the supervision of image-level classification masks.
Most previous WSOL methods follow the classification activation map (CAM) that localizes objects based on the classification structure with the multi-instance learning (MIL) mechanism.
This work provides a novel perspective that models WSOL as a domain adaption (DA) task, where the score estimator trained on the source/image domain is tested on the target/pixel domain to locate objects.
arXiv Detail & Related papers (2022-03-03T13:50:22Z) - Frequency Spectrum Augmentation Consistency for Domain Adaptive Object
Detection [107.52026281057343]
We introduce a Frequency Spectrum Augmentation Consistency (FSAC) framework with four different low-frequency filter operations.
In the first stage, we utilize all the original and augmented source data to train an object detector.
In the second stage, augmented source and target data with pseudo labels are adopted to perform the self-training for prediction consistency.
arXiv Detail & Related papers (2021-12-16T04:07:01Z) - Adaptive Object Detection with Dual Multi-Label Prediction [78.69064917947624]
We propose a novel end-to-end unsupervised deep domain adaptation model for adaptive object detection.
The model exploits multi-label prediction to reveal the object category information in each image.
We introduce a prediction consistency regularization mechanism to assist object detection.
arXiv Detail & Related papers (2020-03-29T04:23:22Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.