Weakly Supervised Object Localization as Domain Adaption
- URL: http://arxiv.org/abs/2203.01714v1
- Date: Thu, 3 Mar 2022 13:50:22 GMT
- Title: Weakly Supervised Object Localization as Domain Adaption
- Authors: Lei Zhu, Qi She, Qian Chen, Yunfei You, Boyu Wang, Yanye Lu
- Abstract summary: Weakly supervised object localization (WSOL) focuses on localizing objects only with the supervision of image-level classification masks.
Most previous WSOL methods follow the classification activation map (CAM) that localizes objects based on the classification structure with the multi-instance learning (MIL) mechanism.
This work provides a novel perspective that models WSOL as a domain adaption (DA) task, where the score estimator trained on the source/image domain is tested on the target/pixel domain to locate objects.
- Score: 19.854125742336688
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Weakly supervised object localization (WSOL) focuses on localizing objects
only with the supervision of image-level classification masks. Most previous
WSOL methods follow the classification activation map (CAM) that localizes
objects based on the classification structure with the multi-instance learning
(MIL) mechanism. However, the MIL mechanism makes CAM only activate
discriminative object parts rather than the whole object, weakening its
performance for localizing objects. To avoid this problem, this work provides a
novel perspective that models WSOL as a domain adaption (DA) task, where the
score estimator trained on the source/image domain is tested on the
target/pixel domain to locate objects. Under this perspective, a DA-WSOL
pipeline is designed to better engage DA approaches into WSOL to enhance
localization performance. It utilizes a proposed target sampling strategy to
select different types of target samples. Based on these types of target
samples, domain adaption localization (DAL) loss is elaborated. It aligns the
feature distribution between the two domains by DA and makes the estimator
perceive target domain cues by Universum regularization. Experiments show that
our pipeline outperforms SOTA methods on multi benchmarks. Code are released at
\url{https://github.com/zh460045050/DA-WSOL_CVPR2022}.
Related papers
- Object-Aware Domain Generalization for Object Detection [10.28961895672321]
We propose an object-aware domain generalization (OA-DG) method for single-domain generalization in object detection.
Our method consists of data augmentation and training strategy, which are called OA-Mix and OA-Loss, respectively.
Our proposed method outperforms state-of-the-art works on standard benchmarks.
arXiv Detail & Related papers (2023-12-19T13:11:35Z) - Background Activation Suppression for Weakly Supervised Object
Localization and Semantic Segmentation [84.62067728093358]
Weakly supervised object localization and semantic segmentation aim to localize objects using only image-level labels.
New paradigm has emerged by generating a foreground prediction map to achieve pixel-level localization.
This paper presents two astonishing experimental observations on the object localization learning process.
arXiv Detail & Related papers (2023-09-22T15:44:10Z) - Weakly-supervised Contrastive Learning for Unsupervised Object Discovery [52.696041556640516]
Unsupervised object discovery is promising due to its ability to discover objects in a generic manner.
We design a semantic-guided self-supervised learning model to extract high-level semantic features from images.
We introduce Principal Component Analysis (PCA) to localize object regions.
arXiv Detail & Related papers (2023-07-07T04:03:48Z) - CLIP the Gap: A Single Domain Generalization Approach for Object
Detection [60.20931827772482]
Single Domain Generalization tackles the problem of training a model on a single source domain so that it generalizes to any unseen target domain.
We propose to leverage a pre-trained vision-language model to introduce semantic domain concepts via textual prompts.
We achieve this via a semantic augmentation strategy acting on the features extracted by the detector backbone, as well as a text-based classification loss.
arXiv Detail & Related papers (2023-01-13T12:01:18Z) - Exploiting Instance-based Mixed Sampling via Auxiliary Source Domain
Supervision for Domain-adaptive Action Detection [75.38704117155909]
We propose a novel domain adaptive action detection approach and a new adaptation protocol.
Self-training combined with cross-domain mixed sampling has shown remarkable performance gain in UDA context.
We name our proposed framework as domain-adaptive action instance mixing (DA-AIM)
arXiv Detail & Related papers (2022-09-28T22:03:25Z) - Constrained Sampling for Class-Agnostic Weakly Supervised Object
Localization [10.542859578763068]
Self-supervised vision transformers can generate accurate localization maps of the objects in an image.
We propose leveraging the multiple maps generated by the different transformer heads to acquire pseudo-labels for training a weakly-supervised object localization model.
arXiv Detail & Related papers (2022-09-09T19:58:38Z) - Discriminative Sampling of Proposals in Self-Supervised Transformers for
Weakly Supervised Object Localization [10.542859578763068]
Self-supervised vision transformers can generate accurate localization maps of the objects in an image.
We propose leveraging the multiple maps generated by the different transformer heads to acquire pseudo-labels for training a weakly-supervised object localization model.
arXiv Detail & Related papers (2022-09-09T18:33:23Z) - Bridging the Gap between Classification and Localization for Weakly
Supervised Object Localization [39.63778214094173]
Weakly supervised object localization aims to find a target object region in a given image with only weak supervision, such as image-level labels.
We find the gap between classification and localization in terms of the misalignment of the directions between an input feature and a class-specific weight.
We propose a method to align feature directions with a class-specific weight to bridge the gap.
arXiv Detail & Related papers (2022-04-01T05:49:22Z) - Seeking Similarities over Differences: Similarity-based Domain Alignment
for Adaptive Object Detection [86.98573522894961]
We propose a framework that generalizes the components commonly used by Unsupervised Domain Adaptation (UDA) algorithms for detection.
Specifically, we propose a novel UDA algorithm, ViSGA, that leverages the best design choices and introduces a simple but effective method to aggregate features at instance-level.
We show that both similarity-based grouping and adversarial training allows our model to focus on coarsely aligning feature groups, without being forced to match all instances across loosely aligned domains.
arXiv Detail & Related papers (2021-10-04T13:09:56Z) - Instance Level Affinity-Based Transfer for Unsupervised Domain
Adaptation [74.71931918541748]
We propose an instance affinity based criterion for source to target transfer during adaptation, called ILA-DA.
We first propose a reliable and efficient method to extract similar and dissimilar samples across source and target, and utilize a multi-sample contrastive loss to drive the domain alignment process.
We verify the effectiveness of ILA-DA by observing consistent improvements in accuracy over popular domain adaptation approaches on a variety of benchmark datasets.
arXiv Detail & Related papers (2021-04-03T01:33:14Z) - Rethinking the Route Towards Weakly Supervised Object Localization [28.90792512056726]
We show that weakly supervised object localization should be divided into two parts: class-agnostic object localization and object classification.
For class-agnostic object localization, we should use class-agnostic methods to generate noisy pseudo annotations and then perform bounding box regression on them without class labels.
Our PSOL models have good transferability across different datasets without fine-tuning.
arXiv Detail & Related papers (2020-02-26T08:54:20Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.