Bridging the Gap between Classification and Localization for Weakly
Supervised Object Localization
- URL: http://arxiv.org/abs/2204.00220v1
- Date: Fri, 1 Apr 2022 05:49:22 GMT
- Title: Bridging the Gap between Classification and Localization for Weakly
Supervised Object Localization
- Authors: Eunji Kim, Siwon Kim, Jungbeom Lee, Hyunwoo Kim, Sungroh Yoon
- Abstract summary: Weakly supervised object localization aims to find a target object region in a given image with only weak supervision, such as image-level labels.
We find the gap between classification and localization in terms of the misalignment of the directions between an input feature and a class-specific weight.
We propose a method to align feature directions with a class-specific weight to bridge the gap.
- Score: 39.63778214094173
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: Weakly supervised object localization aims to find a target object region in
a given image with only weak supervision, such as image-level labels. Most
existing methods use a class activation map (CAM) to generate a localization
map; however, a CAM identifies only the most discriminative parts of a target
object rather than the entire object region. In this work, we find the gap
between classification and localization in terms of the misalignment of the
directions between an input feature and a class-specific weight. We demonstrate
that the misalignment suppresses the activation of CAM in areas that are less
discriminative but belong to the target object. To bridge the gap, we propose a
method to align feature directions with a class-specific weight. The proposed
method achieves a state-of-the-art localization performance on the CUB-200-2011
and ImageNet-1K benchmarks.
Related papers
- Spatial Structure Constraints for Weakly Supervised Semantic
Segmentation [100.0316479167605]
A class activation map (CAM) can only locate the most discriminative part of objects.
We propose spatial structure constraints (SSC) for weakly supervised semantic segmentation to alleviate the unwanted object over-activation of attention expansion.
Our approach achieves 72.7% and 47.0% mIoU on the PASCAL VOC 2012 and COCO datasets, respectively.
arXiv Detail & Related papers (2024-01-20T05:25:25Z) - Bagging Regional Classification Activation Maps for Weakly Supervised
Object Localization [11.25759292976175]
BagCAMs is a plug-and-play mechanism to better project a well-trained classifier for the localization task.
Our BagCAMs adopts a proposed regional localizer generation strategy to define a set of regional localizers.
Experiments indicate that adopting our proposed BagCAMs can improve the performance of baseline WSOL methods.
arXiv Detail & Related papers (2022-07-16T03:03:01Z) - Saliency Guided Inter- and Intra-Class Relation Constraints for Weakly
Supervised Semantic Segmentation [66.87777732230884]
We propose a saliency guided Inter- and Intra-Class Relation Constrained (I$2$CRC) framework to assist the expansion of the activated object regions.
We also introduce an object guided label refinement module to take a full use of both the segmentation prediction and the initial labels for obtaining superior pseudo-labels.
arXiv Detail & Related papers (2022-06-20T03:40:56Z) - Anti-Adversarially Manipulated Attributions for Weakly Supervised
Semantic Segmentation and Object Localization [31.69344455448125]
We present an attribution map of an image that is manipulated to increase the classification score produced by a classifier before the final softmax or sigmoid layer.
This manipulation is realized in an anti-adversarial manner, so that the original image is perturbed along pixel gradients in directions opposite to those used in an adversarial attack.
In addition, we introduce a new regularization procedure that inhibits the incorrect attribution of regions unrelated to the target object and the excessive concentration of attributions on a small region of the target object.
arXiv Detail & Related papers (2022-04-11T06:18:02Z) - Weakly Supervised Object Localization as Domain Adaption [19.854125742336688]
Weakly supervised object localization (WSOL) focuses on localizing objects only with the supervision of image-level classification masks.
Most previous WSOL methods follow the classification activation map (CAM) that localizes objects based on the classification structure with the multi-instance learning (MIL) mechanism.
This work provides a novel perspective that models WSOL as a domain adaption (DA) task, where the score estimator trained on the source/image domain is tested on the target/pixel domain to locate objects.
arXiv Detail & Related papers (2022-03-03T13:50:22Z) - Anti-Adversarially Manipulated Attributions for Weakly and
Semi-Supervised Semantic Segmentation [24.4472594401663]
AdvCAM is an attribution map of an image that is manipulated to increase the classification score.
It forces regions initially considered not to be discriminative to become involved in subsequent classifications.
We achieve mIoUs of 68.0 and 76.9 for weakly and semi-supervised semantic segmentation respectively.
arXiv Detail & Related papers (2021-03-16T07:39:06Z) - Instance Localization for Self-supervised Detection Pretraining [68.24102560821623]
We propose a new self-supervised pretext task, called instance localization.
We show that integration of bounding boxes into pretraining promotes better task alignment and architecture alignment for transfer learning.
Experimental results demonstrate that our approach yields state-of-the-art transfer learning results for object detection.
arXiv Detail & Related papers (2021-02-16T17:58:57Z) - Inter-Image Communication for Weakly Supervised Localization [77.2171924626778]
Weakly supervised localization aims at finding target object regions using only image-level supervision.
We propose to leverage pixel-level similarities across different objects for learning more accurate object locations.
Our method achieves the Top-1 localization error rate of 45.17% on the ILSVRC validation set.
arXiv Detail & Related papers (2020-08-12T04:14:11Z) - Weakly-Supervised Semantic Segmentation via Sub-category Exploration [73.03956876752868]
We propose a simple yet effective approach to enforce the network to pay attention to other parts of an object.
Specifically, we perform clustering on image features to generate pseudo sub-categories labels within each annotated parent class.
We conduct extensive analysis to validate the proposed method and show that our approach performs favorably against the state-of-the-art approaches.
arXiv Detail & Related papers (2020-08-03T20:48:31Z) - Rethinking the Route Towards Weakly Supervised Object Localization [28.90792512056726]
We show that weakly supervised object localization should be divided into two parts: class-agnostic object localization and object classification.
For class-agnostic object localization, we should use class-agnostic methods to generate noisy pseudo annotations and then perform bounding box regression on them without class labels.
Our PSOL models have good transferability across different datasets without fine-tuning.
arXiv Detail & Related papers (2020-02-26T08:54:20Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.