Online Refinement of Low-level Feature Based Activation Map for Weakly
Supervised Object Localization
- URL: http://arxiv.org/abs/2110.05741v1
- Date: Tue, 12 Oct 2021 05:09:21 GMT
- Title: Online Refinement of Low-level Feature Based Activation Map for Weakly
Supervised Object Localization
- Authors: Jinheng Xie, Cheng Luo, Xiangping Zhu, Ziqi Jin, Weizeng Lu, Linlin
Shen
- Abstract summary: We present a two-stage learning framework for weakly supervised object localization (WSOL)
In the first stage, an activation map generator produces activation maps based on the low-level feature maps in the classifier.
In the second stage, we employ an evaluator to evaluate the activation maps predicted by the activation map generator.
Based on the low-level object information preserved in the first stage, the second stage model gradually generates a well-separated, complete, and compact activation map of object in the image.
- Score: 15.665479740413229
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: We present a two-stage learning framework for weakly supervised object
localization (WSOL). While most previous efforts rely on high-level feature
based CAMs (Class Activation Maps), this paper proposes to localize objects
using the low-level feature based activation maps. In the first stage, an
activation map generator produces activation maps based on the low-level
feature maps in the classifier, such that rich contextual object information is
included in an online manner. In the second stage, we employ an evaluator to
evaluate the activation maps predicted by the activation map generator. Based
on this, we further propose a weighted entropy loss, an attentive erasing, and
an area loss to drive the activation map generator to substantially reduce the
uncertainty of activations between object and background, and explore less
discriminative regions. Based on the low-level object information preserved in
the first stage, the second stage model gradually generates a well-separated,
complete, and compact activation map of object in the image, which can be
easily thresholded for accurate localization. Extensive experiments on
CUB-200-2011 and ImageNet-1K datasets show that our framework surpasses
previous methods by a large margin, which sets a new state-of-the-art for WSOL.
Related papers
- Background Activation Suppression for Weakly Supervised Object
Localization and Semantic Segmentation [84.62067728093358]
Weakly supervised object localization and semantic segmentation aim to localize objects using only image-level labels.
New paradigm has emerged by generating a foreground prediction map to achieve pixel-level localization.
This paper presents two astonishing experimental observations on the object localization learning process.
arXiv Detail & Related papers (2023-09-22T15:44:10Z) - Rethinking the Localization in Weakly Supervised Object Localization [51.29084037301646]
Weakly supervised object localization (WSOL) is one of the most popular and challenging tasks in computer vision.
Recent dividing WSOL into two parts (class-agnostic object localization and object classification) has become the state-of-the-art pipeline for this task.
We propose to replace SCR with a binary-class detector (BCD) for localizing multiple objects, where the detector is trained by discriminating the foreground and background.
arXiv Detail & Related papers (2023-08-11T14:38:51Z) - Rethinking Class Activation Maps for Segmentation: Revealing Semantic
Information in Shallow Layers by Reducing Noise [2.462953128215088]
A major limitation to the performance of the class activation maps is the small spatial resolution of the feature maps in the last layer of the convolutional neural network.
We propose a simple gradient-based denoising method to filter the noise by truncating the positive gradient.
Our proposed scheme can be easily deployed in other CAM-related methods, facilitating these methods to obtain higher-quality class activation maps.
arXiv Detail & Related papers (2023-08-04T03:04:09Z) - Weakly-supervised Contrastive Learning for Unsupervised Object Discovery [52.696041556640516]
Unsupervised object discovery is promising due to its ability to discover objects in a generic manner.
We design a semantic-guided self-supervised learning model to extract high-level semantic features from images.
We introduce Principal Component Analysis (PCA) to localize object regions.
arXiv Detail & Related papers (2023-07-07T04:03:48Z) - CREAM: Weakly Supervised Object Localization via Class RE-Activation
Mapping [18.67907876709536]
Class RE-Activation Mapping (CREAM) is a clustering-based approach to boost the activation values of the integral object regions.
CREAM achieves the state-of-the-art performance on CUB, ILSVRC and OpenImages benchmark datasets.
arXiv Detail & Related papers (2022-05-27T11:57:41Z) - Background-aware Classification Activation Map for Weakly Supervised
Object Localization [14.646874544729426]
We propose a background-aware classification activation map (B-CAM) to simultaneously learn localization scores of both object and background.
Our B-CAM can be trained in end-to-end manner based on a proposed stagger classification loss.
Experiments show that our B-CAM outperforms one-stage WSOL methods on the CUB-200, OpenImages and VOC2012 datasets.
arXiv Detail & Related papers (2021-12-29T03:12:09Z) - Background Activation Suppression for Weakly Supervised Object
Localization [11.31345656299108]
We argue for using activation value to achieve more efficient learning.
In this paper, we propose a Background Activation Suppression (BAS) method.
BAS achieves significant and consistent improvement over the baseline methods on the CUB-200-2011 and ILSVRC datasets.
arXiv Detail & Related papers (2021-12-01T15:53:40Z) - Unveiling the Potential of Structure-Preserving for Weakly Supervised
Object Localization [71.79436685992128]
We propose a two-stage approach, termed structure-preserving activation (SPA), towards fully leveraging the structure information incorporated in convolutional features for WSOL.
In the first stage, a restricted activation module (RAM) is designed to alleviate the structure-missing issue caused by the classification network.
In the second stage, we propose a post-process approach, termed self-correlation map generating (SCG) module to obtain structure-preserving localization maps.
arXiv Detail & Related papers (2021-03-08T03:04:14Z) - Rethinking Localization Map: Towards Accurate Object Perception with
Self-Enhancement Maps [78.2581910688094]
This work introduces a novel self-enhancement method to harvest accurate object localization maps and object boundaries with only category labels as supervision.
In particular, the proposed Self-Enhancement Maps achieve the state-of-the-art localization accuracy of 54.88% on ILSVRC.
arXiv Detail & Related papers (2020-06-09T12:35:55Z) - BiDet: An Efficient Binarized Object Detector [96.19708396510894]
We propose a binarized neural network learning method called BiDet for efficient object detection.
Our BiDet fully utilizes the representational capacity of the binary neural networks for object detection by redundancy removal.
Our method outperforms the state-of-the-art binary neural networks by a sizable margin.
arXiv Detail & Related papers (2020-03-09T08:16:16Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.