Rethinking Class Activation Maps for Segmentation: Revealing Semantic
Information in Shallow Layers by Reducing Noise
- URL: http://arxiv.org/abs/2308.02118v1
- Date: Fri, 4 Aug 2023 03:04:09 GMT
- Title: Rethinking Class Activation Maps for Segmentation: Revealing Semantic
Information in Shallow Layers by Reducing Noise
- Authors: Hang-Cheng Dong, Yuhao Jiang, Yingyan Huang, Jingxiao Liao, Bingguo
Liu, Dong Ye, Guodong Liu
- Abstract summary: A major limitation to the performance of the class activation maps is the small spatial resolution of the feature maps in the last layer of the convolutional neural network.
We propose a simple gradient-based denoising method to filter the noise by truncating the positive gradient.
Our proposed scheme can be easily deployed in other CAM-related methods, facilitating these methods to obtain higher-quality class activation maps.
- Score: 2.462953128215088
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Class activation maps are widely used for explaining deep neural networks.
Due to its ability to highlight regions of interest, it has evolved in recent
years as a key step in weakly supervised learning. A major limitation to the
performance of the class activation maps is the small spatial resolution of the
feature maps in the last layer of the convolutional neural network. Therefore,
we expect to generate high-resolution feature maps that result in high-quality
semantic information. In this paper, we rethink the properties of semantic
information in shallow feature maps. We find that the shallow feature maps
still have fine-grained non-discriminative features while mixing considerable
non-target noise. Furthermore, we propose a simple gradient-based denoising
method to filter the noise by truncating the positive gradient. Our proposed
scheme can be easily deployed in other CAM-related methods, facilitating these
methods to obtain higher-quality class activation maps. We evaluate the
proposed approach through a weakly-supervised semantic segmentation task, and a
large number of experiments demonstrate the effectiveness of our approach.
Related papers
- SegPrompt: Using Segmentation Map as a Better Prompt to Finetune Deep
Models for Kidney Stone Classification [62.403510793388705]
Deep learning has produced encouraging results for kidney stone classification using endoscope images.
The shortage of annotated training data poses a severe problem in improving the performance and generalization ability of the trained model.
We propose SegPrompt to alleviate the data shortage problems by exploiting segmentation maps from two aspects.
arXiv Detail & Related papers (2023-03-15T01:30:48Z) - Abs-CAM: A Gradient Optimization Interpretable Approach for Explanation
of Convolutional Neural Networks [7.71412567705588]
Class activation mapping-based method has been widely used to interpret the internal decisions of models in computer vision tasks.
We propose an Absolute value Class Activation Mapping-based (Abs-CAM) method, which optimize the gradients derived from the backpropagation.
The framework of Abs-CAM is divided into two phases: generating initial saliency map and generating final saliency map.
arXiv Detail & Related papers (2022-07-08T02:06:46Z) - Poly-CAM: High resolution class activation map for convolutional neural
networks [88.29660600055715]
saliency maps derived from convolutional neural networks generally fail in localizing with accuracy the image features justifying the network prediction.
This is because those maps are either low-resolution as for CAM [Zhou et al., 2016], or smooth as for perturbation-based methods [Zeiler and Fergus, 2014], or do correspond to a large number of widespread peaky spots.
In contrast, our work proposes to combine the information from earlier network layers with the one from later layers to produce a high resolution Class Activation Map.
arXiv Detail & Related papers (2022-04-28T09:06:19Z) - Reducing Information Bottleneck for Weakly Supervised Semantic
Segmentation [17.979336178991083]
Weakly supervised semantic segmentation produces pixel-level localization from class labels.
A classifier trained on such labels is likely to focus on a small discriminative region of the target object.
We propose a method to reduce the information bottleneck by removing the last activation function.
In addition, we introduce a new pooling method that further encourages the transmission of information from non-discriminative regions to the classification.
arXiv Detail & Related papers (2021-10-13T06:49:45Z) - Online Refinement of Low-level Feature Based Activation Map for Weakly
Supervised Object Localization [15.665479740413229]
We present a two-stage learning framework for weakly supervised object localization (WSOL)
In the first stage, an activation map generator produces activation maps based on the low-level feature maps in the classifier.
In the second stage, we employ an evaluator to evaluate the activation maps predicted by the activation map generator.
Based on the low-level object information preserved in the first stage, the second stage model gradually generates a well-separated, complete, and compact activation map of object in the image.
arXiv Detail & Related papers (2021-10-12T05:09:21Z) - TSG: Target-Selective Gradient Backprop for Probing CNN Visual Saliency [72.9106103283475]
We study the visual saliency, a.k.a. visual explanation, to interpret convolutional neural networks.
Inspired by those observations, we propose a novel visual saliency framework, termed Target-Selective Gradient (TSG) backprop.
The proposed TSG consists of two components, namely, TSG-Conv and TSG-FC, which rectify the gradients for convolutional layers and fully-connected layers, respectively.
arXiv Detail & Related papers (2021-10-11T12:00:20Z) - CAMERAS: Enhanced Resolution And Sanity preserving Class Activation
Mapping for image saliency [61.40511574314069]
Backpropagation image saliency aims at explaining model predictions by estimating model-centric importance of individual pixels in the input.
We propose CAMERAS, a technique to compute high-fidelity backpropagation saliency maps without requiring any external priors.
arXiv Detail & Related papers (2021-06-20T08:20:56Z) - Weakly-Supervised Semantic Segmentation via Sub-category Exploration [73.03956876752868]
We propose a simple yet effective approach to enforce the network to pay attention to other parts of an object.
Specifically, we perform clustering on image features to generate pseudo sub-categories labels within each annotated parent class.
We conduct extensive analysis to validate the proposed method and show that our approach performs favorably against the state-of-the-art approaches.
arXiv Detail & Related papers (2020-08-03T20:48:31Z) - Understanding Integrated Gradients with SmoothTaylor for Deep Neural
Network Attribution [70.78655569298923]
Integrated Gradients as an attribution method for deep neural network models offers simple implementability.
It suffers from noisiness of explanations which affects the ease of interpretability.
The SmoothGrad technique is proposed to solve the noisiness issue and smoothen the attribution maps of any gradient-based attribution method.
arXiv Detail & Related papers (2020-04-22T10:43:19Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.