Beyond Discriminative Regions: Saliency Maps as Alternatives to CAMs for
Weakly Supervised Semantic Segmentation
- URL: http://arxiv.org/abs/2308.11052v1
- Date: Mon, 21 Aug 2023 21:30:48 GMT
- Title: Beyond Discriminative Regions: Saliency Maps as Alternatives to CAMs for
Weakly Supervised Semantic Segmentation
- Authors: M. Maruf, Arka Daw, Amartya Dutta, Jie Bu, Anuj Karpatne
- Abstract summary: This paper provides a comprehensive comparison between saliencies and CAMs for Weakly Supervised Semantic (WS3)
We demonstrate the effectiveness of saliencies in addressing the limitation of CAMs through our empirical studies on benchmark datasets.
- Score: 7.518769185369959
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: In recent years, several Weakly Supervised Semantic Segmentation (WS3)
methods have been proposed that use class activation maps (CAMs) generated by a
classifier to produce pseudo-ground truths for training segmentation models.
While CAMs are good at highlighting discriminative regions (DR) of an image,
they are known to disregard regions of the object that do not contribute to the
classifier's prediction, termed non-discriminative regions (NDR). In contrast,
attribution methods such as saliency maps provide an alternative approach for
assigning a score to every pixel based on its contribution to the
classification prediction. This paper provides a comprehensive comparison
between saliencies and CAMs for WS3. Our study includes multiple perspectives
on understanding their similarities and dissimilarities. Moreover, we provide
new evaluation metrics that perform a comprehensive assessment of WS3
performance of alternative methods w.r.t. CAMs. We demonstrate the
effectiveness of saliencies in addressing the limitation of CAMs through our
empirical studies on benchmark datasets. Furthermore, we propose random
cropping as a stochastic aggregation technique that improves the performance of
saliency, making it a strong alternative to CAM for WS3.
Related papers
- Generalize or Detect? Towards Robust Semantic Segmentation Under Multiple Distribution Shifts [56.57141696245328]
In open-world scenarios, where both novel classes and domains may exist, an ideal segmentation model should detect anomaly classes for safety.
Existing methods often struggle to distinguish between domain-level and semantic-level distribution shifts.
arXiv Detail & Related papers (2024-11-06T11:03:02Z) - Volumetric Semantically Consistent 3D Panoptic Mapping [77.13446499924977]
We introduce an online 2D-to-3D semantic instance mapping algorithm aimed at generating semantic 3D maps suitable for autonomous agents in unstructured environments.
It introduces novel ways of integrating semantic prediction confidence during mapping, producing semantic and instance-consistent 3D regions.
The proposed method achieves accuracy superior to the state of the art on public large-scale datasets, improving on a number of widely used metrics.
arXiv Detail & Related papers (2023-09-26T08:03:10Z) - Fine-grained Recognition with Learnable Semantic Data Augmentation [68.48892326854494]
Fine-grained image recognition is a longstanding computer vision challenge.
We propose diversifying the training data at the feature-level to alleviate the discriminative region loss problem.
Our method significantly improves the generalization performance on several popular classification networks.
arXiv Detail & Related papers (2023-09-01T11:15:50Z) - Discrepancy-based Active Learning for Weakly Supervised Bleeding
Segmentation in Wireless Capsule Endoscopy Images [36.39723547760312]
This paper proposes a new Discrepancy-basEd Active Learning approach to bridge the gap between CAMs and ground truths with a few annotations.
Specifically, to liberate labor, we design a novel discrepancy decoder and a CAMPUS criterion to replace the noisy CAMs with accurate model predictions and a few human labels.
Our method outperforms the state-of-the-art active learning methods and reaches comparable performance to those trained with full annotated datasets with only 10% of the training data labeled.
arXiv Detail & Related papers (2023-08-09T15:04:17Z) - Decom--CAM: Tell Me What You See, In Details! Feature-Level Interpretation via Decomposition Class Activation Map [23.71680014689873]
Class Activation Map (CAM) is widely used to interpret deep model predictions by highlighting object location.
This paper proposes a new two-stage interpretability method called the Decomposition Class Activation Map (Decom-CAM)
Our experiments demonstrate that the proposed Decom-CAM outperforms current state-of-the-art methods significantly.
arXiv Detail & Related papers (2023-05-27T14:33:01Z) - Extracting Class Activation Maps from Non-Discriminative Features as
well [23.968856513180032]
Class activation maps (CAM) from a classification model often results in poor coverage on foreground objects.
We introduce a new computation method for CAM that explicitly captures non-discriminative features as well.
We call the resultant K cluster centers local prototypes - represent local semantics like the "head", "leg", and "body" of "sheep"
arXiv Detail & Related papers (2023-03-18T04:47:42Z) - PointInst3D: Segmenting 3D Instances by Points [136.7261709896713]
We propose a fully-convolutional 3D point cloud instance segmentation method that works in a per-point prediction fashion.
We find the key to its success is assigning a suitable target to each sampled point.
Our approach achieves promising results on both ScanNet and S3DIS benchmarks.
arXiv Detail & Related papers (2022-04-25T02:41:46Z) - Importance Sampling CAMs for Weakly-Supervised Segmentation [16.86352815414646]
Class activation maps (CAMs) can be used to localize and segment objects in images by means of class activation maps (CAMs)
In this work, we approach both problems with two contributions for improving CAM learning.
We conduct experiments on the PASCAL VOC 2012 benchmark dataset to demonstrate that these modifications significantly increase the performance in terms of contour accuracy.
arXiv Detail & Related papers (2022-03-23T14:54:29Z) - Complementary Patch for Weakly Supervised Semantic Segmentation [25.990386258122726]
We formulate the expansion of object regions in Class Activation Map (CAM) as an increase in information.
We propose a novel Complementary Patch (CP) Representation and prove that the information of the sum of the CAMs by a pair of input images with complementary hidden (patched) parts, namely CP Pair, is greater than or equal to the information of the baseline CAM.
To further improve the quality of the CAMs, we propose a Pixel-Region Correlation Module (PRCM) to augment the contextual information.
arXiv Detail & Related papers (2021-08-09T07:50:54Z) - MCDAL: Maximum Classifier Discrepancy for Active Learning [74.73133545019877]
Recent state-of-the-art active learning methods have mostly leveraged Generative Adversarial Networks (GAN) for sample acquisition.
We propose in this paper a novel active learning framework that we call Maximum Discrepancy for Active Learning (MCDAL)
In particular, we utilize two auxiliary classification layers that learn tighter decision boundaries by maximizing the discrepancies among them.
arXiv Detail & Related papers (2021-07-23T06:57:08Z) - Region Comparison Network for Interpretable Few-shot Image
Classification [97.97902360117368]
Few-shot image classification has been proposed to effectively use only a limited number of labeled examples to train models for new classes.
We propose a metric learning based method named Region Comparison Network (RCN), which is able to reveal how few-shot learning works.
We also present a new way to generalize the interpretability from the level of tasks to categories.
arXiv Detail & Related papers (2020-09-08T07:29:05Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.