MetaCAM: Ensemble-Based Class Activation Map
- URL: http://arxiv.org/abs/2307.16863v1
- Date: Mon, 31 Jul 2023 17:20:48 GMT
- Title: MetaCAM: Ensemble-Based Class Activation Map
- Authors: Emily Kaczmarek, Olivier X. Miguel, Alexa C. Bowie, Robin Ducharme,
Alysha L.J. Dingwall-Harvey, Steven Hawken, Christine M. Armour, Mark C.
Walker, Kevin Dick
- Abstract summary: Class Activation Maps (CAMs) are an increasingly popular category of visual explanation methods for CNNs.
We propose MetaCAM, an ensemble-based method for combining multiple existing CAM methods.
We show that MetaCAM outperforms existing CAMs and refines the most salient regions of images used for model predictions.
- Score: 0.0
- License: http://creativecommons.org/licenses/by-sa/4.0/
- Abstract: The need for clear, trustworthy explanations of deep learning model
predictions is essential for high-criticality fields, such as medicine and
biometric identification. Class Activation Maps (CAMs) are an increasingly
popular category of visual explanation methods for Convolutional Neural
Networks (CNNs). However, the performance of individual CAMs depends largely on
experimental parameters such as the selected image, target class, and model.
Here, we propose MetaCAM, an ensemble-based method for combining multiple
existing CAM methods based on the consensus of the top-k% most highly activated
pixels across component CAMs. We perform experiments to quantifiably determine
the optimal combination of 11 CAMs for a given MetaCAM experiment. A new method
denoted Cumulative Residual Effect (CRE) is proposed to summarize large-scale
ensemble-based experiments. We also present adaptive thresholding and
demonstrate how it can be applied to individual CAMs to improve their
performance, measured using pixel perturbation method Remove and Debias (ROAD).
Lastly, we show that MetaCAM outperforms existing CAMs and refines the most
salient regions of images used for model predictions. In a specific example,
MetaCAM improved ROAD performance to 0.393 compared to 11 individual CAMs with
ranges from -0.101-0.172, demonstrating the importance of combining CAMs
through an ensembling method and adaptive thresholding.
Related papers
- BroadCAM: Outcome-agnostic Class Activation Mapping for Small-scale
Weakly Supervised Applications [69.22739434619531]
We propose an outcome-agnostic CAM approach, called BroadCAM, for small-scale weakly supervised applications.
By evaluating BroadCAM on VOC2012 and BCSS-WSSS for WSSS and OpenImages30k for WSOL, BroadCAM demonstrates superior performance.
arXiv Detail & Related papers (2023-09-07T06:45:43Z) - AME-CAM: Attentive Multiple-Exit CAM for Weakly Supervised Segmentation
on MRI Brain Tumor [20.70840352243769]
We propose a novel CAM method, Attentive Multiple-Exit CAM (AME-CAM), that extracts activation maps from multiple resolutions to hierarchically aggregate and improve prediction accuracy.
We evaluate our method on the BraTS 2021 dataset and show that it outperforms state-of-the-art methods.
arXiv Detail & Related papers (2023-06-26T08:24:37Z) - Opti-CAM: Optimizing saliency maps for interpretability [10.122899813335694]
We introduce Opti-CAM, combining ideas from CAM-based and masking-based approaches.
Our saliency map is a linear combination of feature maps, where weights are optimized per image.
On several datasets, Opti-CAM largely outperforms other CAM-based approaches according to the most relevant classification metrics.
arXiv Detail & Related papers (2023-01-17T16:44:48Z) - Learning with MISELBO: The Mixture Cookbook [62.75516608080322]
We present the first ever mixture of variational approximations for a normalizing flow-based hierarchical variational autoencoder (VAE) with VampPrior and a PixelCNN decoder network.
We explain this cooperative behavior by drawing a novel connection between VI and adaptive importance sampling.
We obtain state-of-the-art results among VAE architectures in terms of negative log-likelihood on the MNIST and FashionMNIST datasets.
arXiv Detail & Related papers (2022-09-30T15:01:35Z) - IL-MCAM: An interactive learning and multi-channel attention
mechanism-based weakly supervised colorectal histopathology image
classification approach [23.520258872268556]
We propose an IL-MCAM framework, based on attention mechanisms and interactive learning.
The proposed IL-MCAM framework includes two stages: automatic learning (AL) and interactivity learning (IL)
In the IL stage, the proposed IL-MCAM framework continuously adds misclassified images to the training set in an interactive approach, which improves the classification ability of the MCAM model.
arXiv Detail & Related papers (2022-06-07T15:03:05Z) - F-CAM: Full Resolution CAM via Guided Parametric Upscaling [20.609010268320013]
Class Activation Mapping (CAM) methods have recently gained much attention for weakly-supervised object localization (WSOL) tasks.
CAM methods are typically integrated within off-the-shelf CNN backbones, such as ResNet50.
We introduce a generic method for parametric upscaling of CAMs that allows constructing accurate full resolution CAMs.
arXiv Detail & Related papers (2021-09-15T04:45:20Z) - Joint Noise-Tolerant Learning and Meta Camera Shift Adaptation for
Unsupervised Person Re-Identification [60.36551512902312]
unsupervised person re-identification (re-ID) aims to learn discriminative models with unlabeled data.
One popular method is to obtain pseudo-label by clustering and use them to optimize the model.
In this paper, we propose a unified framework to solve both problems.
arXiv Detail & Related papers (2021-03-08T09:13:06Z) - Use HiResCAM instead of Grad-CAM for faithful explanations of
convolutional neural networks [89.56292219019163]
Explanation methods facilitate the development of models that learn meaningful concepts and avoid exploiting spurious correlations.
We illustrate a previously unrecognized limitation of the popular neural network explanation method Grad-CAM.
We propose HiResCAM, a class-specific explanation method that is guaranteed to highlight only the locations the model used to make each prediction.
arXiv Detail & Related papers (2020-11-17T19:26:14Z) - IS-CAM: Integrated Score-CAM for axiomatic-based explanations [0.0]
We propose IS-CAM (Integrated Score-CAM), where we introduce the integration operation within the Score-CAM pipeline to achieve visually sharper attribution maps.
Our method is evaluated on 2000 randomly selected images from the ILSVRC 2012 Validation dataset, which proves the versatility of IS-CAM to account for different models and methods.
arXiv Detail & Related papers (2020-10-06T21:03:03Z) - Deep Keypoint-Based Camera Pose Estimation with Geometric Constraints [80.60538408386016]
Estimating relative camera poses from consecutive frames is a fundamental problem in visual odometry.
We propose an end-to-end trainable framework consisting of learnable modules for detection, feature extraction, matching and outlier rejection.
arXiv Detail & Related papers (2020-07-29T21:41:31Z) - VMLoc: Variational Fusion For Learning-Based Multimodal Camera
Localization [46.607930208613574]
We propose an end-to-end framework, termed VMLoc, to fuse different sensor inputs into a common latent space.
Unlike previous multimodal variational works directly adapting the objective function of vanilla variational auto-encoder, we show how camera localization can be accurately estimated.
arXiv Detail & Related papers (2020-03-12T14:52:10Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.