Gaze-Guided Class Activation Mapping: Leveraging Human Attention for
Network Attention in Chest X-rays Classification
- URL: http://arxiv.org/abs/2202.07107v1
- Date: Tue, 15 Feb 2022 00:33:23 GMT
- Title: Gaze-Guided Class Activation Mapping: Leveraging Human Attention for
Network Attention in Chest X-rays Classification
- Authors: Hongzhi Zhu, Septimiu Salcudean, Robert Rohling
- Abstract summary: This paper describes a gaze-guided class activation mapping (GG-CAM) method to directly regulate the formation of network attention.
GG-CAM is a lightweight ($3$ additional trainable parameters for regulating the learning process) and generic extension that can be easily applied to most classification convolutional neural networks (CNN)
Comparative experiments suggest that two standard CNNs with the GG-CAM extension achieve significantly greater classification performance.
- Score: 3.8637285238278434
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: The increased availability and accuracy of eye-gaze tracking technology has
sparked attention-related research in psychology, neuroscience, and, more
recently, computer vision and artificial intelligence. The attention mechanism
in artificial neural networks is known to improve learning tasks. However, no
previous research has combined the network attention and human attention. This
paper describes a gaze-guided class activation mapping (GG-CAM) method to
directly regulate the formation of network attention based on expert
radiologists' visual attention for the chest X-ray pathology classification
problem, which remains challenging due to the complex and often nuanced
differences among images. GG-CAM is a lightweight ($3$ additional trainable
parameters for regulating the learning process) and generic extension that can
be easily applied to most classification convolutional neural networks (CNN).
GG-CAM-modified CNNs do not require human attention as an input when fully
trained. Comparative experiments suggest that two standard CNNs with the GG-CAM
extension achieve significantly greater classification performance. The median
area under the curve (AUC) metrics for ResNet50 increases from $0.721$ to
$0.776$. For EfficientNetv2 (s), the median AUC increases from $0.723$ to
$0.801$. The GG-CAM also brings better interpretability of the network that
facilitates the weakly-supervised pathology localization and analysis.
Related papers
- Gaze-directed Vision GNN for Mitigating Shortcut Learning in Medical Image [6.31072075551707]
We propose a novel gaze-directed Vision GNN (called GD-ViG) to leverage the visual patterns of radiologists from gaze as expert knowledge.
The experiments on two public medical image datasets demonstrate that GD-ViG outperforms the state-of-the-art methods.
arXiv Detail & Related papers (2024-06-20T07:16:41Z) - Affine-Consistent Transformer for Multi-Class Cell Nuclei Detection [76.11864242047074]
We propose a novel Affine-Consistent Transformer (AC-Former), which directly yields a sequence of nucleus positions.
We introduce an Adaptive Affine Transformer (AAT) module, which can automatically learn the key spatial transformations to warp original images for local network training.
Experimental results demonstrate that the proposed method significantly outperforms existing state-of-the-art algorithms on various benchmarks.
arXiv Detail & Related papers (2023-10-22T02:27:02Z) - How neural networks learn to classify chaotic time series [77.34726150561087]
We study the inner workings of neural networks trained to classify regular-versus-chaotic time series.
We find that the relation between input periodicity and activation periodicity is key for the performance of LKCNN models.
arXiv Detail & Related papers (2023-06-04T08:53:27Z) - Backdoor Attack Detection in Computer Vision by Applying Matrix
Factorization on the Weights of Deep Networks [6.44397009982949]
We introduce a novel method for backdoor detection that extracts features from pre-trained DNN's weights.
In comparison to other detection techniques, this has a number of benefits, such as not requiring any training data.
Our method outperforms the competing algorithms in terms of efficiency and is more accurate, helping to ensure the safe application of deep learning and AI.
arXiv Detail & Related papers (2022-12-15T20:20:18Z) - SAR Despeckling Using Overcomplete Convolutional Networks [53.99620005035804]
despeckling is an important problem in remote sensing as speckle degrades SAR images.
Recent studies show that convolutional neural networks(CNNs) outperform classical despeckling methods.
This study employs an overcomplete CNN architecture to focus on learning low-level features by restricting the receptive field.
We show that the proposed network improves despeckling performance compared to recent despeckling methods on synthetic and real SAR images.
arXiv Detail & Related papers (2022-05-31T15:55:37Z) - CNN Attention Guidance for Improved Orthopedics Radiographic Fracture
Classification [37.07168182420638]
Convolutional neural networks (CNNs) have gained significant popularity in orthopedic imaging in recent years due to their ability to solve fracture classification problems.
A common criticism of CNNs is their opaque learning and reasoning process, making it difficult to trust machine diagnosis and the subsequent adoption of such algorithms in clinical setting.
This paper explores the effectiveness of regularizing CNN network with human-provided attention guidance on where in the image the network should look for answering clues.
arXiv Detail & Related papers (2022-03-21T01:07:42Z) - Visual Attention Network [90.0753726786985]
We propose a novel large kernel attention (LKA) module to enable self-adaptive and long-range correlations in self-attention.
We also introduce a novel neural network based on LKA, namely Visual Attention Network (VAN)
VAN outperforms the state-of-the-art vision transformers and convolutional neural networks with a large margin in extensive experiments.
arXiv Detail & Related papers (2022-02-20T06:35:18Z) - Stain Normalized Breast Histopathology Image Recognition using
Convolutional Neural Networks for Cancer Detection [9.826027427965354]
Recent advances have shown that the convolutional Neural Network (CNN) architectures can be used to design a Computer Aided Diagnostic (CAD) System for breast cancer detection.
We consider some contemporary CNN models for binary classification of breast histopathology images.
We have validated the trained CNN networks on a publicly available BreaKHis dataset, for 200x and 400x magnified histopathology images.
arXiv Detail & Related papers (2022-01-04T03:09:40Z) - BreakingBED -- Breaking Binary and Efficient Deep Neural Networks by
Adversarial Attacks [65.2021953284622]
We study robustness of CNNs against white-box and black-box adversarial attacks.
Results are shown for distilled CNNs, agent-based state-of-the-art pruned models, and binarized neural networks.
arXiv Detail & Related papers (2021-03-14T20:43:19Z) - Curriculum By Smoothing [52.08553521577014]
Convolutional Neural Networks (CNNs) have shown impressive performance in computer vision tasks such as image classification, detection, and segmentation.
We propose an elegant curriculum based scheme that smoothes the feature embedding of a CNN using anti-aliasing or low-pass filters.
As the amount of information in the feature maps increases during training, the network is able to progressively learn better representations of the data.
arXiv Detail & Related papers (2020-03-03T07:27:44Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.