Attend Who is Weak: Pruning-assisted Medical Image Localization under
Sophisticated and Implicit Imbalances
- URL: http://arxiv.org/abs/2212.02675v1
- Date: Tue, 6 Dec 2022 00:32:03 GMT
- Title: Attend Who is Weak: Pruning-assisted Medical Image Localization under
Sophisticated and Implicit Imbalances
- Authors: Ajay Jaiswal, Tianlong Chen, Justin F. Rousseau, Yifan Peng, Ying
Ding, Zhangyang Wang
- Abstract summary: Deep neural networks (DNNs) have rapidly become a textitde facto choice for medical image understanding tasks.
In this paper, we propose to use pruning to automatically and adaptively identify textithard-to-learn (HTL) training samples.
We also present an interesting demographic analysis which illustrates HTLs ability to capture complex demographic imbalances.
- Score: 102.68466217374655
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Deep neural networks (DNNs) have rapidly become a \textit{de facto} choice
for medical image understanding tasks. However, DNNs are notoriously fragile to
the class imbalance in image classification. We further point out that such
imbalance fragility can be amplified when it comes to more sophisticated tasks
such as pathology localization, as imbalances in such problems can have highly
complex and often implicit forms of presence. For example, different pathology
can have different sizes or colors (w.r.t.the background), different underlying
demographic distributions, and in general different difficulty levels to
recognize, even in a meticulously curated balanced distribution of training
data. In this paper, we propose to use pruning to automatically and adaptively
identify \textit{hard-to-learn} (HTL) training samples, and improve pathology
localization by attending them explicitly, during training in
\textit{supervised, semi-supervised, and weakly-supervised} settings. Our main
inspiration is drawn from the recent finding that deep classification models
have difficult-to-memorize samples and those may be effectively exposed through
network pruning \cite{hooker2019compressed} - and we extend such observation
beyond classification for the first time. We also present an interesting
demographic analysis which illustrates HTLs ability to capture complex
demographic imbalances. Our extensive experiments on the Skin Lesion
Localization task in multiple training settings by paying additional attention
to HTLs show significant improvement of localization performance by
$\sim$2-3\%.
Related papers
- Leave No Stone Unturned: Mine Extra Knowledge for Imbalanced Facial
Expression Recognition [39.08466869516571]
Facial expression data is characterized by a significant imbalance, with most collected data showing happy or neutral expressions and fewer instances of fear or disgust.
This imbalance poses challenges to facial expression recognition (FER) models, hindering their ability to fully understand various human emotional states.
Existing FER methods typically report overall accuracy on highly imbalanced test sets but exhibit low performance in terms of the mean accuracy across all expression classes.
arXiv Detail & Related papers (2023-10-30T15:26:26Z) - Class Attention to Regions of Lesion for Imbalanced Medical Image
Recognition [59.28732531600606]
We propose a framework named textbfClass textbfAttention to textbfREgions of the lesion (CARE) to handle data imbalance issues.
The CARE framework needs bounding boxes to represent the lesion regions of rare diseases.
Results show that the CARE variants with automated bounding box generation are comparable to the original CARE framework.
arXiv Detail & Related papers (2023-07-19T15:19:02Z) - Analyzing the Effects of Handling Data Imbalance on Learned Features
from Medical Images by Looking Into the Models [50.537859423741644]
Training a model on an imbalanced dataset can introduce unique challenges to the learning problem.
We look deeper into the internal units of neural networks to observe how handling data imbalance affects the learned features.
arXiv Detail & Related papers (2022-04-04T09:38:38Z) - Balanced-MixUp for Highly Imbalanced Medical Image Classification [19.338350044289736]
We propose a novel mechanism for sampling training data based on the popular MixUp regularization technique.
We experiment with a highly imbalanced dataset of retinal images and a long-tail dataset of gastro-intestinal video frames.
arXiv Detail & Related papers (2021-09-20T21:31:31Z) - Analyzing Overfitting under Class Imbalance in Neural Networks for Image
Segmentation [19.259574003403998]
In image segmentation neural networks may overfit to the foreground samples from small structures.
In this study, we provide new insights on the problem of overfitting under class imbalance by inspecting the network behavior.
arXiv Detail & Related papers (2021-02-20T14:57:58Z) - An anatomically-informed 3D CNN for brain aneurysm classification with
weak labels [0.2010294990327175]
We address the task of brain aneurysm detection as a patch-wise binary classification with weak labels.
Our approach comes with the non-trivial challenge of the data set creation.
We propose a novel, anatomically-driven approach by using a multi-scale and multi-input 3D Convolutional Neural Network.
arXiv Detail & Related papers (2020-11-27T08:00:54Z) - Deep Low-Shot Learning for Biological Image Classification and
Visualization from Limited Training Samples [52.549928980694695]
In situ hybridization (ISH) gene expression pattern images from the same developmental stage are compared.
labeling training data with precise stages is very time-consuming even for biologists.
We propose a deep two-step low-shot learning framework to accurately classify ISH images using limited training images.
arXiv Detail & Related papers (2020-10-20T06:06:06Z) - Domain Generalization for Medical Imaging Classification with
Linear-Dependency Regularization [59.5104563755095]
We introduce a simple but effective approach to improve the generalization capability of deep neural networks in the field of medical imaging classification.
Motivated by the observation that the domain variability of the medical images is to some extent compact, we propose to learn a representative feature space through variational encoding.
arXiv Detail & Related papers (2020-09-27T12:30:30Z) - Multi-label Thoracic Disease Image Classification with Cross-Attention
Networks [65.37531731899837]
We propose a novel scheme of Cross-Attention Networks (CAN) for automated thoracic disease classification from chest x-ray images.
We also design a new loss function that beyond cross-entropy loss to help cross-attention process and is able to overcome the imbalance between classes and easy-dominated samples within each class.
arXiv Detail & Related papers (2020-07-21T14:37:00Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.