Learning disentangled representations for explainable chest X-ray
classification using Dirichlet VAEs
- URL: http://arxiv.org/abs/2302.02979v1
- Date: Mon, 6 Feb 2023 18:10:08 GMT
- Title: Learning disentangled representations for explainable chest X-ray
classification using Dirichlet VAEs
- Authors: Rachael Harkness, Alejandro F Frangi, Kieran Zucker, Nishant Ravikumar
- Abstract summary: This study explores the use of the Dirichlet Variational Autoencoder (DirVAE) for learning disentangled latent representations of chest X-ray (CXR) images.
The predictive capacity of multi-modal latent representations learned by DirVAE models is investigated through implementation of an auxiliary multi-label classification task.
- Score: 68.73427163074015
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: This study explores the use of the Dirichlet Variational Autoencoder (DirVAE)
for learning disentangled latent representations of chest X-ray (CXR) images.
Our working hypothesis is that distributional sparsity, as facilitated by the
Dirichlet prior, will encourage disentangled feature learning for the complex
task of multi-label classification of CXR images. The DirVAE is trained using
CXR images from the CheXpert database, and the predictive capacity of
multi-modal latent representations learned by DirVAE models is investigated
through implementation of an auxiliary multi-label classification task, with a
view to enforce separation of latent factors according to class-specific
features. The predictive performance and explainability of the latent space
learned using the DirVAE were quantitatively and qualitatively assessed,
respectively, and compared with a standard Gaussian prior-VAE (GVAE). We
introduce a new approach for explainable multi-label classification in which we
conduct gradient-guided latent traversals for each class of interest. Study
findings indicate that the DirVAE is able to disentangle latent factors into
class-specific visual features, a property not afforded by the GVAE, and
achieve a marginal increase in predictive performance relative to GVAE. We
generate visual examples to show that our explainability method, when applied
to the trained DirVAE, is able to highlight regions in CXR images that are
clinically relevant to the class(es) of interest and additionally, can identify
cases where classification relies on spurious feature correlations.
Related papers
- Intuitionistic Fuzzy Cognitive Maps for Interpretable Image Classification [2.130156029408832]
This paper introduces a novel framework, named Interpretable Intuitionistic FCM (I2FCM) which is domain-independent, simple to implement, and can be applied on CNN models.
To the best of our knowledge this is the first time iFCMs are applied for image classification.
arXiv Detail & Related papers (2024-08-07T12:58:39Z) - FlowSDF: Flow Matching for Medical Image Segmentation Using Distance Transforms [60.195642571004804]
We propose FlowSDF, an image-guided conditional flow matching framework to represent the signed distance function (SDF)
By learning a vector field that is directly related to the probability path of a conditional distribution of SDFs, we can accurately sample from the distribution of segmentation masks.
arXiv Detail & Related papers (2024-05-28T11:47:12Z) - A Probabilistic Approach to Self-Supervised Learning using Cyclical
Stochastic Gradient MCMC [8.027994148508844]
We present a practical self-supervised learning method with Cyclical Gradient Hamiltonian Monte Carlo (cSGHMC)
Within this framework, we place a prior over the parameters of a self-supervised learning model and use cSGHMC to approximate the high dimensional and multimodal posterior distribution over the embeddings.
We provide experimental results on multiple classification tasks on four challenging datasets.
arXiv Detail & Related papers (2023-08-02T16:52:56Z) - Score-Based Generative Models for Medical Image Segmentation using
Signed Distance Functions [11.137438870686026]
We propose a conditional score-based generative modeling framework to represent the signed distance function (SDF)
The advantage of leveraging the SDF is a more natural distortion when compared to that of binary masks.
arXiv Detail & Related papers (2023-03-10T14:55:35Z) - Fine-Grained Visual Classification using Self Assessment Classifier [12.596520707449027]
Extracting discriminative features plays a crucial role in the fine-grained visual classification task.
In this paper, we introduce a Self Assessment, which simultaneously leverages the representation of the image and top-k prediction classes.
We show that our method achieves new state-of-the-art results on CUB200-2011, Stanford Dog, and FGVC Aircraft datasets.
arXiv Detail & Related papers (2022-05-21T07:41:27Z) - MCDAL: Maximum Classifier Discrepancy for Active Learning [74.73133545019877]
Recent state-of-the-art active learning methods have mostly leveraged Generative Adversarial Networks (GAN) for sample acquisition.
We propose in this paper a novel active learning framework that we call Maximum Discrepancy for Active Learning (MCDAL)
In particular, we utilize two auxiliary classification layers that learn tighter decision boundaries by maximizing the discrepancies among them.
arXiv Detail & Related papers (2021-07-23T06:57:08Z) - Transforming Feature Space to Interpret Machine Learning Models [91.62936410696409]
This contribution proposes a novel approach that interprets machine-learning models through the lens of feature space transformations.
It can be used to enhance unconditional as well as conditional post-hoc diagnostic tools.
A case study on remote-sensing landcover classification with 46 features is used to demonstrate the potential of the proposed approach.
arXiv Detail & Related papers (2021-04-09T10:48:11Z) - Attention Model Enhanced Network for Classification of Breast Cancer
Image [54.83246945407568]
AMEN is formulated in a multi-branch fashion with pixel-wised attention model and classification submodular.
To focus more on subtle detail information, the sample image is enhanced by the pixel-wised attention map generated from former branch.
Experiments conducted on three benchmark datasets demonstrate the superiority of the proposed method under various scenarios.
arXiv Detail & Related papers (2020-10-07T08:44:21Z) - Region Comparison Network for Interpretable Few-shot Image
Classification [97.97902360117368]
Few-shot image classification has been proposed to effectively use only a limited number of labeled examples to train models for new classes.
We propose a metric learning based method named Region Comparison Network (RCN), which is able to reveal how few-shot learning works.
We also present a new way to generalize the interpretability from the level of tasks to categories.
arXiv Detail & Related papers (2020-09-08T07:29:05Z) - Projective Latent Interventions for Understanding and Fine-tuning
Classifiers [5.539383380453129]
We present Projective Latent Interventions (PLIs), a technique for retraining classifiers by back-propagating manual changes made to low-dimensional embeddings of the latent space.
PLIs allow domain experts to control the latent decision space in an intuitive way in order to better match their expectations.
We evaluate our technique on a real-world scenario in fetal ultrasound imaging.
arXiv Detail & Related papers (2020-06-23T11:12:25Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.