Benchmarking Dependence Measures to Prevent Shortcut Learning in Medical Imaging
- URL: http://arxiv.org/abs/2407.18792v2
- Date: Mon, 29 Jul 2024 09:05:17 GMT
- Title: Benchmarking Dependence Measures to Prevent Shortcut Learning in Medical Imaging
- Authors: Sarah Müller, Louisa Fay, Lisa M. Koch, Sergios Gatidis, Thomas Küstner, Philipp Berens,
- Abstract summary: Medical imaging cohorts are often confounded by factors such as acquisition devices, hospital sites, patient backgrounds, and many more.
Deep learning models tend to learn spurious correlations instead of causally related features, limiting their generalizability to new and unseen data.
This problem can be addressed by minimizing dependence measures between intermediate representations of task-related and non-task-related variables.
- Score: 7.917858344544847
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Medical imaging cohorts are often confounded by factors such as acquisition devices, hospital sites, patient backgrounds, and many more. As a result, deep learning models tend to learn spurious correlations instead of causally related features, limiting their generalizability to new and unseen data. This problem can be addressed by minimizing dependence measures between intermediate representations of task-related and non-task-related variables. These measures include mutual information, distance correlation, and the performance of adversarial classifiers. Here, we benchmark such dependence measures for the task of preventing shortcut learning. We study a simplified setting using Morpho-MNIST and a medical imaging task with CheXpert chest radiographs. Our results provide insights into how to mitigate confounding factors in medical imaging.
Related papers
- Overcoming Dimensional Collapse in Self-supervised Contrastive Learning
for Medical Image Segmentation [2.6764957223405657]
We investigate the application of contrastive learning to the domain of medical image analysis.
Our findings reveal that MoCo v2, a state-of-the-art contrastive learning method, encounters dimensional collapse when applied to medical images.
To address this, we propose two key contributions: local feature learning and feature decorrelation.
arXiv Detail & Related papers (2024-02-22T15:02:13Z) - Instrumental Variable Learning for Chest X-ray Classification [52.68170685918908]
We propose an interpretable instrumental variable (IV) learning framework to eliminate the spurious association and obtain accurate causal representation.
Our approach's performance is demonstrated using the MIMIC-CXR, NIH ChestX-ray 14, and CheXpert datasets.
arXiv Detail & Related papers (2023-05-20T03:12:23Z) - X-TRA: Improving Chest X-ray Tasks with Cross-Modal Retrieval
Augmentation [14.375693586801338]
We apply multi-modal retrieval augmentation to several tasks in chest X-ray analysis.
Vision and language modalities are aligned using a pre-trained CLIP model.
Non-parametric retrieval index reaches state-of-the-art retrieval levels.
arXiv Detail & Related papers (2023-02-22T12:53:33Z) - MedFACT: Modeling Medical Feature Correlations in Patient Health
Representation Learning via Feature Clustering [20.68759679109556]
In this paper, we propose a general patient health representation learning framework MedFACT.
We estimate correlations via measuring similarity between temporal patterns of medical features with kernel methods, and cluster features with strong correlations into groups.
We employ graph convolutional networks to conduct group-wise feature interactions for better representation learning.
arXiv Detail & Related papers (2022-04-21T10:27:24Z) - Multi-Task Multi-Scale Learning For Outcome Prediction in 3D PET Images [4.234843176066354]
We propose a multi-task learning framework to predict patient's survival and response.
Our model was tested and validated for treatment response and survival in lung and esophageal cancers.
arXiv Detail & Related papers (2022-03-01T17:30:28Z) - Solving Inverse Problems in Medical Imaging with Score-Based Generative
Models [87.48867245544106]
Reconstructing medical images from partial measurements is an important inverse problem in Computed Tomography (CT) and Magnetic Resonance Imaging (MRI)
Existing solutions based on machine learning typically train a model to directly map measurements to medical images.
We propose a fully unsupervised technique for inverse problem solving, leveraging the recently introduced score-based generative models.
arXiv Detail & Related papers (2021-11-15T05:41:12Z) - On the Robustness of Pretraining and Self-Supervision for a Deep
Learning-based Analysis of Diabetic Retinopathy [70.71457102672545]
We compare the impact of different training procedures for diabetic retinopathy grading.
We investigate different aspects such as quantitative performance, statistics of the learned feature representations, interpretability and robustness to image distortions.
Our results indicate that models from ImageNet pretraining report a significant increase in performance, generalization and robustness to image distortions.
arXiv Detail & Related papers (2021-06-25T08:32:45Z) - Impact of individual rater style on deep learning uncertainty in medical
imaging segmentation [0.0]
This study quantifies rater style in the form of bias and consistency and explores their impacts when used to train deep learning models.
Two multi-rater public datasets were used, consisting of brain multiple sclerosis lesion and spinal cord grey matter segmentation.
arXiv Detail & Related papers (2021-05-05T17:11:18Z) - Evaluation of Complexity Measures for Deep Learning Generalization in
Medical Image Analysis [77.34726150561087]
PAC-Bayes flatness-based and path norm-based measures produce the most consistent explanation for the combination of models and data.
We also investigate the use of multi-task classification and segmentation approach for breast images.
arXiv Detail & Related papers (2021-03-04T20:58:22Z) - Few-shot Medical Image Segmentation using a Global Correlation Network
with Discriminative Embedding [60.89561661441736]
We propose a novel method for few-shot medical image segmentation.
We construct our few-shot image segmentor using a deep convolutional network trained episodically.
We enhance discriminability of deep embedding to encourage clustering of the feature domains of the same class.
arXiv Detail & Related papers (2020-12-10T04:01:07Z) - Memory-Augmented Relation Network for Few-Shot Learning [114.47866281436829]
In this work, we investigate a new metric-learning method, Memory-Augmented Relation Network (MRN)
In MRN, we choose the samples that are visually similar from the working context, and perform weighted information propagation to attentively aggregate helpful information from chosen ones to enhance its representation.
We empirically demonstrate that MRN yields significant improvement over its ancestor and achieves competitive or even better performance when compared with other few-shot learning approaches.
arXiv Detail & Related papers (2020-05-09T10:09:13Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.