Detecting Shortcuts in Medical Images -- A Case Study in Chest X-rays
- URL: http://arxiv.org/abs/2211.04279v2
- Date: Wed, 9 Nov 2022 12:06:12 GMT
- Title: Detecting Shortcuts in Medical Images -- A Case Study in Chest X-rays
- Authors: Amelia Jim\'enez-S\'anchez, Dovile Juodelyte, Bethany Chamberlain,
Veronika Cheplygina
- Abstract summary: We present a case study on chest X-rays using two publicly available datasets.
We share annotations for a subset of pneumothorax images with drains.
- Score: 0.22940141855172028
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The availability of large public datasets and the increased amount of
computing power have shifted the interest of the medical community to
high-performance algorithms. However, little attention is paid to the quality
of the data and their annotations. High performance on benchmark datasets may
be reported without considering possible shortcuts or artifacts in the data,
besides, models are not tested on subpopulation groups. With this work, we aim
to raise awareness about shortcuts problems. We validate previous findings, and
present a case study on chest X-rays using two publicly available datasets. We
share annotations for a subset of pneumothorax images with drains. We conclude
with general recommendations for medical image classification.
Related papers
- Multi-task Explainable Skin Lesion Classification [54.76511683427566]
We propose a few-shot-based approach for skin lesions that generalizes well with few labelled data.
The proposed approach comprises a fusion of a segmentation network that acts as an attention module and classification network.
arXiv Detail & Related papers (2023-10-11T05:49:47Z) - Augmenting Chest X-ray Datasets with Non-Expert Annotations [1.9991771189143435]
A popular and cost-effective approach is automated annotation extraction from free-text medical reports.
We enhance two publicly available chest X-ray datasets by incorporating non-expert annotations.
We train a chest drain detector with the non-expert annotations that generalizes well to expert labels.
arXiv Detail & Related papers (2023-09-05T13:52:43Z) - Generation of Anonymous Chest Radiographs Using Latent Diffusion Models
for Training Thoracic Abnormality Classification Systems [7.909848251752742]
Biometric identifiers in chest radiographs hinder the public sharing of such data for research purposes.
This work employs a latent diffusion model to synthesize an anonymous chest X-ray dataset of high-quality class-conditional images.
arXiv Detail & Related papers (2022-11-02T17:43:02Z) - Dataset Distillation for Medical Dataset Sharing [38.65823547986758]
dataset distillation can synthesize a small dataset such that models trained on it achieve comparable performance with the original large dataset.
Experimental results on a COVID-19 chest X-ray image dataset show that our method can achieve high detection performance even using scarce anonymized chest X-ray images.
arXiv Detail & Related papers (2022-09-29T07:49:20Z) - Long-Tailed Classification of Thorax Diseases on Chest X-Ray: A New
Benchmark Study [75.05049024176584]
We present a benchmark study of the long-tailed learning problem in the specific domain of thorax diseases on chest X-rays.
We focus on learning from naturally distributed chest X-ray data, optimizing classification accuracy over not only the common "head" classes, but also the rare yet critical "tail" classes.
The benchmark consists of two chest X-ray datasets for 19- and 20-way thorax disease classification, containing classes with as many as 53,000 and as few as 7 labeled training images.
arXiv Detail & Related papers (2022-08-29T04:34:15Z) - Preservation of High Frequency Content for Deep Learning-Based Medical
Image Classification [74.84221280249876]
An efficient analysis of large amounts of chest radiographs can aid physicians and radiologists.
We propose a novel Discrete Wavelet Transform (DWT)-based method for the efficient identification and encoding of visual information.
arXiv Detail & Related papers (2022-05-08T15:29:54Z) - Contrastive Attention for Automatic Chest X-ray Report Generation [124.60087367316531]
In most cases, the normal regions dominate the entire chest X-ray image, and the corresponding descriptions of these normal regions dominate the final report.
We propose Contrastive Attention (CA) model, which compares the current input image with normal images to distill the contrastive information.
We achieve the state-of-the-art results on the two public datasets.
arXiv Detail & Related papers (2021-06-13T11:20:31Z) - Learning Invariant Feature Representation to Improve Generalization
across Chest X-ray Datasets [55.06983249986729]
We show that a deep learning model performing well when tested on the same dataset as training data starts to perform poorly when it is tested on a dataset from a different source.
By employing an adversarial training strategy, we show that a network can be forced to learn a source-invariant representation.
arXiv Detail & Related papers (2020-08-04T07:41:15Z) - Deep Mining External Imperfect Data for Chest X-ray Disease Screening [57.40329813850719]
We argue that incorporating an external CXR dataset leads to imperfect training data, which raises the challenges.
We formulate the multi-label disease classification problem as weighted independent binary tasks according to the categories.
Our framework simultaneously models and tackles the domain and label discrepancies, enabling superior knowledge mining ability.
arXiv Detail & Related papers (2020-06-06T06:48:40Z) - Semi-supervised lung nodule retrieval [2.055949720959582]
A content based image retrieval (CBIR) system provides as its output a set of images, ranked by similarity to the query image.
Ground truth on similarity between dataset elements (e.g. between nodules) is not readily available, thus greatly challenging machine learning methods.
The current study suggests a semi-supervised approach that involves two steps: 1) Automatic annotation of a given partially labeled dataset; 2) Learning a semantic similarity metric space based on the predicated annotations.
The proposed system is demonstrated in lung nodule retrieval using the LIDC dataset, and shows that it is feasible to learn embedding from predicted ratings.
arXiv Detail & Related papers (2020-05-04T19:26:14Z) - Unsupervised Anomaly Detection for X-Ray Images [4.353258086186526]
We investigate how unsupervised methods trained on images without anomalies can be used to assist doctors in evaluating X-ray images of hands.
We adopt state-of-the-art approaches for unsupervised learning to detect anomalies and show how the outputs of these methods can be explained.
We provide an extensive evaluation of different approaches and demonstrate empirically that even without labels it is possible to achieve satisfying results on a real-world dataset of X-ray images of hands.
arXiv Detail & Related papers (2020-01-29T15:14:56Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.