Long-Tailed Classification of Thorax Diseases on Chest X-Ray: A New
Benchmark Study
- URL: http://arxiv.org/abs/2208.13365v1
- Date: Mon, 29 Aug 2022 04:34:15 GMT
- Title: Long-Tailed Classification of Thorax Diseases on Chest X-Ray: A New
Benchmark Study
- Authors: Gregory Holste, Song Wang, Ziyu Jiang, Thomas C. Shen, George Shih,
Ronald M. Summers, Yifan Peng, Zhangyang Wang
- Abstract summary: We present a benchmark study of the long-tailed learning problem in the specific domain of thorax diseases on chest X-rays.
We focus on learning from naturally distributed chest X-ray data, optimizing classification accuracy over not only the common "head" classes, but also the rare yet critical "tail" classes.
The benchmark consists of two chest X-ray datasets for 19- and 20-way thorax disease classification, containing classes with as many as 53,000 and as few as 7 labeled training images.
- Score: 75.05049024176584
- License: http://creativecommons.org/licenses/by-sa/4.0/
- Abstract: Imaging exams, such as chest radiography, will yield a small set of common
findings and a much larger set of uncommon findings. While a trained
radiologist can learn the visual presentation of rare conditions by studying a
few representative examples, teaching a machine to learn from such a
"long-tailed" distribution is much more difficult, as standard methods would be
easily biased toward the most frequent classes. In this paper, we present a
comprehensive benchmark study of the long-tailed learning problem in the
specific domain of thorax diseases on chest X-rays. We focus on learning from
naturally distributed chest X-ray data, optimizing classification accuracy over
not only the common "head" classes, but also the rare yet critical "tail"
classes. To accomplish this, we introduce a challenging new long-tailed chest
X-ray benchmark to facilitate research on developing long-tailed learning
methods for medical image classification. The benchmark consists of two chest
X-ray datasets for 19- and 20-way thorax disease classification, containing
classes with as many as 53,000 and as few as 7 labeled training images. We
evaluate both standard and state-of-the-art long-tailed learning methods on
this new benchmark, analyzing which aspects of these methods are most
beneficial for long-tailed medical image classification and summarizing
insights for future algorithm design. The datasets, trained models, and code
are available at https://github.com/VITA-Group/LongTailCXR.
Related papers
- Position-Guided Prompt Learning for Anomaly Detection in Chest X-Rays [46.78926066405227]
Anomaly detection in chest X-rays is a critical task.
Recently, CLIP-based methods, pre-trained on a large number of medical images, have shown impressive performance on zero/few-shot downstream tasks.
We propose a position-guided prompt learning method to adapt the task data to the frozen CLIP-based model.
arXiv Detail & Related papers (2024-05-20T12:11:41Z) - Towards long-tailed, multi-label disease classification from chest X-ray: Overview of the CXR-LT challenge [59.323306639144526]
Many real-world image recognition problems, such as diagnostic medical imaging exams, are emerging.
Diagnose is both a long-tailed and multi-label problem, as patients often present with multiple findings.
We synthesize common themes, providing recommendations for long-tailed, multi-label medical image classification.
arXiv Detail & Related papers (2023-10-24T18:26:22Z) - Transfer learning method in the problem of binary classification of
chest X-rays [0.0]
High-precision and rapid detection of pathologies on chest X-rays makes it possible to detect the development of pneumonia at an early stage and begin immediate treatment.
Artificial intelligence can speed up and qualitatively improve the procedure of X-ray analysis and give recommendations to the doctor for additional consideration of suspicious images.
arXiv Detail & Related papers (2023-03-19T08:35:47Z) - AnaXNet: Anatomy Aware Multi-label Finding Classification in Chest X-ray [9.087789790647786]
We propose a novel multi-label chest X-ray classification model that accurately classifies the image finding and also localizes the findings to their correct anatomical regions.
The latter utilizes graph convolutional networks, which enable our model to learn not only the label dependency but also the relationship between the anatomical regions in the chest X-ray.
arXiv Detail & Related papers (2021-05-20T17:58:02Z) - Relational Subsets Knowledge Distillation for Long-tailed Retinal
Diseases Recognition [65.77962788209103]
We propose class subset learning by dividing the long-tailed data into multiple class subsets according to prior knowledge.
It enforces the model to focus on learning the subset-specific knowledge.
The proposed framework proved to be effective for the long-tailed retinal diseases recognition task.
arXiv Detail & Related papers (2021-04-22T13:39:33Z) - Many-to-One Distribution Learning and K-Nearest Neighbor Smoothing for
Thoracic Disease Identification [83.6017225363714]
deep learning has become the most powerful computer-aided diagnosis technology for improving disease identification performance.
For chest X-ray imaging, annotating large-scale data requires professional domain knowledge and is time-consuming.
In this paper, we propose many-to-one distribution learning (MODL) and K-nearest neighbor smoothing (KNNS) methods to improve a single model's disease identification performance.
arXiv Detail & Related papers (2021-02-26T02:29:30Z) - Extracting and Learning Fine-Grained Labels from Chest Radiographs [0.157292030677369]
We focus on extracting and learning fine-grained labels for chest X-ray images.
A total of 457 fine-grained labels depicting the largest spectrum of findings to date were selected.
We show results that indicate a highly accurate label extraction process and a reliable learning of fine-grained labels.
arXiv Detail & Related papers (2020-11-18T19:56:08Z) - Learning Invariant Feature Representation to Improve Generalization
across Chest X-ray Datasets [55.06983249986729]
We show that a deep learning model performing well when tested on the same dataset as training data starts to perform poorly when it is tested on a dataset from a different source.
By employing an adversarial training strategy, we show that a network can be forced to learn a source-invariant representation.
arXiv Detail & Related papers (2020-08-04T07:41:15Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.