Robust Classification from Noisy Labels: Integrating Additional
Knowledge for Chest Radiography Abnormality Assessment
- URL: http://arxiv.org/abs/2104.05261v2
- Date: Wed, 14 Apr 2021 19:43:52 GMT
- Title: Robust Classification from Noisy Labels: Integrating Additional
Knowledge for Chest Radiography Abnormality Assessment
- Authors: Sebastian G\"undel, Arnaud A. A. Setio, Florin C. Ghesu, Sasa Grbic,
Bogdan Georgescu, Andreas Maier, Dorin Comaniciu
- Abstract summary: The introduction of large-scale public datasets has led to a series of novel systems for automated abnormality classification.
We propose novel training strategies that handle label noise from such suboptimal data.
With an average AUC score of 0.880 across all abnormalities, our proposed training strategies can be used to significantly improve performance scores.
- Score: 14.631388658828921
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Chest radiography is the most common radiographic examination performed in
daily clinical practice for the detection of various heart and lung
abnormalities. The large amount of data to be read and reported, with more than
100 studies per day for a single radiologist, poses a challenge in consistently
maintaining high interpretation accuracy. The introduction of large-scale
public datasets has led to a series of novel systems for automated abnormality
classification. However, the labels of these datasets were obtained using
natural language processed medical reports, yielding a large degree of label
noise that can impact the performance. In this study, we propose novel training
strategies that handle label noise from such suboptimal data. Prior label
probabilities were measured on a subset of training data re-read by 4
board-certified radiologists and were used during training to increase the
robustness of the training model to the label noise. Furthermore, we exploit
the high comorbidity of abnormalities observed in chest radiography and
incorporate this information to further reduce the impact of label noise.
Additionally, anatomical knowledge is incorporated by training the system to
predict lung and heart segmentation, as well as spatial knowledge labels. To
deal with multiple datasets and images derived from various scanners that apply
different post-processing techniques, we introduce a novel image normalization
strategy. Experiments were performed on an extensive collection of 297,541
chest radiographs from 86,876 patients, leading to a state-of-the-art
performance level for 17 abnormalities from 2 datasets. With an average AUC
score of 0.880 across all abnormalities, our proposed training strategies can
be used to significantly improve performance scores.
Related papers
- The role of noise in denoising models for anomaly detection in medical
images [62.0532151156057]
Pathological brain lesions exhibit diverse appearance in brain images.
Unsupervised anomaly detection approaches have been proposed using only normal data for training.
We show that optimization of the spatial resolution and magnitude of the noise improves the performance of different model training regimes.
arXiv Detail & Related papers (2023-01-19T21:39:38Z) - Significantly improving zero-shot X-ray pathology classification via fine-tuning pre-trained image-text encoders [50.689585476660554]
We propose a new fine-tuning strategy that includes positive-pair loss relaxation and random sentence sampling.
Our approach consistently improves overall zero-shot pathology classification across four chest X-ray datasets and three pre-trained models.
arXiv Detail & Related papers (2022-12-14T06:04:18Z) - Robust Medical Image Classification from Noisy Labeled Data with Global
and Local Representation Guided Co-training [73.60883490436956]
We propose a novel collaborative training paradigm with global and local representation learning for robust medical image classification.
We employ the self-ensemble model with a noisy label filter to efficiently select the clean and noisy samples.
We also design a novel global and local representation learning scheme to implicitly regularize the networks to utilize noisy samples.
arXiv Detail & Related papers (2022-05-10T07:50:08Z) - Preservation of High Frequency Content for Deep Learning-Based Medical
Image Classification [74.84221280249876]
An efficient analysis of large amounts of chest radiographs can aid physicians and radiologists.
We propose a novel Discrete Wavelet Transform (DWT)-based method for the efficient identification and encoding of visual information.
arXiv Detail & Related papers (2022-05-08T15:29:54Z) - Segmentation-free Heart Pathology Detection Using Deep Learning [12.065014651638943]
We propose a novel segmentation-free heart sound classification method.
Specifically, we apply discrete wavelet transform to denoise the signal, followed by feature extraction and feature reduction.
Support Vector Machines and Deep Neural Networks are utilised for classification.
arXiv Detail & Related papers (2021-08-09T16:09:30Z) - Improving Medical Image Classification with Label Noise Using
Dual-uncertainty Estimation [72.0276067144762]
We discuss and define the two common types of label noise in medical images.
We propose an uncertainty estimation-based framework to handle these two label noise amid the medical image classification task.
arXiv Detail & Related papers (2021-02-28T14:56:45Z) - Multi-Label Annotation of Chest Abdomen Pelvis Computed Tomography Text
Reports Using Deep Learning [1.5701326192371183]
We developed a multi-label annotator for body Computed Tomography (CT) reports that can be applied to a variety of diseases, organs, and cases.
We used a dictionary approach to develop a rule-based algorithm for extraction of disease labels from radiology text reports.
An attention-guided recurrent neural network (RNN) was trained using the RBA-extracted labels to classify the reports as being positive for one or more diseases or normal for each organ system.
arXiv Detail & Related papers (2021-02-05T02:07:39Z) - Joint Modeling of Chest Radiographs and Radiology Reports for Pulmonary
Edema Assessment [39.60171837961607]
We develop a neural network model that is trained on both images and free-text to assess pulmonary edema severity from chest radiographs at inference time.
Our experimental results suggest that the joint image-text representation learning improves the performance of pulmonary edema assessment.
arXiv Detail & Related papers (2020-08-22T17:28:39Z) - Paying Per-label Attention for Multi-label Extraction from Radiology
Reports [1.9601378412924186]
We tackle the automated extraction of structured labels from head CT reports for imaging of suspected stroke patients.
We propose a set of 31 labels which correspond to radiographic findings and clinical impressions related to neurological abnormalities.
We are able to robustly extract many labels with a single model, classified according to the radiologist's reporting.
arXiv Detail & Related papers (2020-07-31T16:11:09Z) - Machine-Learning-Based Multiple Abnormality Prediction with Large-Scale
Chest Computed Tomography Volumes [64.21642241351857]
We curated and analyzed a chest computed tomography (CT) data set of 36,316 volumes from 19,993 unique patients.
We developed a rule-based method for automatically extracting abnormality labels from free-text radiology reports.
We also developed a model for multi-organ, multi-disease classification of chest CT volumes.
arXiv Detail & Related papers (2020-02-12T00:59:23Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.