Machine-Learning-Based Multiple Abnormality Prediction with Large-Scale
Chest Computed Tomography Volumes
- URL: http://arxiv.org/abs/2002.04752v3
- Date: Mon, 12 Oct 2020 23:57:17 GMT
- Title: Machine-Learning-Based Multiple Abnormality Prediction with Large-Scale
Chest Computed Tomography Volumes
- Authors: Rachel Lea Draelos, David Dov, Maciej A. Mazurowski, Joseph Y. Lo,
Ricardo Henao, Geoffrey D. Rubin, Lawrence Carin
- Abstract summary: We curated and analyzed a chest computed tomography (CT) data set of 36,316 volumes from 19,993 unique patients.
We developed a rule-based method for automatically extracting abnormality labels from free-text radiology reports.
We also developed a model for multi-organ, multi-disease classification of chest CT volumes.
- Score: 64.21642241351857
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Machine learning models for radiology benefit from large-scale data sets with
high quality labels for abnormalities. We curated and analyzed a chest computed
tomography (CT) data set of 36,316 volumes from 19,993 unique patients. This is
the largest multiply-annotated volumetric medical imaging data set reported. To
annotate this data set, we developed a rule-based method for automatically
extracting abnormality labels from free-text radiology reports with an average
F-score of 0.976 (min 0.941, max 1.0). We also developed a model for
multi-organ, multi-disease classification of chest CT volumes that uses a deep
convolutional neural network (CNN). This model reached a classification
performance of AUROC greater than 0.90 for 18 abnormalities, with an average
AUROC of 0.773 for all 83 abnormalities, demonstrating the feasibility of
learning from unfiltered whole volume CT data. We show that training on more
labels improves performance significantly: for a subset of 9 labels - nodule,
opacity, atelectasis, pleural effusion, consolidation, mass, pericardial
effusion, cardiomegaly, and pneumothorax - the model's average AUROC increased
by 10% when the number of training labels was increased from 9 to all 83. All
code for volume preprocessing, automated label extraction, and the volume
abnormality prediction model will be made publicly available. The 36,316 CT
volumes and labels will also be made publicly available pending institutional
approval.
Related papers
- Brain Tumor Classification on MRI in Light of Molecular Markers [61.77272414423481]
Co-deletion of the 1p/19q gene is associated with clinical outcomes in low-grade gliomas.
This study aims to utilize a specially MRI-based convolutional neural network for brain cancer detection.
arXiv Detail & Related papers (2024-09-29T07:04:26Z) - TotalSegmentator: robust segmentation of 104 anatomical structures in CT
images [48.50994220135258]
We present a deep learning segmentation model for body CT images.
The model can segment 104 anatomical structures relevant for use cases such as organ volumetry, disease characterization, and surgical or radiotherapy planning.
arXiv Detail & Related papers (2022-08-11T15:16:40Z) - Improving Disease Classification Performance and Explainability of Deep
Learning Models in Radiology with Heatmap Generators [0.0]
Three experiment sets were conducted with a U-Net architecture to improve the classification performance.
The greatest improvements were for the "pneumonia" and "CHF" classes, which the baseline model struggled most to classify.
arXiv Detail & Related papers (2022-06-28T13:03:50Z) - Preservation of High Frequency Content for Deep Learning-Based Medical
Image Classification [74.84221280249876]
An efficient analysis of large amounts of chest radiographs can aid physicians and radiologists.
We propose a novel Discrete Wavelet Transform (DWT)-based method for the efficient identification and encoding of visual information.
arXiv Detail & Related papers (2022-05-08T15:29:54Z) - Federated Learning Enables Big Data for Rare Cancer Boundary Detection [98.5549882883963]
We present findings from the largest Federated ML study to-date, involving data from 71 healthcare institutions across 6 continents.
We generate an automatic tumor boundary detector for the rare disease of glioblastoma.
We demonstrate a 33% improvement over a publicly trained model to delineate the surgically targetable tumor, and 23% improvement over the tumor's entire extent.
arXiv Detail & Related papers (2022-04-22T17:27:00Z) - Robust Classification from Noisy Labels: Integrating Additional
Knowledge for Chest Radiography Abnormality Assessment [14.631388658828921]
The introduction of large-scale public datasets has led to a series of novel systems for automated abnormality classification.
We propose novel training strategies that handle label noise from such suboptimal data.
With an average AUC score of 0.880 across all abnormalities, our proposed training strategies can be used to significantly improve performance scores.
arXiv Detail & Related papers (2021-04-12T07:51:07Z) - COVID-19 identification from volumetric chest CT scans using a
progressively resized 3D-CNN incorporating segmentation, augmentation, and
class-rebalancing [4.446085353384894]
COVID-19 is a global pandemic disease overgrowing worldwide.
Computer-aided screening tools with greater sensitivity is imperative for disease diagnosis and prognosis.
This article proposes a 3D Convolutional Neural Network (CNN)-based classification approach.
arXiv Detail & Related papers (2021-02-11T18:16:18Z) - Multi-Label Annotation of Chest Abdomen Pelvis Computed Tomography Text
Reports Using Deep Learning [1.5701326192371183]
We developed a multi-label annotator for body Computed Tomography (CT) reports that can be applied to a variety of diseases, organs, and cases.
We used a dictionary approach to develop a rule-based algorithm for extraction of disease labels from radiology text reports.
An attention-guided recurrent neural network (RNN) was trained using the RBA-extracted labels to classify the reports as being positive for one or more diseases or normal for each organ system.
arXiv Detail & Related papers (2021-02-05T02:07:39Z) - Deep Learning to Quantify Pulmonary Edema in Chest Radiographs [7.121765928263759]
We developed a machine learning model to classify the severity grades of pulmonary edema on chest radiographs.
Deep learning models were trained on a large chest radiograph dataset.
arXiv Detail & Related papers (2020-08-13T15:45:44Z) - Severity Assessment of Coronavirus Disease 2019 (COVID-19) Using
Quantitative Features from Chest CT Images [54.919022945740515]
The aim of this study is to realize automatic severity assessment (non-severe or severe) of COVID-19 based on chest CT images.
A random forest (RF) model is trained to assess the severity (non-severe or severe) based on quantitative features.
Several quantitative features, which have the potential to reflect the severity of COVID-19, were revealed.
arXiv Detail & Related papers (2020-03-26T15:49:32Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.