Virtual vs. Reality: External Validation of COVID-19 Classifiers using
XCAT Phantoms for Chest Computed Tomography
- URL: http://arxiv.org/abs/2203.03074v1
- Date: Mon, 7 Mar 2022 00:11:53 GMT
- Title: Virtual vs. Reality: External Validation of COVID-19 Classifiers using
XCAT Phantoms for Chest Computed Tomography
- Authors: Fakrul Islam Tushar, Ehsan Abadi, Saman Sotoudeh-Paima, Rafael B.
Fricks, Maciej A. Mazurowski, W. Paul Segars, Ehsan Samei, Joseph Y. Lo
- Abstract summary: We created the CVIT-COVID dataset including 180 virtually imaged computed tomography (CT) images from simulated COVID-19 and normal phantom models.
We evaluated the performance of an open-source, deep-learning model from the University of Waterloo trained with multi-institutional data.
We validated the model's performance against open clinical data of 305 CT images to understand virtual vs. real clinical data performance.
- Score: 2.924350993741562
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: Research studies of artificial intelligence models in medical imaging have
been hampered by poor generalization. This problem has been especially
concerning over the last year with numerous applications of deep learning for
COVID-19 diagnosis. Virtual imaging trials (VITs) could provide a solution for
objective evaluation of these models. In this work utilizing the VITs, we
created the CVIT-COVID dataset including 180 virtually imaged computed
tomography (CT) images from simulated COVID-19 and normal phantom models under
different COVID-19 morphology and imaging properties. We evaluated the
performance of an open-source, deep-learning model from the University of
Waterloo trained with multi-institutional data and an in-house model trained
with the open clinical dataset called MosMed. We further validated the model's
performance against open clinical data of 305 CT images to understand virtual
vs. real clinical data performance. The open-source model was published with
nearly perfect performance on the original Waterloo dataset but showed a
consistent performance drop in external testing on another clinical dataset
(AUC=0.77) and our simulated CVIT-COVID dataset (AUC=0.55). The in-house model
achieved an AUC of 0.87 while testing on the internal test set (MosMed test
set). However, performance dropped to an AUC of 0.65 and 0.69 when evaluated on
clinical and our simulated CVIT-COVID dataset. The VIT framework offered
control over imaging conditions, allowing us to show there was no change in
performance as CT exposure was changed from 28.5 to 57 mAs. The VIT framework
also provided voxel-level ground truth, revealing that performance of in-house
model was much higher at AUC=0.87 for diffuse COVID-19 infection size >2.65%
lung volume versus AUC=0.52 for focal disease with <2.65% volume. The virtual
imaging framework enabled these uniquely rigorous analyses of model
performance.
Related papers
- Virtual imaging trials improved the transparency and reliability of AI systems in COVID-19 imaging [1.6040478776985583]
This study focuses on using convolutional neural networks (CNNs) for COVID-19 diagnosis using computed tomography (CT) and chest radiography (CXR)
We developed and tested multiple AI models, 3D ResNet-like and 2D EfficientNetv2 architectures, across diverse datasets.
Models trained on the most diverse datasets showed the highest external testing performance, with AUC values ranging from 0.73 to 0.76 for CT and 0.70 to 0.73 for CXR.
arXiv Detail & Related papers (2023-08-17T19:12:32Z) - Evaluating Generalizability of Deep Learning Models Using
Indian-COVID-19 CT Dataset [5.398550081886242]
ma-chine learning (ML) approaches for automatic processing of CT scan images in clinical setting are trained on limited and biased sub-sets of publicly available COVID-19 data.
This has raised concerns regarding the generalizability of these models on external datasets, not seen by the model during training.
To address some of these issues, in this work CT scan images from confirmed COVID-19 data obtained from one of the largest public repositories, COVIDx CT 2A were used for training and internal vali-dation of machine learning models.
arXiv Detail & Related papers (2022-12-28T16:23:18Z) - A Generalizable Artificial Intelligence Model for COVID-19
Classification Task Using Chest X-ray Radiographs: Evaluated Over Four
Clinical Datasets with 15,097 Patients [6.209420804714487]
The generalizability of the trained model was retrospectively evaluated using four different real-world clinical datasets.
The AI model trained using a single-source clinical dataset achieved an AUC of 0.82 when applied to the internal temporal test set.
An AUC of 0.79 was achieved when applied to a multi-institutional COVID-19 dataset collected by the Medical Imaging and Data Resource Center.
arXiv Detail & Related papers (2022-10-04T04:12:13Z) - Robust Automated Framework for COVID-19 Disease Identification from a
Multicenter Dataset of Chest CT Scans [27.29759500174996]
We show that our proposed model is trained on a relatively small dataset acquired from only one imaging center using a specific scanning protocol.
We also showed that the model can be updated via an unsupervised approach to cope with the data shift between the train and test sets.
Experimental results indicate that our proposed framework performs well on all test sets achieving total accuracy of 96.15%.
arXiv Detail & Related papers (2021-09-19T22:32:55Z) - The Report on China-Spain Joint Clinical Testing for Rapid COVID-19 Risk
Screening by Eye-region Manifestations [59.48245489413308]
We developed and tested a COVID-19 rapid prescreening model using the eye-region images captured in China and Spain with cellphone cameras.
The performance was measured using area under receiver-operating-characteristic curve (AUC), sensitivity, specificity, accuracy, and F1.
arXiv Detail & Related papers (2021-09-18T02:28:01Z) - The pitfalls of using open data to develop deep learning solutions for
COVID-19 detection in chest X-rays [64.02097860085202]
Deep learning models have been developed to identify COVID-19 from chest X-rays.
Results have been exceptional when training and testing on open-source data.
Data analysis and model evaluations show that the popular open-source dataset COVIDx is not representative of the real clinical problem.
arXiv Detail & Related papers (2021-09-14T10:59:11Z) - Deep learning-based COVID-19 pneumonia classification using chest CT
images: model generalizability [54.86482395312936]
Deep learning (DL) classification models were trained to identify COVID-19-positive patients on 3D computed tomography (CT) datasets from different countries.
We trained nine identical DL-based classification models by using combinations of the datasets with a 72% train, 8% validation, and 20% test data split.
The models trained on multiple datasets and evaluated on a test set from one of the datasets used for training performed better.
arXiv Detail & Related papers (2021-02-18T21:14:52Z) - FLANNEL: Focal Loss Based Neural Network Ensemble for COVID-19 Detection [61.04937460198252]
We construct the X-ray imaging data from 2874 patients with four classes: normal, bacterial pneumonia, non-COVID-19 viral pneumonia, and COVID-19.
To identify COVID-19, we propose a Focal Loss Based Neural Ensemble Network (FLANNEL)
FLANNEL consistently outperforms baseline models on COVID-19 identification task in all metrics.
arXiv Detail & Related papers (2020-10-30T03:17:31Z) - Classification of COVID-19 in CT Scans using Multi-Source Transfer
Learning [91.3755431537592]
We propose the use of Multi-Source Transfer Learning to improve upon traditional Transfer Learning for the classification of COVID-19 from CT scans.
With our multi-source fine-tuning approach, our models outperformed baseline models fine-tuned with ImageNet.
Our best performing model was able to achieve an accuracy of 0.893 and a Recall score of 0.897, outperforming its baseline Recall score by 9.3%.
arXiv Detail & Related papers (2020-09-22T11:53:06Z) - COVIDNet-CT: A Tailored Deep Convolutional Neural Network Design for
Detection of COVID-19 Cases from Chest CT Images [75.74756992992147]
We introduce COVIDNet-CT, a deep convolutional neural network architecture that is tailored for detection of COVID-19 cases from chest CT images.
We also introduce COVIDx-CT, a benchmark CT image dataset derived from CT imaging data collected by the China National Center for Bioinformation.
arXiv Detail & Related papers (2020-09-08T15:49:55Z) - Intra-model Variability in COVID-19 Classification Using Chest X-ray
Images [0.0]
We quantify baseline performance metrics and variability for COVID-19 detection in chest x-ray for 12 common deep learning architectures.
Best performing models achieve a false negative rate of 3 out of 20 for detecting COVID-19 in a hold-out set.
arXiv Detail & Related papers (2020-04-30T21:20:32Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.