Open-radiomics: A Collection of Standardized Datasets and a Technical
Protocol for Reproducible Radiomics Machine Learning Pipelines
- URL: http://arxiv.org/abs/2207.14776v2
- Date: Tue, 24 Oct 2023 18:41:44 GMT
- Title: Open-radiomics: A Collection of Standardized Datasets and a Technical
Protocol for Reproducible Radiomics Machine Learning Pipelines
- Authors: Khashayar Namdar, Matthias W. Wagner, Birgit B. Ertl-Wagner, Farzad
Khalvati
- Abstract summary: We introduce open-radiomics, a set of radiomics datasets and a comprehensive radiomics pipeline.
Experiments are conducted on BraTS 2020 open-source Magnetic Resonance Imaging (MRI) dataset.
Unlike binWidth and image normalization, tumor subregion and imaging sequence significantly affected performance of the models.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Purpose: As an important branch of machine learning pipelines in medical
imaging, radiomics faces two major challenges namely reproducibility and
accessibility. In this work, we introduce open-radiomics, a set of radiomics
datasets along with a comprehensive radiomics pipeline based on our proposed
technical protocol to investigate the effects of radiomics feature extraction
on the reproducibility of the results.
Materials and Methods: Experiments are conducted on BraTS 2020 open-source
Magnetic Resonance Imaging (MRI) dataset that includes 369 adult patients with
brain tumors (76 low-grade glioma (LGG), and 293 high-grade glioma (HGG)).
Using PyRadiomics library for LGG vs. HGG classification, 288 radiomics
datasets are formed; the combinations of 4 MRI sequences, 3 binWidths, 6 image
normalization methods, and 4 tumor subregions.
Random Forest classifiers were used, and for each radiomics dataset the
training-validation-test (60%/20%/20%) experiment with different data splits
and model random states was repeated 100 times (28,800 test results) and Area
Under Receiver Operating Characteristic Curve (AUC) was calculated.
Results: Unlike binWidth and image normalization, tumor subregion and imaging
sequence significantly affected performance of the models. T1 contrast-enhanced
sequence and the union of necrotic and the non-enhancing tumor core subregions
resulted in the highest AUCs (average test AUC 0.951, 95% confidence interval
of (0.949, 0.952)). Although 28 settings and data splits yielded test AUC of 1,
they were irreproducible.
Conclusion: Our experiments demonstrate the sources of variability in
radiomics pipelines (e.g., tumor subregion) can have a significant impact on
the results, which may lead to superficial perfect performances that are
irreproducible.
Related papers
- CXR-LLAVA: a multimodal large language model for interpreting chest
X-ray images [3.0757789554622597]
This study aimed to develop an open-source multimodal large language model (CXR-LLAVA) for interpreting chest X-ray images (CXRs)
For training, we collected 592,580 publicly available CXRs, of which 374,881 had labels for certain radiographic abnormalities.
The model's diagnostic performance for major pathological findings was evaluated, along with the acceptability of radiologic reports by human radiologists.
arXiv Detail & Related papers (2023-10-22T06:22:37Z) - ChatRadio-Valuer: A Chat Large Language Model for Generalizable
Radiology Report Generation Based on Multi-institution and Multi-system Data [115.0747462486285]
ChatRadio-Valuer is a tailored model for automatic radiology report generation that learns generalizable representations.
The clinical dataset utilized in this study encompasses a remarkable total of textbf332,673 observations.
ChatRadio-Valuer consistently outperforms state-of-the-art models, especially ChatGPT (GPT-3.5-Turbo) and GPT-4 et al.
arXiv Detail & Related papers (2023-10-08T17:23:17Z) - Radiology-Llama2: Best-in-Class Large Language Model for Radiology [71.27700230067168]
This paper introduces Radiology-Llama2, a large language model specialized for radiology through a process known as instruction tuning.
Quantitative evaluations using ROUGE metrics on the MIMIC-CXR and OpenI datasets demonstrate that Radiology-Llama2 achieves state-of-the-art performance.
arXiv Detail & Related papers (2023-08-29T17:44:28Z) - Brain Tumor Recurrence vs. Radiation Necrosis Classification and Patient
Survivability Prediction [0.0]
GBM is the most aggressive brain tumor in adults that has a short survival rate even after aggressive treatment with surgery and radiation therapy.
The changes on magnetic resonance imaging (MRI) for patients with GBM after radiotherapy are indicative of radiation-induced necrosis (RN) or recurrent brain tumor (rBT)
This study proposes computational modeling with statistically rigorous repeated random sub-sampling to balance the subset sample size for rBT and RN classification.
arXiv Detail & Related papers (2023-06-05T21:39:11Z) - MRI-based classification of IDH mutation and 1p/19q codeletion status of
gliomas using a 2.5D hybrid multi-task convolutional neural network [0.18374319565577152]
Isocitrate dehydrogenase mutation and 1p/19q codeletion status are important prognostic markers for glioma.
Our goal was to develop artificial intelligence-based methods to non-invasively determine these molecular alterations from MRI.
A 2.5D hybrid convolutional neural network was proposed to simultaneously localize the tumor and classify its molecular status.
arXiv Detail & Related papers (2022-10-07T18:46:39Z) - Deep Learning for Classification of Thyroid Nodules on Ultrasound:
Validation on an Independent Dataset [7.4674725823899175]
The purpose is to apply a previously validated deep learning algorithm to a new thyroid ultrasound image dataset.
The relative performance difference between the algorithm and the radiologists is not significantly affected by the difference of ultrasound scanner.
arXiv Detail & Related papers (2022-07-27T19:45:41Z) - StRegA: Unsupervised Anomaly Detection in Brain MRIs using a Compact
Context-encoding Variational Autoencoder [48.2010192865749]
Unsupervised anomaly detection (UAD) can learn a data distribution from an unlabelled dataset of healthy subjects and then be applied to detect out of distribution samples.
This research proposes a compact version of the "context-encoding" VAE (ceVAE) model, combined with pre and post-processing steps, creating a UAD pipeline (StRegA)
The proposed pipeline achieved a Dice score of 0.642$pm$0.101 while detecting tumours in T2w images of the BraTS dataset and 0.859$pm$0.112 while detecting artificially induced anomalies.
arXiv Detail & Related papers (2022-01-31T14:27:35Z) - Event-based clinical findings extraction from radiology reports with
pre-trained language model [0.22940141855172028]
We present a new corpus of radiology reports annotated with clinical findings.
The gold standard corpus contained a total of 500 annotated computed tomography (CT) reports.
We extracted triggers and argument entities using two state-of-the-art deep learning architectures, including BERT.
arXiv Detail & Related papers (2021-12-27T05:03:10Z) - FLANNEL: Focal Loss Based Neural Network Ensemble for COVID-19 Detection [61.04937460198252]
We construct the X-ray imaging data from 2874 patients with four classes: normal, bacterial pneumonia, non-COVID-19 viral pneumonia, and COVID-19.
To identify COVID-19, we propose a Focal Loss Based Neural Ensemble Network (FLANNEL)
FLANNEL consistently outperforms baseline models on COVID-19 identification task in all metrics.
arXiv Detail & Related papers (2020-10-30T03:17:31Z) - A multicenter study on radiomic features from T$_2$-weighted images of a
customized MR pelvic phantom setting the basis for robust radiomic models in
clinics [47.187609203210705]
2D and 3D T$$-weighted images of a pelvic phantom were acquired on three scanners.
repeatability and repositioning of radiomic features were assessed.
arXiv Detail & Related papers (2020-05-14T09:24:48Z) - Machine-Learning-Based Multiple Abnormality Prediction with Large-Scale
Chest Computed Tomography Volumes [64.21642241351857]
We curated and analyzed a chest computed tomography (CT) data set of 36,316 volumes from 19,993 unique patients.
We developed a rule-based method for automatically extracting abnormality labels from free-text radiology reports.
We also developed a model for multi-organ, multi-disease classification of chest CT volumes.
arXiv Detail & Related papers (2020-02-12T00:59:23Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.