AI Models Close to your Chest: Robust Federated Learning Strategies for
Multi-site CT
- URL: http://arxiv.org/abs/2303.13567v2
- Date: Thu, 13 Apr 2023 21:28:21 GMT
- Title: AI Models Close to your Chest: Robust Federated Learning Strategies for
Multi-site CT
- Authors: Edward H. Lee, Brendan Kelly, Emre Altinmakas, Hakan Dogan, Maryam
Mohammadzadeh, Errol Colak, Steve Fu, Olivia Choudhury, Ujjwal Ratan, Felipe
Kitamura, Hernan Chaves, Jimmy Zheng, Mourad Said, Eduardo Reis, Jaekwang
Lim, Patricia Yokoo, Courtney Mitchell, Golnaz Houshmand, Marzyeh Ghassemi,
Ronan Killeen, Wendy Qiu, Joel Hayden, Farnaz Rafiee, Chad Klochko, Nicholas
Bevins, Faeze Sazgara, S. Simon Wong, Michael Moseley, Safwan Halabi, Kristen
W. Yeom
- Abstract summary: AI studies in medicine have largely focused on locoregional patient cohorts with less diverse data sources.
Federated learning (FL) is one potential pathway for AI development that enables learning across hospitals without data share.
We show the results of various FL strategies on one of the largest and most diverse COVID-19 chest CT datasets.
- Score: 3.0888799865182395
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: While it is well known that population differences from genetics, sex, race,
and environmental factors contribute to disease, AI studies in medicine have
largely focused on locoregional patient cohorts with less diverse data sources.
Such limitation stems from barriers to large-scale data share and ethical
concerns over data privacy. Federated learning (FL) is one potential pathway
for AI development that enables learning across hospitals without data share.
In this study, we show the results of various FL strategies on one of the
largest and most diverse COVID-19 chest CT datasets: 21 participating hospitals
across five continents that comprise >10,000 patients with >1 million images.
We also propose an FL strategy that leverages synthetically generated data to
overcome class and size imbalances. We also describe the sources of data
heterogeneity in the context of FL, and show how even among the correctly
labeled populations, disparities can arise due to these biases.
Related papers
- FedCVD: The First Real-World Federated Learning Benchmark on Cardiovascular Disease Data [52.55123685248105]
Cardiovascular diseases (CVDs) are currently the leading cause of death worldwide, highlighting the critical need for early diagnosis and treatment.
Machine learning (ML) methods can help diagnose CVDs early, but their performance relies on access to substantial data with high quality.
This paper presents the first real-world FL benchmark for cardiovascular disease detection, named FedCVD.
arXiv Detail & Related papers (2024-10-28T02:24:01Z) - Future-Proofing Medical Imaging with Privacy-Preserving Federated Learning and Uncertainty Quantification: A Review [14.88874727211064]
AI could soon become routine in clinical practice for disease diagnosis, prognosis, treatment planning, and post-treatment surveillance.
Privacy concerns surrounding patient data present a major barrier to the widespread adoption of AI in medical imaging.
Federated Learning (FL) offers a solution that enables organizations to train AI models collaboratively without sharing sensitive data.
arXiv Detail & Related papers (2024-09-24T16:55:32Z) - Federated Learning Enables Big Data for Rare Cancer Boundary Detection [98.5549882883963]
We present findings from the largest Federated ML study to-date, involving data from 71 healthcare institutions across 6 continents.
We generate an automatic tumor boundary detector for the rare disease of glioblastoma.
We demonstrate a 33% improvement over a publicly trained model to delineate the surgically targetable tumor, and 23% improvement over the tumor's entire extent.
arXiv Detail & Related papers (2022-04-22T17:27:00Z) - When Accuracy Meets Privacy: Two-Stage Federated Transfer Learning
Framework in Classification of Medical Images on Limited Data: A COVID-19
Case Study [77.34726150561087]
COVID-19 pandemic has spread rapidly and caused a shortage of global medical resources.
CNN has been widely utilized and verified in analyzing medical images.
arXiv Detail & Related papers (2022-03-24T02:09:41Z) - Advancing COVID-19 Diagnosis with Privacy-Preserving Collaboration in
Artificial Intelligence [79.038671794961]
We launch the Unified CT-COVID AI Diagnostic Initiative (UCADI), where the AI model can be distributedly trained and independently executed at each host institution.
Our study is based on 9,573 chest computed tomography scans (CTs) from 3,336 patients collected from 23 hospitals located in China and the UK.
arXiv Detail & Related papers (2021-11-18T00:43:41Z) - Deep learning-based COVID-19 pneumonia classification using chest CT
images: model generalizability [54.86482395312936]
Deep learning (DL) classification models were trained to identify COVID-19-positive patients on 3D computed tomography (CT) datasets from different countries.
We trained nine identical DL-based classification models by using combinations of the datasets with a 72% train, 8% validation, and 20% test data split.
The models trained on multiple datasets and evaluated on a test set from one of the datasets used for training performed better.
arXiv Detail & Related papers (2021-02-18T21:14:52Z) - FLOP: Federated Learning on Medical Datasets using Partial Networks [84.54663831520853]
COVID-19 Disease due to the novel coronavirus has caused a shortage of medical resources.
Different data-driven deep learning models have been developed to mitigate the diagnosis of COVID-19.
The data itself is still scarce due to patient privacy concerns.
We propose a simple yet effective algorithm, named textbfFederated textbfL textbfon Medical datasets using textbfPartial Networks (FLOP)
arXiv Detail & Related papers (2021-02-10T01:56:58Z) - Blockchain-Federated-Learning and Deep Learning Models for COVID-19
detection using CT Imaging [8.280858576611587]
Primary problem in diagnosing COVID-19 patients is the shortage and reliability of testing kits.
Second real-world problem is to share the data among the hospitals globally.
Thirdly, we design a method that can collaboratively train a global model using blockchain technology.
arXiv Detail & Related papers (2020-07-10T11:23:14Z) - Heterogeneity Loss to Handle Intersubject and Intrasubject Variability
in Cancer [11.440201348567681]
Deep learning (DL) models have shown impressive results in medical domain.
These AI methods can provide immense support to developing nations as affordable healthcare solutions.
This work is focused on one such application of blood cancer diagnosis.
arXiv Detail & Related papers (2020-03-06T16:16:23Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.