Privacy-Preserving Technology to Help Millions of People: Federated
Prediction Model for Stroke Prevention
- URL: http://arxiv.org/abs/2006.10517v2
- Date: Tue, 15 Dec 2020 02:51:30 GMT
- Title: Privacy-Preserving Technology to Help Millions of People: Federated
Prediction Model for Stroke Prevention
- Authors: Ce Ju, Ruihui Zhao, Jichao Sun, Xiguang Wei, Bo Zhao, Yang Liu,
Hongshan Li, Tianjian Chen, Xinwei Zhang, Dashan Gao, Ben Tan, Han Yu,
Chuning He and Yuan Jin
- Abstract summary: Our scientists and engineers propose a privacy-preserving scheme to predict the risk of stroke and deploy our federated prediction model on cloud servers.
Our model trains over all the healthcare data from hospitals in a certain city without actual data sharing among them.
Especially for small hospitals with few confirmed stroke cases, our federated model boosts model performance by 10%~20% in several machine learning metrics.
- Score: 25.276264953982253
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Prevention of stroke with its associated risk factors has been one of the
public health priorities worldwide. Emerging artificial intelligence technology
is being increasingly adopted to predict stroke. Because of privacy concerns,
patient data are stored in distributed electronic health record (EHR)
databases, voluminous clinical datasets, which prevents patient data from being
aggregated and restrains AI technology from boosting the accuracy of stroke
prediction with centralized training data. In this work, our scientists and
engineers propose a privacy-preserving scheme to predict the risk of stroke and
deploy our federated prediction model on cloud servers. Our system of federated
prediction model asynchronously supports any number of client connections and
arbitrary local gradient iterations in each communication round. It adopts
federated averaging during the model training process, without patient data
being taken out of the hospitals during the whole process of model training and
forecasting. With the privacy-preserving mechanism, our federated prediction
model trains over all the healthcare data from hospitals in a certain city
without actual data sharing among them. Therefore, it is not only secure but
also more accurate than any single prediction model that trains over the data
only from one single hospital. Especially for small hospitals with few
confirmed stroke cases, our federated model boosts model performance by 10%~20%
in several machine learning metrics. To help stroke experts comprehend the
advantage of our prediction system more intuitively, we developed a mobile app
that collects the key information of patients' statistics and demonstrates
performance comparisons between the federated prediction model and the single
prediction model during the federated training process.
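The federated averaging step mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' system: it assumes a plain logistic regression model, synthetic one-feature data, and synchronous rounds (the abstract describes asynchronous client connections). All function names, hyperparameters, and client sizes below are hypothetical and chosen only for illustration.

```python
import math
import random


def local_update(weights, data, lr=0.1, local_steps=5):
    """Run a few gradient steps of logistic regression on one client's data."""
    w = list(weights)
    for _ in range(local_steps):
        grad = [0.0] * len(w)
        for x, y in data:
            z = sum(wi * xi for wi, xi in zip(w, x))
            p = 1.0 / (1.0 + math.exp(-z))
            for j, xj in enumerate(x):
                grad[j] += (p - y) * xj
        w = [wi - lr * g / len(data) for wi, g in zip(w, grad)]
    return w


def federated_average(client_weights, client_sizes):
    """Average client models, weighting each by its number of local samples."""
    total = sum(client_sizes)
    dim = len(client_weights[0])
    return [
        sum(w[j] * n for w, n in zip(client_weights, client_sizes)) / total
        for j in range(dim)
    ]


def make_client(n, rng):
    """Synthetic records (assumption): [bias, feature] with a noisy binary label."""
    data = []
    for _ in range(n):
        x = rng.uniform(-2, 2)
        y = 1 if x + rng.gauss(0, 0.5) > 0 else 0
        data.append(([1.0, x], y))
    return data


rng = random.Random(0)
# Three hypothetical hospitals of very different sizes; raw records stay local.
clients = [make_client(n, rng) for n in (200, 50, 10)]

global_w = [0.0, 0.0]
for _ in range(20):
    # Each client trains locally, starting from the current global model.
    updates = [local_update(global_w, d) for d in clients]
    # The server only ever sees model weights, never patient records.
    global_w = federated_average(updates, [len(d) for d in clients])
```

Only the weight vectors cross the client boundary in this sketch, which is the property the abstract relies on. Weighting the average by sample count lets the smallest client benefit from a model shaped mostly by the larger ones.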
Related papers
- Tackling Data Heterogeneity in Federated Time Series Forecasting [61.021413959988216]
Time series forecasting plays a critical role in various real-world applications, including energy consumption prediction, disease transmission monitoring, and weather forecasting.
Most existing methods rely on a centralized training paradigm, where large amounts of data are collected from distributed devices to a central cloud server.
We propose a novel framework, Fed-TREND, to address data heterogeneity by generating informative synthetic data as auxiliary knowledge carriers.
arXiv Detail & Related papers (2024-11-24T04:56:45Z)
- Federated GNNs for EEG-Based Stroke Assessment [1.3274340213871945]
This study proposes a novel method that combines federated learning (FL) and Graph Neural Networks (GNNs) to predict stroke severity.
Our approach enables multiple hospitals to jointly train a shared GNN model on their local EEG data without exchanging patient information.
arXiv Detail & Related papers (2024-11-04T17:13:35Z)
- MedDiffusion: Boosting Health Risk Prediction via Diffusion-based Data
Augmentation [58.93221876843639]
This paper introduces a novel, end-to-end diffusion-based risk prediction model, named MedDiffusion.
It enhances risk prediction performance by creating synthetic patient data during training to enlarge sample space.
It discerns hidden relationships between patient visits using a step-wise attention mechanism, enabling the model to automatically retain the most vital information for generating high-quality data.
arXiv Detail & Related papers (2023-10-04T01:36:30Z)
- Federated Learning of Medical Concepts Embedding using BEHRT [0.0]
We propose a federated learning approach for learning medical concept embeddings.
Our approach is based on an embedding model like BEHRT, a deep neural sequence model for EHR.
We compare the performance of a model trained with FL against a model trained on centralized data.
arXiv Detail & Related papers (2023-05-22T14:05:39Z)
- Unsupervised Pre-Training on Patient Population Graphs for Patient-Level
Predictions [48.02011627390706]
Pre-training has shown success in different areas of machine learning, such as Computer Vision (CV), Natural Language Processing (NLP) and medical imaging.
In this paper, we apply unsupervised pre-training to heterogeneous, multi-modal EHR data for patient outcome prediction.
We find that our proposed graph based pre-training method helps in modeling the data at a population level.
arXiv Detail & Related papers (2022-03-23T17:59:45Z)
- Bridging the Gap Between Patient-specific and Patient-independent
Seizure Prediction via Knowledge Distillation [7.2666838978096875]
Existing approaches typically train models in a patient-specific fashion due to the highly personalized characteristics of epileptic signals.
A patient-specific model can then be obtained with the help of distilled knowledge and additional personalized data.
Five state-of-the-art seizure prediction methods are trained on the CHB-MIT sEEG database with our proposed scheme.
arXiv Detail & Related papers (2022-02-25T10:30:29Z)
- Practical Challenges in Differentially-Private Federated Survival
Analysis of Medical Data [57.19441629270029]
In this paper, we take advantage of the inherent properties of neural networks to federate the process of training of survival analysis models.
In the realistic setting of small medical datasets and only a few data centers, this noise makes it harder for the models to converge.
We propose DPFed-post which adds a post-processing stage to the private federated learning scheme.
arXiv Detail & Related papers (2022-02-08T10:03:24Z)
- Test-time Collective Prediction [73.74982509510961]
Multiple parties in machine learning want to jointly make predictions on future test points.
Agents wish to benefit from the collective expertise of the full set of agents, but may not be willing to release their data or model parameters.
We explore a decentralized mechanism to make collective predictions at test time, leveraging each agent's pre-trained model.
arXiv Detail & Related papers (2021-06-22T18:29:58Z)
- UNITE: Uncertainty-based Health Risk Prediction Leveraging Multi-sourced
Data [81.00385374948125]
We present the UNcertaInTy-based hEalth risk prediction (UNITE) model.
UNITE provides accurate disease risk prediction and uncertainty estimation leveraging multi-sourced health data.
We evaluate UNITE on real-world disease risk prediction tasks: nonalcoholic fatty liver disease (NASH) and Alzheimer's disease (AD).
UNITE achieves up to 0.841 in F1 score for AD detection and up to 0.609 in PR-AUC for NASH detection, outperforming the best state-of-the-art baseline by up to 19%.
arXiv Detail & Related papers (2020-10-22T02:28:11Z)