Practical Challenges in Differentially-Private Federated Survival
Analysis of Medical Data
- URL: http://arxiv.org/abs/2202.03758v1
- Date: Tue, 8 Feb 2022 10:03:24 GMT
- Title: Practical Challenges in Differentially-Private Federated Survival
Analysis of Medical Data
- Authors: Shadi Rahimian, Raouf Kerkouche, Ina Kurth, Mario Fritz
- Abstract summary: In this paper, we take advantage of the inherent properties of neural networks to federate the process of training of survival analysis models.
In the realistic setting of small medical datasets and only a few data centers, this noise makes it harder for the models to converge.
We propose DPFed-post which adds a post-processing stage to the private federated learning scheme.
- Score: 57.19441629270029
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Survival analysis or time-to-event analysis aims to model and predict the
time it takes for an event of interest to happen in a population or an
individual. In the medical context this event might be the time of dying,
metastasis, recurrence of cancer, etc. Recently, the use of neural networks
that are specifically designed for survival analysis has become more popular
and an attractive alternative to more traditional methods. In this paper, we
take advantage of the inherent properties of neural networks to federate the
process of training of these models. This is crucial in the medical domain
since data is scarce and collaboration of multiple health centers is essential
to make a conclusive decision about the properties of a treatment or a disease.
To ensure the privacy of the datasets, it is common to utilize differential
privacy on top of federated learning. Differential privacy acts by introducing
random noise to different stages of training, thus making it harder for an
adversary to extract details about the data. However, in the realistic setting
of small medical datasets and only a few data centers, this noise makes it
harder for the models to converge. To address this problem, we propose
DPFed-post which adds a post-processing stage to the private federated learning
scheme. This extra step helps to regulate the magnitude of the noisy average
parameter update and easier convergence of the model. For our experiments, we
choose 3 real-world datasets in the realistic setting when each health center
has only a few hundred records, and we show that DPFed-post successfully
increases the performance of the models by an average of up to $17\%$ compared
to the standard differentially private federated learning scheme.
Related papers
- Comparing Federated Stochastic Gradient Descent and Federated Averaging for Predicting Hospital Length of Stay [0.0]
Predicting hospital length of stay (LOS) reliably is an essential need for efficient resource allocation at hospitals.
Traditional predictive modeling tools frequently have difficulty acquiring sufficient and diverse data because healthcare institutions have privacy rules in place.
This modeling approach facilitates collaborative model training by modeling decentralized data sources from different hospitals without extracting sensitive data outside of hospitals.
arXiv Detail & Related papers (2024-07-17T17:00:20Z) - Federated Data Model [16.62770246342126]
In artificial intelligence (AI), especially deep learning, data diversity and volume play a pivotal role in model development.
We developed a method called the Federated Data Model (FDM) to train robust deep learning models across different locations.
Our results show that models trained with this method perform well both on the data they were originally trained on and on data from other sites.
arXiv Detail & Related papers (2024-03-13T18:16:54Z) - MedDiffusion: Boosting Health Risk Prediction via Diffusion-based Data
Augmentation [58.93221876843639]
This paper introduces a novel, end-to-end diffusion-based risk prediction model, named MedDiffusion.
It enhances risk prediction performance by creating synthetic patient data during training to enlarge sample space.
It discerns hidden relationships between patient visits using a step-wise attention mechanism, enabling the model to automatically retain the most vital information for generating high-quality data.
arXiv Detail & Related papers (2023-10-04T01:36:30Z) - Medical Federated Model with Mixture of Personalized and Sharing
Components [31.068735334318088]
We propose a new personalized framework of federated learning to handle the problem.
It successfully yields personalized models based on awareness of similarity between local data.
Also, we propose an effective method to reduce the computational cost, which improves computation efficiency significantly.
arXiv Detail & Related papers (2023-06-26T07:50:32Z) - Decentralized Distributed Learning with Privacy-Preserving Data
Synthesis [9.276097219140073]
In the medical field, multi-center collaborations are often sought to yield more generalizable findings by leveraging the heterogeneity of patient and clinical data.
Recent privacy regulations hinder the possibility to share data, and consequently, to come up with machine learning-based solutions that support diagnosis and prognosis.
We present a decentralized distributed method that integrates features from local nodes, providing models able to generalize across multiple datasets while maintaining privacy.
arXiv Detail & Related papers (2022-06-20T23:49:38Z) - LifeLonger: A Benchmark for Continual Disease Classification [59.13735398630546]
We introduce LifeLonger, a benchmark for continual disease classification on the MedMNIST collection.
Task and class incremental learning of diseases address the issue of classifying new samples without re-training the models from scratch.
Cross-domain incremental learning addresses the issue of dealing with datasets originating from different institutions while retaining the previously obtained knowledge.
arXiv Detail & Related papers (2022-04-12T12:25:05Z) - When Accuracy Meets Privacy: Two-Stage Federated Transfer Learning
Framework in Classification of Medical Images on Limited Data: A COVID-19
Case Study [77.34726150561087]
COVID-19 pandemic has spread rapidly and caused a shortage of global medical resources.
CNN has been widely utilized and verified in analyzing medical images.
arXiv Detail & Related papers (2022-03-24T02:09:41Z) - FLOP: Federated Learning on Medical Datasets using Partial Networks [84.54663831520853]
COVID-19 Disease due to the novel coronavirus has caused a shortage of medical resources.
Different data-driven deep learning models have been developed to mitigate the diagnosis of COVID-19.
The data itself is still scarce due to patient privacy concerns.
We propose a simple yet effective algorithm, named textbfFederated textbfL textbfon Medical datasets using textbfPartial Networks (FLOP)
arXiv Detail & Related papers (2021-02-10T01:56:58Z) - Select-ProtoNet: Learning to Select for Few-Shot Disease Subtype
Prediction [55.94378672172967]
We focus on few-shot disease subtype prediction problem, identifying subgroups of similar patients.
We introduce meta learning techniques to develop a new model, which can extract the common experience or knowledge from interrelated clinical tasks.
Our new model is built upon a carefully designed meta-learner, called Prototypical Network, that is a simple yet effective meta learning machine for few-shot image classification.
arXiv Detail & Related papers (2020-09-02T02:50:30Z) - Multi-site fMRI Analysis Using Privacy-preserving Federated Learning and
Domain Adaptation: ABIDE Results [13.615292855384729]
To train a high-quality deep learning model, the aggregation of a significant amount of patient information is required.
Due to the need to protect the privacy of patient data, it is hard to assemble a central database from multiple institutions.
Federated learning allows for population-level models to be trained without centralizing entities' data.
arXiv Detail & Related papers (2020-01-16T04:49:33Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.