Spatio-Temporal Split Learning for Privacy-Preserving Medical Platforms:
Case Studies with COVID-19 CT, X-Ray, and Cholesterol Data
- URL: http://arxiv.org/abs/2108.10147v1
- Date: Fri, 20 Aug 2021 04:47:02 GMT
- Title: Spatio-Temporal Split Learning for Privacy-Preserving Medical Platforms:
Case Studies with COVID-19 CT, X-Ray, and Cholesterol Data
- Authors: Yoo Jeong Ha, Minjae Yoo, Gusang Lee, Soyi Jung, Sae Won Choi,
Joongheon Kim, and Seehwan Yoo
- Abstract summary: Patient records are one of the most sensitive private information that is not usually shared among institutes.
This paper presents a distributed deep neural network framework, which is a turning point in allowing collaboration among privacy-sensitive organizations.
- Score: 10.169998593773915
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Machine learning requires a large volume of sample data, especially when it
is used in high-accuracy medical applications. However, patient records are one
of the most sensitive private information that is not usually shared among
institutes. This paper presents spatio-temporal split learning, a distributed
deep neural network framework, which is a turning point in allowing
collaboration among privacy-sensitive organizations. Our spatio-temporal split
learning presents how distributed machine learning can be efficiently conducted
with minimal privacy concerns. The proposed split learning consists of a number
of clients and a centralized server. Each client has only has one hidden layer,
which acts as the privacy-preserving layer, and the centralized server
comprises the other hidden layers and the output layer. Since the centralized
server does not need to access the training data and trains the deep neural
network with parameters received from the privacy-preserving layer, privacy of
original data is guaranteed. We have coined the term, spatio-temporal split
learning, as multiple clients are spatially distributed to cover diverse
datasets from different participants, and we can temporally split the learning
process, detaching the privacy preserving layer from the rest of the learning
process to minimize privacy breaches. This paper shows how we can analyze the
medical data whilst ensuring privacy using our proposed multi-site
spatio-temporal split learning algorithm on Coronavirus Disease-19 (COVID-19)
chest Computed Tomography (CT) scans, MUsculoskeletal RAdiographs (MURA) X-ray
images, and cholesterol levels.
Related papers
- CURE: Privacy-Preserving Split Learning Done Right [1.388112207221632]
Homomorphic encryption (HE)-based solutions exist for this scenario but often impose prohibitive computational burdens.
CURE is a novel system that encrypts only the server side of the model and the data.
We demonstrate CURE can achieve similar accuracy to plaintext SL while being 16x more efficient in terms of the runtime.
arXiv Detail & Related papers (2024-07-12T04:10:19Z) - Scalable Collaborative Learning via Representation Sharing [53.047460465980144]
Federated learning (FL) and Split Learning (SL) are two frameworks that enable collaborative learning while keeping the data private (on device)
In FL, each data holder trains a model locally and releases it to a central server for aggregation.
In SL, the clients must release individual cut-layer activations (smashed data) to the server and wait for its response (during both inference and back propagation).
In this work, we present a novel approach for privacy-preserving machine learning, where the clients collaborate via online knowledge distillation using a contrastive loss.
arXiv Detail & Related papers (2022-11-20T10:49:22Z) - When Accuracy Meets Privacy: Two-Stage Federated Transfer Learning
Framework in Classification of Medical Images on Limited Data: A COVID-19
Case Study [77.34726150561087]
COVID-19 pandemic has spread rapidly and caused a shortage of global medical resources.
CNN has been widely utilized and verified in analyzing medical images.
arXiv Detail & Related papers (2022-03-24T02:09:41Z) - Feasibility Study of Multi-Site Split Learning for Privacy-Preserving
Medical Systems under Data Imbalance Constraints in COVID-19, X-Ray, and
Cholesterol Dataset [11.021444485027143]
This paper provides a novel split learning algorithm coined the term, "multi-site split learning"
It enables a secure transfer of medical data between multiple hospitals without fear of exposing personal data contained in patient records.
arXiv Detail & Related papers (2022-02-21T03:51:27Z) - Practical Challenges in Differentially-Private Federated Survival
Analysis of Medical Data [57.19441629270029]
In this paper, we take advantage of the inherent properties of neural networks to federate the process of training of survival analysis models.
In the realistic setting of small medical datasets and only a few data centers, this noise makes it harder for the models to converge.
We propose DPFed-post which adds a post-processing stage to the private federated learning scheme.
arXiv Detail & Related papers (2022-02-08T10:03:24Z) - DQRE-SCnet: A novel hybrid approach for selecting users in Federated
Learning with Deep-Q-Reinforcement Learning based on Spectral Clustering [1.174402845822043]
Machine learning models based on sensitive data in the real-world promise advances in areas ranging from medical screening to disease outbreaks, agriculture, industry, defense science, and more.
In many applications, learning participant communication rounds benefit from collecting their own private data sets, teaching detailed machine learning models on the real data, and sharing the benefits of using these models.
Due to existing privacy and security concerns, most people avoid sensitive data sharing for training. Without each user demonstrating their local data to a central server, Federated Learning allows various parties to train a machine learning algorithm on their shared data jointly.
arXiv Detail & Related papers (2021-11-07T15:14:29Z) - NeuraCrypt: Hiding Private Health Data via Random Neural Networks for
Public Training [64.54200987493573]
We propose NeuraCrypt, a private encoding scheme based on random deep neural networks.
NeuraCrypt encodes raw patient data using a randomly constructed neural network known only to the data-owner.
We show that NeuraCrypt achieves competitive accuracy to non-private baselines on a variety of x-ray tasks.
arXiv Detail & Related papers (2021-06-04T13:42:21Z) - FedDG: Federated Domain Generalization on Medical Image Segmentation via
Episodic Learning in Continuous Frequency Space [63.43592895652803]
Federated learning allows distributed medical institutions to collaboratively learn a shared prediction model with privacy protection.
While at clinical deployment, the models trained in federated learning can still suffer from performance drop when applied to completely unseen hospitals outside the federation.
We present a novel approach, named as Episodic Learning in Continuous Frequency Space (ELCFS), for this problem.
The effectiveness of our method is demonstrated with superior performance over state-of-the-arts and in-depth ablation experiments on two medical image segmentation tasks.
arXiv Detail & Related papers (2021-03-10T13:05:23Z) - FLOP: Federated Learning on Medical Datasets using Partial Networks [84.54663831520853]
COVID-19 Disease due to the novel coronavirus has caused a shortage of medical resources.
Different data-driven deep learning models have been developed to mitigate the diagnosis of COVID-19.
The data itself is still scarce due to patient privacy concerns.
We propose a simple yet effective algorithm, named textbfFederated textbfL textbfon Medical datasets using textbfPartial Networks (FLOP)
arXiv Detail & Related papers (2021-02-10T01:56:58Z) - Privacy-Preserving Deep Learning Computation for Geo-Distributed Medical
Big-Data Platforms [12.64748421991378]
This paper proposes a distributed deep learning framework for privacy-preserving medical data training.
In order to avoid patients' data leakage in medical platforms, the hidden layers in the deep learning framework are separated and where the first layer is kept in platform and others layers are kept in a centralized server.
arXiv Detail & Related papers (2020-01-09T11:46:29Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.