Machine Learning in Precision Medicine to Preserve Privacy via
Encryption
- URL: http://arxiv.org/abs/2102.03412v1
- Date: Fri, 5 Feb 2021 20:22:15 GMT
- Title: Machine Learning in Precision Medicine to Preserve Privacy via
Encryption
- Authors: William Briguglio, Parisa Moghaddam, Waleed A. Yousef, Issa Traore,
Mohammad Mamun
- Abstract summary: We propose a generic machine learning with encryption (MLE) framework, which we used to build an ML model that predicts cancer.
Our framework's prediction accuracy is slightly higher than that of the most recent studies conducted on the same dataset.
We provide an open-source repository that contains the design and implementation of the framework, all the ML experiments and code, and the final predictive model deployed to a free cloud service.
- Score: 2.099922236065961
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Precision medicine is an emerging approach for disease treatment and
prevention that delivers personalized care to individual patients by
considering their genetic makeups, medical histories, environments, and
lifestyles. Despite the rapid advancement of precision medicine and its
considerable promise, several underlying technological challenges remain
unsolved. One such challenge of great importance is the security and privacy of
precision health-related data, such as genomic data and electronic health
records, which stifle collaboration and hamper the full potential of
machine-learning (ML) algorithms. To preserve data privacy while providing ML
solutions, this article makes three contributions. First, we propose a generic
machine learning with encryption (MLE) framework, which we used to build an ML
model that predicts cancer from one of the most recent comprehensive genomics
datasets in the field. Second, our framework's prediction accuracy is slightly
higher than that of the most recent studies conducted on the same dataset, yet
it maintains the privacy of the patients' genomic data. Third, to facilitate
the validation, reproduction, and extension of this work, we provide an
open-source repository that contains the design and implementation of the
framework, all the ML experiments and code, and the final predictive model
deployed to a free cloud service.
Related papers
- Prediction and Detection of Terminal Diseases Using Internet of Medical Things: A Review [4.4389631374821255]
AI-driven models have achieved over 98% accuracy in predicting heart disease, chronic kidney disease (CKD), Alzheimer's disease, and lung cancer.
The incorporation of IoMT data, which is vast and heterogeneous, adds complexities in ensuring interoperability and security to protect patient privacy.
Future research should focus on data standardization and advanced preprocessing techniques to improve data quality and interoperability.
arXiv Detail & Related papers (2024-09-22T15:02:33Z) - Controllable Synthetic Clinical Note Generation with Privacy Guarantees [7.1366477372157995]
In this paper, we introduce a novel method to "clone" datasets containing Personal Health Information (PHI)
Our approach ensures that the cloned datasets retain the essential characteristics and utility of the original data without compromising patient privacy.
We conduct utility testing to evaluate the performance of machine learning models trained on the cloned datasets.
arXiv Detail & Related papers (2024-09-12T07:38:34Z) - Privacy-Preserving Collaborative Genomic Research: A Real-Life Deployment and Vision [2.7968600664591983]
This paper presents a privacy-preserving framework for genomic research, developed in collaboration with Lynx.MD.
The framework addresses critical cybersecurity and privacy challenges, enabling the privacy-preserving sharing and analysis of genomic data.
Implementing the framework within Lynx.MD involves encoding genomic data into binary formats and applying noise through controlled perturbation techniques.
arXiv Detail & Related papers (2024-07-12T05:43:13Z) - Decentralised, Collaborative, and Privacy-preserving Machine Learning for Multi-Hospital Data [31.106733834322394]
We propose Decentralized, Collaborative, and Privacy-preserving ML for Multi-Hospital Data (DeCaPH)
We demonstrate the generalizability and power of DeCaPH on three distinct tasks using real-world distributed medical datasets.
arXiv Detail & Related papers (2024-01-31T22:06:10Z) - Leveraging Generative AI Models for Synthetic Data Generation in
Healthcare: Balancing Research and Privacy [0.0]
generative AI models like GANs and VAEs offer a promising solution to balance valuable data access and patient privacy protection.
In this paper, we examine generative AI models for creating realistic, anonymized patient data for research and training.
arXiv Detail & Related papers (2023-05-09T08:12:44Z) - Privacy-preserving machine learning for healthcare: open challenges and
future perspectives [72.43506759789861]
We conduct a review of recent literature concerning Privacy-Preserving Machine Learning (PPML) for healthcare.
We primarily focus on privacy-preserving training and inference-as-a-service.
The aim of this review is to guide the development of private and efficient ML models in healthcare.
arXiv Detail & Related papers (2023-03-27T19:20:51Z) - Safe AI for health and beyond -- Monitoring to transform a health
service [51.8524501805308]
We will assess the infrastructure required to monitor the outputs of a machine learning algorithm.
We will present two scenarios with examples of monitoring and updates of models.
arXiv Detail & Related papers (2023-03-02T17:27:45Z) - Unsupervised pre-training of graph transformers on patient population
graphs [48.02011627390706]
We propose a graph-transformer-based network to handle heterogeneous clinical data.
We show the benefit of our pre-training method in a self-supervised and a transfer learning setting.
arXiv Detail & Related papers (2022-07-21T16:59:09Z) - Privacy-preserving medical image analysis [53.4844489668116]
We present PriMIA, a software framework designed for privacy-preserving machine learning (PPML) in medical imaging.
We show significantly better classification performance of a securely aggregated federated learning model compared to human experts on unseen datasets.
We empirically evaluate the framework's security against a gradient-based model inversion attack.
arXiv Detail & Related papers (2020-12-10T13:56:00Z) - UNITE: Uncertainty-based Health Risk Prediction Leveraging Multi-sourced
Data [81.00385374948125]
We present UNcertaInTy-based hEalth risk prediction (UNITE) model.
UNITE provides accurate disease risk prediction and uncertainty estimation leveraging multi-sourced health data.
We evaluate UNITE on real-world disease risk prediction tasks: nonalcoholic fatty liver disease (NASH) and Alzheimer's disease (AD)
UNITE achieves up to 0.841 in F1 score for AD detection, up to 0.609 in PR-AUC for NASH detection, and outperforms various state-of-the-art baselines by up to $19%$ over the best baseline.
arXiv Detail & Related papers (2020-10-22T02:28:11Z) - Select-ProtoNet: Learning to Select for Few-Shot Disease Subtype
Prediction [55.94378672172967]
We focus on few-shot disease subtype prediction problem, identifying subgroups of similar patients.
We introduce meta learning techniques to develop a new model, which can extract the common experience or knowledge from interrelated clinical tasks.
Our new model is built upon a carefully designed meta-learner, called Prototypical Network, that is a simple yet effective meta learning machine for few-shot image classification.
arXiv Detail & Related papers (2020-09-02T02:50:30Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.