Towards Compliant Data Management Systems for Healthcare ML
- URL: http://arxiv.org/abs/2011.07555v1
- Date: Sun, 15 Nov 2020 15:27:51 GMT
- Title: Towards Compliant Data Management Systems for Healthcare ML
- Authors: Goutham Ramakrishnan, Aditya Nori, Hannah Murfet, Pashmina Cameron
- Abstract summary: We review how data flows within machine learning projects in healthcare from source to storage to use in training algorithms and beyond.
Our objective is to design tools to detect and track sensitive data across machines and users across the life cycle of a project.
We build a prototype of the solution that demonstrates the difficulties in this domain.
- Score: 6.057289837472806
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: The increasing popularity of machine learning approaches and the rising
awareness of data protection and data privacy presents an opportunity to build
truly secure and trustworthy healthcare systems. Regulations such as GDPR and
HIPAA present broad guidelines and frameworks, but the implementation can
present technical challenges. Compliant data management systems require
enforcement of a number of technical and administrative safeguards. While
policies can be set for both safeguards there is limited availability to
understand compliance in real time. Increasingly, machine learning
practitioners are becoming aware of the importance of keeping track of
sensitive data. With sensitivity over personally identifiable, health or
commercially sensitive information there would be value in understanding
assessment of the flow of data in a more dynamic fashion. We review how data
flows within machine learning projects in healthcare from source to storage to
use in training algorithms and beyond. Based on this, we design engineering
specifications and solutions for versioning of data. Our objective is to design
tools to detect and track sensitive data across machines and users across the
life cycle of a project, prioritizing efficiency, consistency and ease of use.
We build a prototype of the solution that demonstrates the difficulties in this
domain. Together, these represent first efforts towards building a compliant
data management system for healthcare machine learning projects.
Related papers
- A Federated Learning Platform as a Service for Advancing Stroke Management in European Clinical Centers [37.285731240749904]
This paper introduces a novel Federated Learning (FL) platform designed to support the configuration, monitoring, and management of FL processes.
Considering the production sensitivity inherent in clinical environments, we emphasize the security of the proposed FL architecture.
The platform has been successfully tested in various operational environments using a publicly available dataset.
arXiv Detail & Related papers (2024-10-02T09:24:05Z) - Privacy-Preserving Edge Federated Learning for Intelligent Mobile-Health Systems [4.082799056366928]
We propose a privacy-preserving edge FL framework for resource-constrained mobile-health and wearable technologies over the IoT infrastructure.
We evaluate our proposed framework extensively and provide the implementation of our technique on Amazon's AWS cloud platform.
arXiv Detail & Related papers (2024-05-09T08:15:31Z) - Explainable Machine Learning-Based Security and Privacy Protection Framework for Internet of Medical Things Systems [1.8434042562191815]
The Internet of Medical Things (IoMT) transcends traditional medical boundaries, enabling a transition from reactive treatment to proactive prevention.
Its benefits are countered by significant security challenges that endanger the lives of its users due to the sensitivity and value of the processed data.
A new framework for Intrusion Detection Systems (IDS) is introduced, leveraging Artificial Neural Networks (ANN) for intrusion detection while utilizing Federated Learning (FL) for privacy preservation.
arXiv Detail & Related papers (2024-03-14T11:57:26Z) - Clairvoyance: A Pipeline Toolkit for Medical Time Series [95.22483029602921]
Time-series learning is the bread and butter of data-driven *clinical decision support*
Clairvoyance proposes a unified, end-to-end, autoML-friendly pipeline that serves as a software toolkit.
Clairvoyance is the first to demonstrate viability of a comprehensive and automatable pipeline for clinical time-series ML.
arXiv Detail & Related papers (2023-10-28T12:08:03Z) - Blockchain-empowered Federated Learning for Healthcare Metaverses:
User-centric Incentive Mechanism with Optimal Data Freshness [66.3982155172418]
We first design a user-centric privacy-preserving framework based on decentralized Federated Learning (FL) for healthcare metaverses.
We then utilize Age of Information (AoI) as an effective data-freshness metric and propose an AoI-based contract theory model under Prospect Theory (PT) to motivate sensing data sharing.
arXiv Detail & Related papers (2023-07-29T12:54:03Z) - Privacy-Preserving Graph Machine Learning from Data to Computation: A
Survey [67.7834898542701]
We focus on reviewing privacy-preserving techniques of graph machine learning.
We first review methods for generating privacy-preserving graph data.
Then we describe methods for transmitting privacy-preserved information.
arXiv Detail & Related papers (2023-07-10T04:30:23Z) - Safe AI for health and beyond -- Monitoring to transform a health
service [51.8524501805308]
We will assess the infrastructure required to monitor the outputs of a machine learning algorithm.
We will present two scenarios with examples of monitoring and updates of models.
arXiv Detail & Related papers (2023-03-02T17:27:45Z) - Information Governance as a Socio-Technical Process in the Development
of Trustworthy Healthcare AI [0.0]
Information Governance (IG) processes govern the use of personal confidential data.
Legal basis for data sharing is explicit only for the purpose of delivering patient care.
IG work should start early in the design life cycle and will likely continue throughout.
arXiv Detail & Related papers (2023-01-04T10:21:46Z) - Bringing the Algorithms to the Data -- Secure Distributed Medical
Analytics using the Personal Health Train (PHT-meDIC) [1.451998131020241]
Personal Health Train (PHT) paradigm implements an 'algorithm to the data' paradigm.
We present PHT-meDIC, a productively deployed open-source implementation of the PHT concept.
arXiv Detail & Related papers (2022-12-07T06:29:15Z) - Distributed Machine Learning and the Semblance of Trust [66.1227776348216]
Federated Learning (FL) allows the data owner to maintain data governance and perform model training locally without having to share their data.
FL and related techniques are often described as privacy-preserving.
We explain why this term is not appropriate and outline the risks associated with over-reliance on protocols that were not designed with formal definitions of privacy in mind.
arXiv Detail & Related papers (2021-12-21T08:44:05Z) - Dataset Security for Machine Learning: Data Poisoning, Backdoor Attacks,
and Defenses [150.64470864162556]
This work systematically categorizes and discusses a wide range of dataset vulnerabilities and exploits.
In addition to describing various poisoning and backdoor threat models and the relationships among them, we develop their unified taxonomy.
arXiv Detail & Related papers (2020-12-18T22:38:47Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.