Some Examples of Privacy-preserving Publication and Sharing of COVID-19
Pandemic Data
- URL: http://arxiv.org/abs/2106.10339v3
- Date: Tue, 13 Sep 2022 21:49:26 GMT
- Title: Some Examples of Privacy-preserving Publication and Sharing of COVID-19
Pandemic Data
- Authors: Fang Liu, Dong Wang, Tian Yan
- Abstract summary: We use three common but distinct data types collected during the pandemic to illustrate the publication and sharing of granular information and individual-level pandemic data in a privacy-preserving manner.
We investigate the inferential utility of privacy-preserving information through simulation studies at different levels of privacy guarantees and demonstrate the approaches in real-life data.
- Score: 9.514015456352265
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: A considerable amount of various types of data have been collected during the
COVID-19 pandemic, the analysis and interpretation of which have been
indispensable for curbing the spread of the disease. As the pandemic moves to
an endemic state, the data collected during the pandemic will continue to be
rich sources for further studying and understanding the impacts of the pandemic
on various aspects of our society. On the other hand, na\"{i}ve release and
sharing of the information can be associated with serious privacy concerns. In
this study, we use three common but distinct data types collected during the
pandemic (case surveillance tabular data, case location data, and contact
tracing networks) to illustrate the publication and sharing of granular
information and individual-level pandemic data in a privacy-preserving manner.
We leverage and build upon the concept of differential privacy to generate and
release privacy-preserving data for each data type. We investigate the
inferential utility of privacy-preserving information through simulation
studies at different levels of privacy guarantees and demonstrate the
approaches in real-life data. All the approaches employed in the study are
straightforward to apply. Our study generates statistical evidence on the
practical feasibility of sharing pandemic data with privacy guarantees and on
how to balance the statistical utility of released information during this
process.
Related papers
- Tabular Data Synthesis with Differential Privacy: A Survey [24.500349285858597]
Data sharing is a prerequisite for collaborative innovation, enabling organizations to leverage diverse datasets for deeper insights.
Data synthesis tackles this by generating artificial datasets that preserve the statistical characteristics of real data.
Differentially private data synthesis has emerged as a promising approach to privacy-aware data sharing.
arXiv Detail & Related papers (2024-11-04T06:32:48Z) - Collection, usage and privacy of mobility data in the enterprise and public administrations [55.2480439325792]
Security measures such as anonymization are needed to protect individuals' privacy.
Within our study, we conducted expert interviews to gain insights into practices in the field.
We survey privacy-enhancing methods in use, which generally do not comply with state-of-the-art standards of differential privacy.
arXiv Detail & Related papers (2024-07-04T08:29:27Z) - Guarding Multiple Secrets: Enhanced Summary Statistic Privacy for Data Sharing [3.7274308010465775]
We propose a novel framework to define, analyze, and protect multi-secret summary statistics privacy in data sharing.
We measure the privacy risk of any data release mechanism by the worst-case probability of an attacker successfully inferring summary statistic secrets.
arXiv Detail & Related papers (2024-05-22T16:30:34Z) - A Summary of Privacy-Preserving Data Publishing in the Local Setting [0.6749750044497732]
Statistical Disclosure Control aims to minimize the risk of exposing confidential information by de-identifying it.
We outline the current privacy-preserving techniques employed in microdata de-identification, delve into privacy measures tailored for various disclosure scenarios, and assess metrics for information loss and predictive performance.
arXiv Detail & Related papers (2023-12-19T04:23:23Z) - Preserving The Safety And Confidentiality Of Data Mining Information In Health Care: A literature review [0.0]
PPDM technique enables the extraction of actionable insight from enormous volume of data.
Disclosing sensitive information infringes on patients' privacy.
This paper aims to conduct a review of related work on privacy-preserving mechanisms, data protection regulations, and mitigating tactics.
arXiv Detail & Related papers (2023-10-30T05:32:15Z) - A Unified View of Differentially Private Deep Generative Modeling [60.72161965018005]
Data with privacy concerns comes with stringent regulations that frequently prohibited data access and data sharing.
Overcoming these obstacles is key for technological progress in many real-world application scenarios that involve privacy sensitive data.
Differentially private (DP) data publishing provides a compelling solution, where only a sanitized form of the data is publicly released.
arXiv Detail & Related papers (2023-09-27T14:38:16Z) - Private Set Generation with Discriminative Information [63.851085173614]
Differentially private data generation is a promising solution to the data privacy challenge.
Existing private generative models are struggling with the utility of synthetic samples.
We introduce a simple yet effective method that greatly improves the sample utility of state-of-the-art approaches.
arXiv Detail & Related papers (2022-11-07T10:02:55Z) - DP2-Pub: Differentially Private High-Dimensional Data Publication with
Invariant Post Randomization [58.155151571362914]
We propose a differentially private high-dimensional data publication mechanism (DP2-Pub) that runs in two phases.
splitting attributes into several low-dimensional clusters with high intra-cluster cohesion and low inter-cluster coupling helps obtain a reasonable privacy budget.
We also extend our DP2-Pub mechanism to the scenario with a semi-honest server which satisfies local differential privacy.
arXiv Detail & Related papers (2022-08-24T17:52:43Z) - Practical Challenges in Differentially-Private Federated Survival
Analysis of Medical Data [57.19441629270029]
In this paper, we take advantage of the inherent properties of neural networks to federate the process of training of survival analysis models.
In the realistic setting of small medical datasets and only a few data centers, this noise makes it harder for the models to converge.
We propose DPFed-post which adds a post-processing stage to the private federated learning scheme.
arXiv Detail & Related papers (2022-02-08T10:03:24Z) - Epidemic mitigation by statistical inference from contact tracing data [61.04165571425021]
We develop Bayesian inference methods to estimate the risk that an individual is infected.
We propose to use probabilistic risk estimation in order to optimize testing and quarantining strategies for the control of an epidemic.
Our approaches translate into fully distributed algorithms that only require communication between individuals who have recently been in contact.
arXiv Detail & Related papers (2020-09-20T12:24:45Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.