Operationalizing the Legal Principle of Data Minimization for
Personalization
- URL: http://arxiv.org/abs/2005.13718v1
- Date: Thu, 28 May 2020 00:43:06 GMT
- Title: Operationalizing the Legal Principle of Data Minimization for
Personalization
- Authors: Asia J. Biega, Peter Potash, Hal Daum\'e III, Fernando Diaz, Mich\`ele
Finck
- Abstract summary: We identify a lack of a homogeneous interpretation of the data minimization principle and explore two operational definitions applicable in the context of personalization.
We find that the performance decrease incurred by data minimization might not be substantial, but it might disparately impact different users.
- Score: 64.0027026050706
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Article 5(1)(c) of the European Union's General Data Protection Regulation
(GDPR) requires that "personal data shall be [...] adequate, relevant, and
limited to what is necessary in relation to the purposes for which they are
processed (`data minimisation')". To date, the legal and computational
definitions of `purpose limitation' and `data minimization' remain largely
unclear. In particular, the interpretation of these principles is an open issue
for information access systems that optimize for user experience through
personalization and do not strictly require personal data collection for the
delivery of basic service.
In this paper, we identify a lack of a homogeneous interpretation of the data
minimization principle and explore two operational definitions applicable in
the context of personalization. The focus of our empirical study in the domain
of recommender systems is on providing foundational insights about the (i)
feasibility of different data minimization definitions, (ii) robustness of
different recommendation algorithms to minimization, and (iii) performance of
different minimization strategies.We find that the performance decrease
incurred by data minimization might not be substantial, but that it might
disparately impact different users---a finding which has implications for the
viability of different formal minimization definitions. Overall, our analysis
uncovers the complexities of the data minimization problem in the context of
personalization and maps the remaining computational and regulatory challenges.
Related papers
- The trade-off between data minimization and fairness in collaborative filtering [1.8936798735951967]
General Data Protection Regulations aim to safeguard individuals' personal information from harm.
While full compliance is mandatory in the EU, it is not in other places.
This paper studies the relationship between principles of data minimization and fairness in recommender systems.
arXiv Detail & Related papers (2024-09-21T02:32:26Z) - The Data Minimization Principle in Machine Learning [61.17813282782266]
Data minimization aims to reduce the amount of data collected, processed or retained.
It has been endorsed by various global data protection regulations.
However, its practical implementation remains a challenge due to the lack of a rigorous formulation.
arXiv Detail & Related papers (2024-05-29T19:40:27Z) - Private Set Generation with Discriminative Information [63.851085173614]
Differentially private data generation is a promising solution to the data privacy challenge.
Existing private generative models are struggling with the utility of synthetic samples.
We introduce a simple yet effective method that greatly improves the sample utility of state-of-the-art approaches.
arXiv Detail & Related papers (2022-11-07T10:02:55Z) - Quantization for decentralized learning under subspace constraints [61.59416703323886]
We consider decentralized optimization problems where agents have individual cost functions to minimize subject to subspace constraints.
We propose and study an adaptive decentralized strategy where the agents employ differential randomized quantizers to compress their estimates.
The analysis shows that, under some general conditions on the quantization noise, the strategy is stable both in terms of mean-square error and average bit rate.
arXiv Detail & Related papers (2022-09-16T09:38:38Z) - Pessimistic Minimax Value Iteration: Provably Efficient Equilibrium
Learning from Offline Datasets [101.5329678997916]
We study episodic two-player zero-sum Markov games (MGs) in the offline setting.
The goal is to find an approximate Nash equilibrium (NE) policy pair based on a dataset collected a priori.
arXiv Detail & Related papers (2022-02-15T15:39:30Z) - Learning to Limit Data Collection via Scaling Laws: Data Minimization
Compliance in Practice [62.44110411199835]
We build on literature in machine learning law to propose framework for limiting collection based on data interpretation that ties data to system performance.
We formalize a data minimization criterion based on performance curve derivatives and provide an effective and interpretable piecewise power law technique.
arXiv Detail & Related papers (2021-07-16T19:59:01Z) - Reviving Purpose Limitation and Data Minimisation in Personalisation,
Profiling and Decision-Making Systems [0.0]
This paper determines, through an interdisciplinary law and computer science lens, whether data minimisation and purpose limitation can be meaningfully implemented in data-driven systems.
Our analysis reveals that the two legal principles continue to play an important role in mitigating the risks of personal data processing.
We highlight that even though these principles are important safeguards in the systems under consideration, there are important limits to their practical implementation.
arXiv Detail & Related papers (2021-01-15T16:36:29Z) - Differentially Private Simple Linear Regression [2.614403183902121]
We study algorithms for simple linear regression that satisfy differential privacy.
We consider the design of differentially private algorithms for simple linear regression for small datasets.
We study the performance of a spectrum of algorithms we adapt to the setting.
arXiv Detail & Related papers (2020-07-10T04:28:43Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.