Privacy-Aware Data Acquisition under Data Similarity in Regression
Markets
- URL: http://arxiv.org/abs/2312.02611v1
- Date: Tue, 5 Dec 2023 09:39:04 GMT
- Title: Privacy-Aware Data Acquisition under Data Similarity in Regression
Markets
- Authors: Shashi Raj Pandey, Pierre Pinson, and Petar Popovski
- Abstract summary: We show that data similarity and privacy preferences are integral to market design.
We numerically evaluate how data similarity affects market participation and traded data value.
- Score: 29.64195175524365
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Data markets facilitate decentralized data exchange for applications such as
prediction, learning, or inference. The design of these markets is challenged
by varying privacy preferences as well as data similarity among data owners.
Related works have often overlooked how data similarity impacts pricing and
data value through statistical information leakage. We demonstrate that data
similarity and privacy preferences are integral to market design and propose a
query-response protocol using local differential privacy for a two-party data
acquisition mechanism. In our regression data market model, we analyze
strategic interactions between privacy-aware owners and the learner as a
Stackelberg game over the asked price and privacy factor. Finally, we
numerically evaluate how data similarity affects market participation and
traded data value.
Related papers
- Constructing Data Transaction Chains Based on Opportunity Cost Exploration [9.353146025394372]
This paper compares data trading markets with traditional ones, focusing on how the replicability and privacy of data impact data markets.
We discuss how data's replicability fundamentally alters the concept of opportunity cost in traditional microeconomics within the context of data markets.
This paper outlines the constraints for data circulation within the privacy domain chain and presents a model that maximizes data's value under these constraints.
arXiv Detail & Related papers (2024-04-08T08:02:18Z) - Data Acquisition via Experimental Design for Decentralized Data Markets [25.300193837833426]
Data markets provide a way to increase the supply of data, particularly in data-scarce domains such as healthcare.
A major challenge for a data buyer in such a market is selecting the most valuable data points from a data seller.
We propose a federated approach to the data selection problem that is inspired by linear experimental design.
arXiv Detail & Related papers (2024-03-20T18:05:52Z) - Data Acquisition: A New Frontier in Data-centric AI [65.90972015426274]
We first present an investigation of current data marketplaces, revealing lack of platforms offering detailed information about datasets.
We then introduce the DAM challenge, a benchmark to model the interaction between the data providers and acquirers.
Our evaluation of the submitted strategies underlines the need for effective data acquisition strategies in Machine Learning.
arXiv Detail & Related papers (2023-11-22T22:15:17Z) - A Survey of Data Pricing for Data Marketplaces [77.3189288320768]
This paper attempts to comprehensively review the state-of-the-art on existing data pricing studies.
Our key contribution lies in a new taxonomy of data pricing studies that unifies different attributes determining data prices.
arXiv Detail & Related papers (2023-03-07T04:35:56Z) - Competition, Alignment, and Equilibria in Digital Marketplaces [97.03797129675951]
We study a duopoly market where platform actions are bandit algorithms and the two platforms compete for user participation.
Our main finding is that competition in this market does not perfectly align market outcomes with user utility.
arXiv Detail & Related papers (2022-08-30T17:43:58Z) - DP2-Pub: Differentially Private High-Dimensional Data Publication with
Invariant Post Randomization [58.155151571362914]
We propose a differentially private high-dimensional data publication mechanism (DP2-Pub) that runs in two phases.
splitting attributes into several low-dimensional clusters with high intra-cluster cohesion and low inter-cluster coupling helps obtain a reasonable privacy budget.
We also extend our DP2-Pub mechanism to the scenario with a semi-honest server which satisfies local differential privacy.
arXiv Detail & Related papers (2022-08-24T17:52:43Z) - Strategic Coalition for Data Pricing in IoT Data Markets [32.38170282930876]
This paper considers a market for trading Internet of Things (IoT) data that is used to train machine learning models.
The data is supplied to the market platform through a network and the price of such data is controlled based on the value it brings to the machine learning model.
arXiv Detail & Related papers (2022-06-15T19:48:10Z) - Data Sharing Markets [95.13209326119153]
We study a setup where each agent can be both buyer and seller of data.
We consider two cases: bilateral data exchange (trading data with data) and unilateral data exchange (trading data with money)
arXiv Detail & Related papers (2021-07-19T06:00:34Z) - Causally Constrained Data Synthesis for Private Data Release [36.80484740314504]
Using synthetic data which reflects certain statistical properties of the original data preserves the privacy of the original data.
Prior works utilize differentially private data release mechanisms to provide formal privacy guarantees.
We propose incorporating causal information into the training process to favorably modify the aforementioned trade-off.
arXiv Detail & Related papers (2021-05-27T13:46:57Z) - OSOUM Framework for Trading Data Research [79.0383470835073]
We supply, to the best of our knowledge, the first open source simulation platform, Open SOUrce Market Simulator (OSOUM) to analyze trading markets and specifically data markets.
We describe and implement a specific data market model, consisting of two types of agents: sellers who own various datasets available for acquisition, and buyers searching for relevant and beneficial datasets for purchase.
Although commercial frameworks, intended for handling data markets, already exist, we provide a free and extensive end-to-end research tool for simulating possible behavior for both buyers and sellers participating in (data) markets.
arXiv Detail & Related papers (2021-02-18T09:20:26Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.