A Survey of Data Marketplaces and Their Business Models
- URL: http://arxiv.org/abs/2201.04561v1
- Date: Tue, 11 Jan 2022 12:27:37 GMT
- Title: A Survey of Data Marketplaces and Their Business Models
- Authors: Santiago Andr\'es Azcoitia and Nikolaos Laoutaris
- Abstract summary: "Data" is becoming an indispensable production factor, just like land, infrastructure, labor or capital.
Tasks ranging from automating certain functions to facilitating decision-making in data-driven organizations increasingly benefit from acquiring data inputs from third parties.
New entities and novel business models have appeared with the aim of matching such data requirements with the right providers.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: "Data" is becoming an indispensable production factor, just like land,
infrastructure, labor or capital. As part of this, a myriad of applications in
different sectors require huge amounts of information to feed models and
algorithms responsible for critical roles in production chains and business
processes. Tasks ranging from automating certain functions to facilitating
decision-making in data-driven organizations increasingly benefit from
acquiring data inputs from third parties. Responding to this demand, new
entities and novel business models have appeared with the aim of matching such
data requirements with the right providers and facilitating the exchange of
information. In this paper, we present the results and conclusions of a
comprehensive survey on the state of the art of entities trading data on the
internet, as well as novel data marketplace designs from the research
community.
Related papers
- Blockchain-Enabled Accountability in Data Supply Chain: A Data Bill of Materials Approach [16.31469678670097]
We introduce Data Bill of Materials" (DataBOM) to capture the dependency relationship between different datasets and stakeholders by storing specific metadata.
We demonstrate a platform architecture for providing blockchain-based DataBOM services, present the interaction protocol for stakeholders, and discuss the minimal requirements for DataBOM metadata.
arXiv Detail & Related papers (2024-08-16T05:34:50Z) - Data-Centric AI in the Age of Large Language Models [51.20451986068925]
This position paper proposes a data-centric viewpoint of AI research, focusing on large language models (LLMs)
We make the key observation that data is instrumental in the developmental (e.g., pretraining and fine-tuning) and inferential stages (e.g., in-context learning) of LLMs.
We identify four specific scenarios centered around data, covering data-centric benchmarks and data curation, data attribution, knowledge transfer, and inference contextualization.
arXiv Detail & Related papers (2024-06-20T16:34:07Z) - A Survey on Data Selection for Language Models [148.300726396877]
Data selection methods aim to determine which data points to include in a training dataset.
Deep learning is mostly driven by empirical evidence and experimentation on large-scale data is expensive.
Few organizations have the resources for extensive data selection research.
arXiv Detail & Related papers (2024-02-26T18:54:35Z) - Data Acquisition: A New Frontier in Data-centric AI [65.90972015426274]
We first present an investigation of current data marketplaces, revealing lack of platforms offering detailed information about datasets.
We then introduce the DAM challenge, a benchmark to model the interaction between the data providers and acquirers.
Our evaluation of the submitted strategies underlines the need for effective data acquisition strategies in Machine Learning.
arXiv Detail & Related papers (2023-11-22T22:15:17Z) - Beyond Privacy: Navigating the Opportunities and Challenges of Synthetic
Data [91.52783572568214]
Synthetic data may become a dominant force in the machine learning world, promising a future where datasets can be tailored to individual needs.
We discuss which fundamental challenges the community needs to overcome for wider relevance and application of synthetic data.
arXiv Detail & Related papers (2023-04-07T16:38:40Z) - A Survey of Data Pricing for Data Marketplaces [77.3189288320768]
This paper attempts to comprehensively review the state-of-the-art on existing data pricing studies.
Our key contribution lies in a new taxonomy of data pricing studies that unifies different attributes determining data prices.
arXiv Detail & Related papers (2023-03-07T04:35:56Z) - Data Cards: Purposeful and Transparent Dataset Documentation for
Responsible AI [0.0]
We propose Data Cards for fostering transparent, purposeful and human-centered documentation of datasets.
Data Cards are structured summaries of essential facts about various aspects of ML datasets needed by stakeholders.
We present frameworks that ground Data Cards in real-world utility and human-centricity.
arXiv Detail & Related papers (2022-04-03T13:49:36Z) - OSOUM Framework for Trading Data Research [79.0383470835073]
We supply, to the best of our knowledge, the first open source simulation platform, Open SOUrce Market Simulator (OSOUM) to analyze trading markets and specifically data markets.
We describe and implement a specific data market model, consisting of two types of agents: sellers who own various datasets available for acquisition, and buyers searching for relevant and beneficial datasets for purchase.
Although commercial frameworks, intended for handling data markets, already exist, we provide a free and extensive end-to-end research tool for simulating possible behavior for both buyers and sellers participating in (data) markets.
arXiv Detail & Related papers (2021-02-18T09:20:26Z) - Ensuring the Robustness and Reliability of Data-Driven Knowledge
Discovery Models in Production and Manufacturing [0.0]
Cross-industry standard process for data mining (CRISP-DM) designed to address data- and model-related issues.
This paper reviews several extensions of CRISP-DM models and various data-robustness-- and model-robustness-related problems in machine learning.
arXiv Detail & Related papers (2020-07-28T14:21:14Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.