Exploring the Critical Success Factors for Data Democratization
- URL: http://arxiv.org/abs/2212.03059v1
- Date: Sun, 4 Dec 2022 22:07:45 GMT
- Title: Exploring the Critical Success Factors for Data Democratization
- Authors: Sasari Samarasinghe and Sachithra Lokuge
- Abstract summary: Data democratization is an ongoing process of broadening data access to employees.
This paper aims to identify the critical success factors for data democratization through an in-depth review of the literature.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: With the advent of the Data Age, organisations are constantly under pressure
to pay attention to the diffusion of data skills, data responsibilities, and
management of accessibility to data analysis tools for the technical as well as
non-technical employees. As such, in recent times, organisations are focusing
on data governance and management strategies such as data democratization. Data
democratization is an ongoing process of broadening data access to employees to
find, access, self-analyse, and share data by removing data silos. By
democratizing organisational data, organisations attempt to ensure that
employees can speak the language of data and empower them to use data
efficiently to improve their business functionalities. This paper aims to
identify the critical success factors for data democratization through an
in-depth review of the literature. Based on the findings of the analysis, nine
critical success factors were identified as successors of the data
democratization strategy.
Related papers
- Data-Centric AI in the Age of Large Language Models [51.20451986068925]
This position paper proposes a data-centric viewpoint of AI research, focusing on large language models (LLMs)
We make the key observation that data is instrumental in the developmental (e.g., pretraining and fine-tuning) and inferential stages (e.g., in-context learning) of LLMs.
We identify four specific scenarios centered around data, covering data-centric benchmarks and data curation, data attribution, knowledge transfer, and inference contextualization.
arXiv Detail & Related papers (2024-06-20T16:34:07Z) - Lazy Data Practices Harm Fairness Research [49.02318458244464]
We present a comprehensive analysis of fair ML datasets, demonstrating how unreflective practices hinder the reach and reliability of algorithmic fairness findings.
Our analyses identify three main areas of concern: (1) a textbflack of representation for certain protected attributes in both data and evaluations; (2) the widespread textbf of minorities during data preprocessing; and (3) textbfopaque data processing threatening the generalization of fairness research.
This study underscores the need for a critical reevaluation of data practices in fair ML and offers directions to improve both the sourcing and usage of datasets.
arXiv Detail & Related papers (2024-04-26T09:51:24Z) - A Survey on Data Selection for Language Models [148.300726396877]
Data selection methods aim to determine which data points to include in a training dataset.
Deep learning is mostly driven by empirical evidence and experimentation on large-scale data is expensive.
Few organizations have the resources for extensive data selection research.
arXiv Detail & Related papers (2024-02-26T18:54:35Z) - Data Acquisition: A New Frontier in Data-centric AI [65.90972015426274]
We first present an investigation of current data marketplaces, revealing lack of platforms offering detailed information about datasets.
We then introduce the DAM challenge, a benchmark to model the interaction between the data providers and acquirers.
Our evaluation of the submitted strategies underlines the need for effective data acquisition strategies in Machine Learning.
arXiv Detail & Related papers (2023-11-22T22:15:17Z) - Towards Avoiding the Data Mess: Industry Insights from Data Mesh Implementations [1.5029560229270191]
Data mesh is a socio-technical, decentralized, distributed concept for enterprise data management.
We conduct 15 semi-structured interviews with industry experts.
Our findings synthesize insights from industry experts and provide researchers and professionals with preliminary guidelines for the successful adoption of data mesh.
arXiv Detail & Related papers (2023-02-03T13:09:57Z) - Exploring Tenets of Data Democratization [0.0]
Data democratization is an ongoing process that broadens access to data and facilitates employees to find, access, self-analyze, and share data without additional support.
This paper explores the tenets of data democratization through an in-depth review of the literature.
The analysis identified twelve attributes that enable data democratization based on the literature review.
arXiv Detail & Related papers (2022-06-24T03:00:29Z) - Algorithmic Fairness Datasets: the Story so Far [68.45921483094705]
Data-driven algorithms are studied in diverse domains to support critical decisions, directly impacting people's well-being.
A growing community of researchers has been investigating the equity of existing algorithms and proposing novel ones, advancing the understanding of risks and opportunities of automated decision-making for historically disadvantaged populations.
Progress in fair Machine Learning hinges on data, which can be appropriately used only if adequately documented.
Unfortunately, the algorithmic fairness community suffers from a collective data documentation debt caused by a lack of information on specific resources (opacity) and scatteredness of available information (sparsity)
arXiv Detail & Related papers (2022-02-03T17:25:46Z) - Explainable Patterns: Going from Findings to Insights to Support Data
Analytics Democratization [60.18814584837969]
We present Explainable Patterns (ExPatt), a new framework to support lay users in exploring and creating data storytellings.
ExPatt automatically generates plausible explanations for observed or selected findings using an external (textual) source of information.
arXiv Detail & Related papers (2021-01-19T16:13:44Z) - Knowledge Scientists: Unlocking the data-driven organization [5.05432938384774]
We argue that the technologies for reliable data are driven by distinct concerns and expertise.
Those organizations which identify the central importance of meaningful, explainable, reproducible, and maintainable data will be at the forefront of the democratization of reliable data.
arXiv Detail & Related papers (2020-04-16T20:14:20Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.