Data Justice in Practice: A Guide for Developers
- URL: http://arxiv.org/abs/2205.01037v1
- Date: Tue, 12 Apr 2022 09:33:14 GMT
- Title: Data Justice in Practice: A Guide for Developers
- Authors: David Leslie, Michael Katell, Mhairi Aitken, Jatinder Singh, Morgan
Briggs, Rosamund Powell, Cami Rinc\'on, Antonella Perini, Smera Jayadeva, and
Christopher Burr
- Abstract summary: The Advancing Data Justice Research and Practice project aims to broaden understanding of the social, historical, cultural, political, and economic forces that contribute to discrimination and inequity in contemporary ecologies of data collection, governance, and use.
This is the consultation draft of a guide for developers and organisations, which are producing, procuring, or using data-intensive technologies.
- Score: 2.5953185061765884
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: The Advancing Data Justice Research and Practice project aims to broaden
understanding of the social, historical, cultural, political, and economic
forces that contribute to discrimination and inequity in contemporary ecologies
of data collection, governance, and use. This is the consultation draft of a
guide for developers and organisations, which are producing, procuring, or
using data-intensive technologies.In the first section, we introduce the field
of data justice, from its early discussions to more recent proposals to
relocate understandings of what data justice means. This section includes a
description of the six pillars of data justice around which this guidance
revolves. Next, to support developers in designing, developing, and deploying
responsible and equitable data-intensive and AI/ML systems, we outline the
AI/ML project lifecycle through a sociotechnical lens. To support the
operationalisation data justice throughout the entirety of the AI/ML lifecycle
and within data innovation ecosystems, we then present five overarching
principles of responsible, equitable, and trustworthy data research and
innovation practices, the SAFE-D principles-Safety, Accountability, Fairness,
Explainability, and Data Quality, Integrity, Protection, and Privacy. The final
section presents guiding questions that will help developers both address data
justice issues throughout the AI/ML lifecycle and engage in reflective
innovation practices that ensure the design, development, and deployment of
responsible and equitable data-intensive and AI/ML systems.
Related papers
- Human-Data Interaction Framework: A Comprehensive Model for a Future Driven by Data and Humans [0.0]
The Human-Data Interaction (HDI) framework has become an essential approach to tackling the challenges and ethical issues associated with data governance and utilization in the modern digital world.
This paper outlines the fundamental steps required for organizations to seamlessly integrate HDI principles.
arXiv Detail & Related papers (2024-07-30T17:57:09Z) - Data Issues in Industrial AI System: A Meta-Review and Research Strategy [10.540603300770885]
Artificial intelligence (AI) is assuming an increasingly pivotal role within industrial systems.
Despite the recent trend within various industries to adopt AI, the actual adoption of AI is not as developed as perceived.
How to address these data issues stands as a significant concern confronting both industry and academia.
arXiv Detail & Related papers (2024-06-22T08:36:59Z) - Data-Centric AI in the Age of Large Language Models [51.20451986068925]
This position paper proposes a data-centric viewpoint of AI research, focusing on large language models (LLMs)
We make the key observation that data is instrumental in the developmental (e.g., pretraining and fine-tuning) and inferential stages (e.g., in-context learning) of LLMs.
We identify four specific scenarios centered around data, covering data-centric benchmarks and data curation, data attribution, knowledge transfer, and inference contextualization.
arXiv Detail & Related papers (2024-06-20T16:34:07Z) - Towards Responsible AI in Banking: Addressing Bias for Fair
Decision-Making [69.44075077934914]
"Responsible AI" emphasizes the critical nature of addressing biases within the development of a corporate culture.
This thesis is structured around three fundamental pillars: understanding bias, mitigating bias, and accounting for bias.
In line with open-source principles, we have released Bias On Demand and FairView as accessible Python packages.
arXiv Detail & Related papers (2024-01-13T14:07:09Z) - On Responsible Machine Learning Datasets with Fairness, Privacy, and Regulatory Norms [56.119374302685934]
There have been severe concerns over the trustworthiness of AI technologies.
Machine and deep learning algorithms depend heavily on the data used during their development.
We propose a framework to evaluate the datasets through a responsible rubric.
arXiv Detail & Related papers (2023-10-24T14:01:53Z) - EALM: Introducing Multidimensional Ethical Alignment in Conversational
Information Retrieval [43.72331337131317]
We introduce a workflow that integrates ethical alignment with an initial ethical judgment stage for efficient data screening.
We present the QA-ETHICS dataset adapted from the ETHICS benchmark, which serves as an evaluation tool by unifying scenarios and label meanings.
In addition, we suggest a new approach that achieves top performance in both binary and multi-label ethical judgment tasks.
arXiv Detail & Related papers (2023-10-02T08:22:34Z) - Applying Standards to Advance Upstream & Downstream Ethics in Large
Language Models [0.0]
This paper explores how AI-owners can develop safeguards for AI-generated content.
It draws from established codes of conduct and ethical standards in other content-creation industries.
arXiv Detail & Related papers (2023-06-06T08:47:42Z) - Data-centric AI: Perspectives and Challenges [51.70828802140165]
Data-centric AI (DCAI) advocates a fundamental shift from model advancements to ensuring data quality and reliability.
We bring together three general missions: training data development, inference data development, and data maintenance.
arXiv Detail & Related papers (2023-01-12T05:28:59Z) - DataPerf: Benchmarks for Data-Centric AI Development [81.03754002516862]
DataPerf is a community-led benchmark suite for evaluating ML datasets and data-centric algorithms.
We provide an open, online platform with multiple rounds of challenges to support this iterative development.
The benchmarks, online evaluation platform, and baseline implementations are open source.
arXiv Detail & Related papers (2022-07-20T17:47:54Z) - Advancing Data Justice Research and Practice: An Integrated Literature
Review [2.454361535046896]
The Advancing Data Justice Research and Practice (ADJRP) project aims to widen the lens of current thinking around data justice.
This integrated literature review lays the conceptual groundwork needed to support this aspiration.
arXiv Detail & Related papers (2022-04-06T21:09:27Z) - Trustworthy AI [75.99046162669997]
Brittleness to minor adversarial changes in the input data, ability to explain the decisions, address the bias in their training data, are some of the most prominent limitations.
We propose the tutorial on Trustworthy AI to address six critical issues in enhancing user and public trust in AI systems.
arXiv Detail & Related papers (2020-11-02T20:04:18Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.