In Consideration of Indigenous Data Sovereignty: Data Mining as a
Colonial Practice
- URL: http://arxiv.org/abs/2309.10215v1
- Date: Tue, 19 Sep 2023 00:00:35 GMT
- Title: In Consideration of Indigenous Data Sovereignty: Data Mining as a
Colonial Practice
- Authors: Jennafer Shae Roberts and Laura N Montoya
- Abstract summary: This research stresses the need for the inclusion of Indigenous Data Sovereignty.
To support this hypothesis and address the problem, the CARE Principles for Indigenous Data Governance are applied.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Data mining reproduces colonialism, and Indigenous voices are being left out
of the development of technology that relies on data, such as artificial
intelligence. This research stresses the need for the inclusion of Indigenous
Data Sovereignty and centers on the importance of Indigenous rights over their
own data. Inclusion is necessary in order to integrate Indigenous knowledge
into the design, development, and implementation of data-reliant technology. To
support this hypothesis and address the problem, the CARE Principles for
Indigenous Data Governance (Collective Benefit, Authority to Control,
Responsibility, and Ethics) are applied. We cover how the colonial practices of
data mining do not align with Indigenous convictions. The included case studies
highlight connections to Indigenous rights in relation to the protection of
data and environmental ecosystems, thus establishing how data governance can
serve both the people and the Earth. By applying the CARE Principles to the
issues that arise from data mining and neocolonialism, our goal is to provide a
framework that can be used in technological development. The theory is that
this could reflect outwards to promote data sovereignty generally and create
new relationships between people and data that are ethical as opposed to driven
by speed and profit.
Related papers
- Building Better: Avoiding Pitfalls in Developing Language Resources when Data is Scarce [27.918975040084387]
Data in a given language should be viewed as more than a collection of tokens.
Good data collection and labeling practices are key to building more human-centered and socially aware technologies.
arXiv Detail & Related papers (2024-10-16T15:51:18Z) - Data-Centric AI in the Age of Large Language Models [51.20451986068925]
This position paper proposes a data-centric viewpoint of AI research, focusing on large language models (LLMs)
We make the key observation that data is instrumental in the developmental (e.g., pretraining and fine-tuning) and inferential stages (e.g., in-context learning) of LLMs.
We identify four specific scenarios centered around data, covering data-centric benchmarks and data curation, data attribution, knowledge transfer, and inference contextualization.
arXiv Detail & Related papers (2024-06-20T16:34:07Z) - A Material Lens on Coloniality in NLP [57.63027898794855]
Coloniality is the continuation of colonial harms beyond "official" colonization.
We argue that coloniality is implicitly embedded in and amplified by NLP data, algorithms, and software.
arXiv Detail & Related papers (2023-11-14T18:52:09Z) - On Responsible Machine Learning Datasets with Fairness, Privacy, and Regulatory Norms [56.119374302685934]
There have been severe concerns over the trustworthiness of AI technologies.
Machine and deep learning algorithms depend heavily on the data used during their development.
We propose a framework to evaluate the datasets through a responsible rubric.
arXiv Detail & Related papers (2023-10-24T14:01:53Z) - EU law and emotion data [0.0]
Article sheds light on legal implications and challenges surrounding emotion data processing within the EU's legal framework.
We discuss the nuances of different approaches to affective computing and their relevance to the processing of special data.
We highlight some of the consequences, including harm, that processing of emotion data may have for individuals concerned.
arXiv Detail & Related papers (2023-09-19T17:25:02Z) - The Use of Synthetic Data to Train AI Models: Opportunities and Risks
for Sustainable Development [0.6906005491572401]
This paper investigates the policies governing the creation, utilization, and dissemination of synthetic data.
A well crafted synthetic data policy must strike a balance between privacy concerns and the utility of data.
arXiv Detail & Related papers (2023-08-31T23:18:53Z) - Human-Centric Multimodal Machine Learning: Recent Advances and Testbed
on AI-based Recruitment [66.91538273487379]
There is a certain consensus about the need to develop AI applications with a Human-Centric approach.
Human-Centric Machine Learning needs to be developed based on four main requirements: (i) utility and social good; (ii) privacy and data ownership; (iii) transparency and accountability; and (iv) fairness in AI-driven decision-making processes.
We study how current multimodal algorithms based on heterogeneous sources of information are affected by sensitive elements and inner biases in the data.
arXiv Detail & Related papers (2023-02-13T16:44:44Z) - Data-centric AI: Perspectives and Challenges [51.70828802140165]
Data-centric AI (DCAI) advocates a fundamental shift from model advancements to ensuring data quality and reliability.
We bring together three general missions: training data development, inference data development, and data maintenance.
arXiv Detail & Related papers (2023-01-12T05:28:59Z) - Decolonisation, Global Data Law, and Indigenous Data Sovereignty [0.0]
This research examines the impact of digital neo-colonialism on the Global South.
It encourages the development of legal and economic incentives to protect Indigenous cultures globally.
arXiv Detail & Related papers (2022-07-28T23:07:02Z) - Representative & Fair Synthetic Data [68.8204255655161]
We present a framework to incorporate fairness constraints into the self-supervised learning process.
We generate a representative as well as fair version of the UCI Adult census data set.
We consider representative & fair synthetic data a promising future building block to teach algorithms not on historic worlds, but rather on the worlds that we strive to live in.
arXiv Detail & Related papers (2021-04-07T09:19:46Z) - Decolonial AI: Decolonial Theory as Sociotechnical Foresight in
Artificial Intelligence [6.202104487394354]
This paper explores the role of critical science in understanding and shaping the ongoing advances in artificial intelligence.
Decolonial theories use historical hindsight to explain patterns of power that shape our intellectual, political, economic, and social world.
We highlight problematic applications that are instances of coloniality, and using a decolonial lens, submit three tactics that can form a decolonial field of artificial intelligence.
arXiv Detail & Related papers (2020-07-08T12:36:21Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.