Related papers: Global AI Bias Audit for Technical Governance

URL: http://arxiv.org/abs/2602.13246v1
Date: Sun, 01 Feb 2026 19:45:21 GMT
Title: Global AI Bias Audit for Technical Governance
Authors: Jason Hung
Abstract summary: This paper presents the outputs of the exploratory phase of a global audit of Large Language Models (LLMs) project. I used the Global AI Dataset (GAID) Project as a framework to stress-test the Llama-3 8B model and evaluate geographic and socioeconomic biases in technical AI governance awareness. The findings reveal that AI's technical knowledge is heavily concentrated in higher-income regions, while lower-income countries from the Global South are subject to disproportionate systemic information gaps.
Score: 0.0
License: http://creativecommons.org/licenses/by/4.0/
Abstract: This paper presents the outputs of the exploratory phase of a global audit of Large Language Models (LLMs) project. In this exploratory phase, I used the Global AI Dataset (GAID) Project as a framework to stress-test the Llama-3 8B model and evaluate geographic and socioeconomic biases in technical AI governance awareness. By stress-testing the model with 1,704 queries across 213 countries and eight technical metrics, I identified a significant digital barrier and gap separating the Global North and South. The results indicate that the model was only able to provide number/fact responses in 11.4% of its query answers, where the empirical validity of such responses was yet to be verified. The findings reveal that AI's technical knowledge is heavily concentrated in higher-income regions, while lower-income countries from the Global South are subject to disproportionate systemic information gaps. This disparity between the Global North and South poses concerning risks for global AI safety and inclusive governance, as policymakers in underserved regions may lack reliable data-driven insights or be misled by hallucinated facts. This paper concludes that current AI alignment and training processes reinforce existing geoeconomic and geopolitical asymmetries, and urges the need for more inclusive data representation to ensure AI serves as a truly global resource.
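The query count in the abstract follows from a full country-by-metric grid: 213 countries times eight metrics gives 1,704 queries. The sketch below reproduces only that arithmetic and the approximate number of answers implied by the reported 11.4% share. The country and metric names are hypothetical placeholders (the abstract lists neither), and the code is not the paper's actual audit pipeline.

```python
from itertools import product

# Counts taken from the abstract.
N_COUNTRIES = 213
N_METRICS = 8

# Hypothetical placeholder labels; the real GAID country and metric
# names are not given in the abstract.
countries = [f"country_{i:03d}" for i in range(1, N_COUNTRIES + 1)]
metrics = [f"metric_{j}" for j in range(1, N_METRICS + 1)]


def build_query_grid(countries, metrics):
    """Return one (country, metric) pair per query."""
    return list(product(countries, metrics))


queries = build_query_grid(countries, metrics)
total_queries = len(queries)

# Share of answers reported as containing a number or fact (11.4%).
REPORTED_NUMERIC_SHARE = 0.114
approx_numeric_answers = round(total_queries * REPORTED_NUMERIC_SHARE)

print(total_queries)           # 1704
print(approx_numeric_answers)  # 194 (rounded from 194.256)
```

The figure of roughly 194 answers is derived here from the reported percentage and is not stated in the abstract itself.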

Related papers

The Invisibility Hypothesis: Promises of AGI and the Future of the Global South [2.0390620658820433]
We argue that the availability of highly autonomous, general-purpose cognitive systems does not guarantee equitable outcomes. In the best case, geographic location is no longer relevant as AGI fully democratizes access to knowledge and essential services for everyone in the globe. In the worst case, existing structural constraints are severely amplified, rendering already marginalized populations functionally irrelevant to global systems.
arXiv Detail & Related papers (2026-03-02T08:46:18Z)
The California Report on Frontier AI Policy [110.35302787349856]
Continued progress in frontier AI carries the potential for profound advances in scientific discovery, economic productivity, and broader social well-being. As the epicenter of global AI innovation, California has a unique opportunity to continue supporting developments in frontier AI. The report derives policy principles that can inform how California approaches the use, assessment, and governance of frontier AI.
arXiv Detail & Related papers (2025-06-17T23:33:21Z)
Identifying Trustworthiness Challenges in Deep Learning Models for Continental-Scale Water Quality Prediction [69.38041171537573]
Water quality is foundational to environmental sustainability, ecosystem resilience, and public health. Deep learning offers transformative potential for large-scale water quality prediction and scientific insights generation. Their widespread adoption in high-stakes operational decision-making, such as pollution mitigation and equitable resource allocation, is prevented by unresolved trustworthiness challenges.
arXiv Detail & Related papers (2025-03-13T01:50:50Z)
Bridging the Data Provenance Gap Across Text, Speech and Video [67.72097952282262]
We conduct the largest and first-of-its-kind longitudinal audit across modalities of popular text, speech, and video datasets. Our manual analysis covers nearly 4000 public datasets between 1990-2024, spanning 608 languages, 798 sources, 659 organizations, and 67 countries. We find that multimodal machine learning applications have overwhelmingly turned to web-crawled, synthetic, and social media platforms, such as YouTube, for their training sets.
arXiv Detail & Related papers (2024-12-19T01:30:19Z)
An evidence-based methodology for human rights impact assessment (HRIA) in the development of AI data-intensive systems [49.1574468325115]
We show that human rights already underpin the decisions in the field of data use. This work presents a methodology and a model for a Human Rights Impact Assessment (HRIA). The proposed methodology is tested in concrete case-studies to prove its feasibility and effectiveness.
arXiv Detail & Related papers (2024-07-30T16:27:52Z)
Global-Liar: Factuality of LLMs over Time and Geographic Regions [3.715487408753612]
This study evaluates the factual accuracy, stability, and biases in widely adopted GPT models, including GPT-3.5 and GPT-4. We introduce 'Global-Liar,' a dataset uniquely balanced in terms of geographic and temporal representation.
arXiv Detail & Related papers (2024-01-31T13:57:24Z)
The Role of Large Language Models in the Recognition of Territorial Sovereignty: An Analysis of the Construction of Legitimacy [67.44950222243865]
We argue that technology tools like Google Maps and Large Language Models (LLM) are often perceived as impartial and objective. We highlight the case of three controversial territories: Crimea, West Bank and Transnistria, by comparing the responses of ChatGPT against Wikipedia information and United Nations resolutions.
arXiv Detail & Related papers (2023-03-17T08:46:49Z)
Jalisco's multiclass land cover analysis and classification using a novel lightweight convnet with real-world multispectral and relief data [51.715517570634994]
We present our novel lightweight (only 89k parameters) Convolution Neural Network (ConvNet) to make LC classification and analysis. In this work, we combine three real-world open data sources to obtain 13 channels. Our embedded analysis anticipates the limited performance in some classes and gives us the opportunity to group the most similar.
arXiv Detail & Related papers (2022-01-26T14:58:51Z)
Artificial Intelligence in the Global South (AI4D): Potential and Risks [0.0]
Artificial intelligence is becoming more widely available in all parts of the world. This paper examines the key issues and questions arising in the emerging sub-field of AI for global development (AI4D). We propose that although there are many risks associated with the use of AI, the potential benefits are enough to warrant detailed research and investigation of the most appropriate and effective ways to design, develop, implement, and use such technologies in the Global South.
arXiv Detail & Related papers (2021-08-23T11:48:31Z)
Artificial Intelligence Ethics: An Inclusive Global Discourse? [0.9208007322096533]
This research examines the growing body of documentation on AI ethics. It seeks to discover if both countries in the Global South and women are underrepresented in this discourse. Findings indicate a dearth of references to both of these themes in the AI ethics documents. Without adequate input from both countries in the Global South and from women, such ethical frameworks and standards may be discriminatory.
arXiv Detail & Related papers (2021-08-23T06:08:00Z)
AI in the "Real World": Examining the Impact of AI Deployment in Low-Resource Contexts [1.90365714903665]
This paper examines the deployment of AI by large industry labs situated in low-resource contexts. It highlights factors impacting unanticipated deployments, and reflects on the state of AI deployment within the Global South.
arXiv Detail & Related papers (2020-11-28T01:49:24Z)

This list is automatically generated from the titles and abstracts of the papers on this site.