AI's Regimes of Representation: A Community-centered Study of
Text-to-Image Models in South Asia
- URL: http://arxiv.org/abs/2305.11844v1
- Date: Fri, 19 May 2023 17:35:20 GMT
- Title: AI's Regimes of Representation: A Community-centered Study of
Text-to-Image Models in South Asia
- Authors: Rida Qadri, Renee Shelby, Cynthia L. Bennett, Emily Denton
- Abstract summary: We show how generative AI can reproduce an outsiders gaze for viewing South Asian cultures, shaped by global and regional power inequities.
We distill lessons for responsible development of T2I models, recommending concrete pathways forward.
- Score: 18.308417975842058
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: This paper presents a community-centered study of cultural limitations of
text-to-image (T2I) models in the South Asian context. We theorize these
failures using scholarship on dominant media regimes of representations and
locate them within participants' reporting of their existing social
marginalizations. We thus show how generative AI can reproduce an outsiders
gaze for viewing South Asian cultures, shaped by global and regional power
inequities. By centering communities as experts and soliciting their
perspectives on T2I limitations, our study adds rich nuance into existing
evaluative frameworks and deepens our understanding of the culturally-specific
ways AI technologies can fail in non-Western and Global South settings. We
distill lessons for responsible development of T2I models, recommending
concrete pathways forward that can allow for recognition of structural
inequalities.
Related papers
- Exploring the Boundaries of Content Moderation in Text-to-Image Generation [9.476463361600828]
This paper analyzes the community safety guidelines of five text-to-image (T2I) generation platforms and audits five T2I models.
We argue that the concept of safety is difficult to define and operationalize, reflected in a discrepancy between the officially published safety guidelines and the actual behavior of the T2I models.
arXiv Detail & Related papers (2024-09-09T18:37:08Z) - Do Generative AI Models Output Harm while Representing Non-Western Cultures: Evidence from A Community-Centered Approach [8.805524738976073]
This research investigates the impact of Generative Artificial Intelligence (GAI) models, specifically text-to-image generators (T2Is), on the representation of non-Western cultures.
arXiv Detail & Related papers (2024-07-20T07:01:37Z) - Beyond Aesthetics: Cultural Competence in Text-to-Image Models [34.98692829036475]
CUBE is a first-of-its-kind benchmark to evaluate cultural competence of Text-to-Image models.
CUBE covers cultural artifacts associated with 8 countries across different geo-cultural regions.
CUBE-CSpace is a larger dataset of cultural artifacts that serves as grounding to evaluate cultural diversity.
arXiv Detail & Related papers (2024-07-09T13:50:43Z) - Generative AI and Digital Neocolonialism in Global Education: Towards an Equitable Framework [0.5586073503694489]
This paper critically discusses how generative artificial intelligence (GenAI) might impose Western ideologies on non-Western societies.
It suggests strategies for local and global stakeholders to mitigate these effects.
arXiv Detail & Related papers (2024-06-05T05:43:55Z) - T-HITL Effectively Addresses Problematic Associations in Image
Generation and Maintains Overall Visual Quality [52.5529784801908]
We focus on addressing the generation of problematic associations between demographic groups and semantic concepts.
We propose a new methodology with twice-human-in-the-loop (T-HITL) that promises improvements in both reducing problematic associations and also maintaining visual quality.
arXiv Detail & Related papers (2024-02-27T00:29:33Z) - Language Models: A Guide for the Perplexed [51.88841610098437]
This tutorial aims to help narrow the gap between those who study language models and those who are intrigued and want to learn more.
We offer a scientific viewpoint that focuses on questions amenable to study through experimentation.
We situate language models as they are today in the context of the research that led to their development.
arXiv Detail & Related papers (2023-11-29T01:19:02Z) - Not All Countries Celebrate Thanksgiving: On the Cultural Dominance in
Large Language Models [89.94270049334479]
This paper identifies a cultural dominance issue within large language models (LLMs)
LLMs often provide inappropriate English-culture-related answers that are not relevant to the expected culture when users ask in non-English languages.
arXiv Detail & Related papers (2023-10-19T05:38:23Z) - R&B: Region and Boundary Aware Zero-shot Grounded Text-to-image
Generation [74.5598315066249]
We probe into zero-shot grounded T2I generation with diffusion models.
We propose a Region and Boundary (R&B) aware cross-attention guidance approach.
arXiv Detail & Related papers (2023-10-13T05:48:42Z) - Foundational Models Defining a New Era in Vision: A Survey and Outlook [151.49434496615427]
Vision systems to see and reason about the compositional nature of visual scenes are fundamental to understanding our world.
The models learned to bridge the gap between such modalities coupled with large-scale training data facilitate contextual reasoning, generalization, and prompt capabilities at test time.
The output of such models can be modified through human-provided prompts without retraining, e.g., segmenting a particular object by providing a bounding box, having interactive dialogues by asking questions about an image or video scene or manipulating the robot's behavior through language instructions.
arXiv Detail & Related papers (2023-07-25T17:59:18Z) - On the Cultural Gap in Text-to-Image Generation [75.69755281031951]
One challenge in text-to-image (T2I) generation is the inadvertent reflection of culture gaps present in the training data.
There is no benchmark to systematically evaluate a T2I model's ability to generate cross-cultural images.
We propose a Challenging Cross-Cultural (C3) benchmark with comprehensive evaluation criteria, which can assess how well-suited a model is to a target culture.
arXiv Detail & Related papers (2023-07-06T13:17:55Z) - Critical pedagogy in the implementation of educational technologies [0.0]
This paper presents a review of the challenges to the implementation of learning technologies in developing countries.
The research question is: what extent does education empower learners to be full participants in a socially democratic society?
arXiv Detail & Related papers (2020-05-30T12:00:03Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.