AI's Regimes of Representation: A Community-centered Study of
Text-to-Image Models in South Asia
- URL: http://arxiv.org/abs/2305.11844v1
- Date: Fri, 19 May 2023 17:35:20 GMT
- Title: AI's Regimes of Representation: A Community-centered Study of
Text-to-Image Models in South Asia
- Authors: Rida Qadri, Renee Shelby, Cynthia L. Bennett, Emily Denton
- Abstract summary: We show how generative AI can reproduce an outsiders gaze for viewing South Asian cultures, shaped by global and regional power inequities.
We distill lessons for responsible development of T2I models, recommending concrete pathways forward.
- Score: 18.308417975842058
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: This paper presents a community-centered study of cultural limitations of
text-to-image (T2I) models in the South Asian context. We theorize these
failures using scholarship on dominant media regimes of representations and
locate them within participants' reporting of their existing social
marginalizations. We thus show how generative AI can reproduce an outsiders
gaze for viewing South Asian cultures, shaped by global and regional power
inequities. By centering communities as experts and soliciting their
perspectives on T2I limitations, our study adds rich nuance into existing
evaluative frameworks and deepens our understanding of the culturally-specific
ways AI technologies can fail in non-Western and Global South settings. We
distill lessons for responsible development of T2I models, recommending
concrete pathways forward that can allow for recognition of structural
inequalities.
Related papers
- The Case for "Thick Evaluations" of Cultural Representation in AI [11.53198252426806]
Generative AI image models have been increasingly evaluated for their (in)ability to represent non-Western cultures.
We argue that these evaluations operate through reductive ideals of representation, neglecting how people define their own representation.
We introduce the idea of 'thick evaluations': a more granular, situated, and discursive measurement framework for evaluating representations of social worlds in AI images.
arXiv Detail & Related papers (2025-03-24T19:01:14Z) - Mind the Gesture: Evaluating AI Sensitivity to Culturally Offensive Non-Verbal Gestures [67.89565097372751]
Gestures are an integral part of non-verbal communication, with meanings that vary across cultures.
As AI systems become more integrated into global applications, ensuring they do not inadvertently perpetuate cultural offenses is critical.
We introduce Multi-Cultural Set of Inappropriate Gestures and Nonverbal Signs (MC-SIGNS), a dataset of 288 gesture-country pairs annotated for offensiveness, cultural significance, and contextual factors.
arXiv Detail & Related papers (2025-02-24T23:10:08Z) - CultureVLM: Characterizing and Improving Cultural Understanding of Vision-Language Models for over 100 Countries [63.00147630084146]
Vision-language models (VLMs) have advanced human-AI interaction but struggle with cultural understanding.
CultureVerse is a large-scale multimodal benchmark covering 19, 682 cultural concepts, 188 countries/regions, 15 cultural concepts, and 3 question types.
We propose CultureVLM, a series of VLMs fine-tuned on our dataset to achieve significant performance improvement in cultural understanding.
arXiv Detail & Related papers (2025-01-02T14:42:37Z) - Exploring the Boundaries of Content Moderation in Text-to-Image Generation [9.476463361600828]
This paper analyzes the community safety guidelines of five text-to-image (T2I) generation platforms and audits five T2I models.
We argue that the concept of safety is difficult to define and operationalize, reflected in a discrepancy between the officially published safety guidelines and the actual behavior of the T2I models.
arXiv Detail & Related papers (2024-09-09T18:37:08Z) - Do Generative AI Models Output Harm while Representing Non-Western Cultures: Evidence from A Community-Centered Approach [8.805524738976073]
This research investigates the impact of Generative Artificial Intelligence (GAI) models, specifically text-to-image generators (T2Is), on the representation of non-Western cultures.
arXiv Detail & Related papers (2024-07-20T07:01:37Z) - Beyond Aesthetics: Cultural Competence in Text-to-Image Models [34.98692829036475]
CUBE is a first-of-its-kind benchmark to evaluate cultural competence of Text-to-Image models.
CUBE covers cultural artifacts associated with 8 countries across different geo-cultural regions.
CUBE-CSpace is a larger dataset of cultural artifacts that serves as grounding to evaluate cultural diversity.
arXiv Detail & Related papers (2024-07-09T13:50:43Z) - Generative AI and Digital Neocolonialism in Global Education: Towards an Equitable Framework [0.5586073503694489]
This paper critically discusses how generative artificial intelligence (GenAI) might impose Western ideologies on non-Western societies.
It suggests strategies for local and global stakeholders to mitigate these effects.
arXiv Detail & Related papers (2024-06-05T05:43:55Z) - T-HITL Effectively Addresses Problematic Associations in Image
Generation and Maintains Overall Visual Quality [52.5529784801908]
We focus on addressing the generation of problematic associations between demographic groups and semantic concepts.
We propose a new methodology with twice-human-in-the-loop (T-HITL) that promises improvements in both reducing problematic associations and also maintaining visual quality.
arXiv Detail & Related papers (2024-02-27T00:29:33Z) - Language Models: A Guide for the Perplexed [51.88841610098437]
This tutorial aims to help narrow the gap between those who study language models and those who are intrigued and want to learn more.
We offer a scientific viewpoint that focuses on questions amenable to study through experimentation.
We situate language models as they are today in the context of the research that led to their development.
arXiv Detail & Related papers (2023-11-29T01:19:02Z) - Not All Countries Celebrate Thanksgiving: On the Cultural Dominance in
Large Language Models [89.94270049334479]
This paper identifies a cultural dominance issue within large language models (LLMs)
LLMs often provide inappropriate English-culture-related answers that are not relevant to the expected culture when users ask in non-English languages.
arXiv Detail & Related papers (2023-10-19T05:38:23Z) - R&B: Region and Boundary Aware Zero-shot Grounded Text-to-image
Generation [74.5598315066249]
We probe into zero-shot grounded T2I generation with diffusion models.
We propose a Region and Boundary (R&B) aware cross-attention guidance approach.
arXiv Detail & Related papers (2023-10-13T05:48:42Z) - Foundational Models Defining a New Era in Vision: A Survey and Outlook [151.49434496615427]
Vision systems to see and reason about the compositional nature of visual scenes are fundamental to understanding our world.
The models learned to bridge the gap between such modalities coupled with large-scale training data facilitate contextual reasoning, generalization, and prompt capabilities at test time.
The output of such models can be modified through human-provided prompts without retraining, e.g., segmenting a particular object by providing a bounding box, having interactive dialogues by asking questions about an image or video scene or manipulating the robot's behavior through language instructions.
arXiv Detail & Related papers (2023-07-25T17:59:18Z) - On the Cultural Gap in Text-to-Image Generation [75.69755281031951]
One challenge in text-to-image (T2I) generation is the inadvertent reflection of culture gaps present in the training data.
There is no benchmark to systematically evaluate a T2I model's ability to generate cross-cultural images.
We propose a Challenging Cross-Cultural (C3) benchmark with comprehensive evaluation criteria, which can assess how well-suited a model is to a target culture.
arXiv Detail & Related papers (2023-07-06T13:17:55Z) - Critical pedagogy in the implementation of educational technologies [0.0]
This paper presents a review of the challenges to the implementation of learning technologies in developing countries.
The research question is: what extent does education empower learners to be full participants in a socially democratic society?
arXiv Detail & Related papers (2020-05-30T12:00:03Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.