Machine learning as a model for cultural learning: Teaching an algorithm
what it means to be fat
- URL: http://arxiv.org/abs/2003.12133v2
- Date: Sat, 13 Jun 2020 22:58:22 GMT
- Title: Machine learning as a model for cultural learning: Teaching an algorithm
what it means to be fat
- Authors: Alina Arseniev-Koehler and Jacob G. Foster
- Abstract summary: We show that neural word embeddings provide a parsimonious and cognitively plausible model of the representations learned from natural language.
We identify several cultural schemata that link obesity to gender, immorality, poor health, and low socioeconomic class.
Our findings reinforce ongoing concerns that machine learning can also encode, and reproduce, harmful human biases.
- Score: 2.0305676256390934
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: As we navigate our cultural environment, we learn cultural biases, like those
around gender, social class, health, and body weight. It is unclear, however,
exactly how public culture becomes private culture. In this paper, we provide a
theoretical account of such cultural learning. We propose that neural word
embeddings provide a parsimonious and cognitively plausible model of the
representations learned from natural language. Using neural word embeddings, we
extract cultural schemata about body weight from New York Times articles. We
identify several cultural schemata that link obesity to gender, immorality,
poor health, and low socioeconomic class. Such schemata may be subtly but
pervasively activated in public culture; thus, language can chronically
reproduce biases. Our findings reinforce ongoing concerns that machine learning
can also encode, and reproduce, harmful human biases.
Related papers
- CARE: Aligning Language Models for Regional Cultural Awareness [28.676469530858924]
Existing language models (LMs) often exhibit a Western-centric bias and struggle to represent diverse cultural knowledge.
Previous attempts to address this rely on synthetic data and express cultural knowledge only in English.
We first introduce CARE, a multilingual resource of 24.1k responses with human preferences on 2,580 questions about Chinese and Arab cultures.
arXiv Detail & Related papers (2025-04-07T14:57:06Z) - Attributing Culture-Conditioned Generations to Pretraining Corpora [26.992883552982335]
We analyze how models associate entities with cultures based on pretraining data patterns.
We find that high-frequency cultures in pretraining data yield more generations with memorized symbols, while some low-frequency cultures produce none.
arXiv Detail & Related papers (2024-12-30T07:09:25Z) - CulturePark: Boosting Cross-cultural Understanding in Large Language Models [63.452948673344395]
This paper introduces CulturePark, an LLM-powered multi-agent communication framework for cultural data collection.
It generates high-quality cross-cultural dialogues encapsulating human beliefs, norms, and customs.
We evaluate these models across three downstream tasks: content moderation, cultural alignment, and cultural education.
arXiv Detail & Related papers (2024-05-24T01:49:02Z) - CultureBank: An Online Community-Driven Knowledge Base Towards Culturally Aware Language Technologies [53.2331634010413]
CultureBank is a knowledge base built upon users' self-narratives.
It contains 12K cultural descriptors sourced from TikTok and 11K from Reddit.
We offer recommendations for future culturally aware language technologies.
arXiv Detail & Related papers (2024-04-23T17:16:08Z) - CULTURE-GEN: Revealing Global Cultural Perception in Language Models through Natural Language Prompting [73.94059188347582]
We uncover culture perceptions of three SOTA models on 110 countries and regions on 8 culture-related topics through culture-conditioned generations.
We discover that culture-conditioned generation consist of linguistic "markers" that distinguish marginalized cultures apart from default cultures.
arXiv Detail & Related papers (2024-04-16T00:50:43Z) - Massively Multi-Cultural Knowledge Acquisition & LM Benchmarking [48.21982147529661]
This paper introduces a novel approach for massively multicultural knowledge acquisition.
Our method strategically navigates from densely informative Wikipedia documents on cultural topics to an extensive network of linked pages.
Our work marks an important step towards deeper understanding and bridging the gaps of cultural disparities in AI.
arXiv Detail & Related papers (2024-02-14T18:16:54Z) - Not All Countries Celebrate Thanksgiving: On the Cultural Dominance in
Large Language Models [89.94270049334479]
This paper identifies a cultural dominance issue within large language models (LLMs)
LLMs often provide inappropriate English-culture-related answers that are not relevant to the expected culture when users ask in non-English languages.
arXiv Detail & Related papers (2023-10-19T05:38:23Z) - Towards Intercultural Affect Recognition: Audio-Visual Affect
Recognition in the Wild Across Six Cultures [15.37706529432381]
We present the first systematic study of intercultural affect recognition models using videos of real-world dyadic interactions from six cultures.
Across all six cultures, our findings demonstrate that intercultural affect recognition models were as effective or more effective than intracultural models.
Our paper presents a proof-of-concept and motivation for the future development of intercultural affect recognition systems.
arXiv Detail & Related papers (2022-07-31T02:39:17Z) - StereoKG: Data-Driven Knowledge Graph Construction for Cultural
Knowledge and Stereotypes [17.916919837253108]
We present a fully data-driven pipeline for generating a knowledge graph (KG) of cultural knowledge and stereotypes.
Our resulting KG covers 5 religious groups and 5 nationalities and can easily be extended to include more entities.
arXiv Detail & Related papers (2022-05-27T15:09:56Z) - Learning Robust Real-Time Cultural Transmission without Human Data [82.05222093231566]
We provide a method for generating zero-shot, high recall cultural transmission in artificially intelligent agents.
Our agents succeed at real-time cultural transmission from humans in novel contexts without using any pre-collected human data.
This paves the way for cultural evolution as an algorithm for developing artificial general intelligence.
arXiv Detail & Related papers (2022-03-01T19:32:27Z) - Growing knowledge culturally across generations to solve novel, complex
tasks [29.579223105173217]
We take a first step towards reverse-engineering cultural learning through language.
We develop a suite of complex high-stakes tasks in the form of minimalist-style video games.
Knowledge accumulated gradually across generations, allowing later generations to advance further in the games.
arXiv Detail & Related papers (2021-07-28T14:09:40Z) - Deception detection in text and its relation to the cultural dimension
of individualism/collectivism [6.17866386107486]
We investigate if differences in the usage of specific linguistic features of deception across cultures can be confirmed and attributed to norms in respect to the individualism/collectivism divide.
We create culture/language-aware classifiers by experimenting with a wide range of n-gram features based on phonology, morphology and syntax.
We conducted our experiments over 11 datasets from 5 languages i.e., English, Dutch, Russian, Spanish and Romanian, from six countries (US, Belgium, India, Russia, Mexico and Romania)
arXiv Detail & Related papers (2021-05-26T13:09:47Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.