Towards an Atlas of Cultural Commonsense for Machine Reasoning
- URL: http://arxiv.org/abs/2009.05664v3
- Date: Fri, 18 Dec 2020 23:26:25 GMT
- Title: Towards an Atlas of Cultural Commonsense for Machine Reasoning
- Authors: Anurag Acharya, Kartik Talamadupula and Mark A Finlayson
- Abstract summary: Existing commonsense reasoning datasets for AI and NLP tasks fail to address an important aspect of human life: cultural differences.
We introduce an approach that extends prior work on crowdsourcing commonsense knowledge by incorporating differences in knowledge that are attributable to cultural or national groups.
We demonstrate the technique by collecting commonsense knowledge that surrounds six fairly universal rituals across two national groups: the United States and India.
- Score: 18.472517610024866
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Existing commonsense reasoning datasets for AI and NLP tasks fail to address
an important aspect of human life: cultural differences. We introduce an
approach that extends prior work on crowdsourcing commonsense knowledge by
incorporating differences in knowledge that are attributable to cultural or
national groups. We demonstrate the technique by collecting commonsense
knowledge that surrounds six fairly universal rituals -- birth, coming-of-age,
marriage, funerals, new year, and birthdays -- across two national groups: the
United States and India. Our study expands the different types of relationships
identified by existing work in the field of commonsense reasoning for
commonplace events, and uses these new types to gather information that
distinguish the identity of the groups providing the knowledge. It also moves
us a step closer towards building a machine that doesn't assume a rigid
framework of universal (and likely Western-biased) commonsense knowledge, but
rather has the ability to reason in a contextually and culturally sensitive
way. Our hope is that cultural knowledge of this sort will lead to more
human-like performance in NLP tasks such as question answering (QA) and text
understanding and generation.
Related papers
- Extrinsic Evaluation of Cultural Competence in Large Language Models [53.626808086522985]
We focus on extrinsic evaluation of cultural competence in two text generation tasks.
We evaluate model outputs when an explicit cue of culture, specifically nationality, is perturbed in the prompts.
We find weak correlations between text similarity of outputs for different countries and the cultural values of these countries.
arXiv Detail & Related papers (2024-06-17T14:03:27Z) - Understanding the Capabilities and Limitations of Large Language Models for Cultural Commonsense [98.09670425244462]
Large language models (LLMs) have demonstrated substantial commonsense understanding.
This paper examines the capabilities and limitations of several state-of-the-art LLMs in the context of cultural commonsense tasks.
arXiv Detail & Related papers (2024-05-07T20:28:34Z) - What You Use is What You Get: Unforced Errors in Studying Cultural Aspects in Agile Software Development [2.9418191027447906]
Investigating the influence of cultural characteristics is challenging due to the multi-faceted concept of culture.
Cultural and social aspects are of high importance for their successful use in practice.
arXiv Detail & Related papers (2024-04-25T20:08:37Z) - CULTURE-GEN: Revealing Global Cultural Perception in Language Models through Natural Language Prompting [73.94059188347582]
We uncover culture perceptions of three SOTA models on 110 countries and regions on 8 culture-related topics through culture-conditioned generations.
We discover that culture-conditioned generation consist of linguistic "markers" that distinguish marginalized cultures apart from default cultures.
arXiv Detail & Related papers (2024-04-16T00:50:43Z) - Massively Multi-Cultural Knowledge Acquisition & LM Benchmarking [48.21982147529661]
This paper introduces a novel approach for massively multicultural knowledge acquisition.
Our method strategically navigates from densely informative Wikipedia documents on cultural topics to an extensive network of linked pages.
Our work marks an important step towards deeper understanding and bridging the gaps of cultural disparities in AI.
arXiv Detail & Related papers (2024-02-14T18:16:54Z) - Extracting Cultural Commonsense Knowledge at Scale [28.856786775318486]
CANDLE is an end-to-end methodology for extracting high-quality cultural commonsense knowledge at scale.
It organizes assertions into coherent clusters for 3 domains of subjects (geography, religion, occupation) and several cultural facets.
It includes judicious techniques for classification-based filtering and scoring of interestingness.
arXiv Detail & Related papers (2022-10-14T12:53:57Z) - CIKQA: Learning Commonsense Inference with a Unified
Knowledge-in-the-loop QA Paradigm [120.98789964518562]
We argue that due to the large scale of commonsense knowledge, it is infeasible to annotate a large enough training set for each task to cover all commonsense for learning.
We focus on investigating models' commonsense inference capabilities from two perspectives.
We name the benchmark as Commonsense Inference with Knowledge-in-the-loop Question Answering (CIKQA)
arXiv Detail & Related papers (2022-10-12T14:32:39Z) - Learning Robust Real-Time Cultural Transmission without Human Data [82.05222093231566]
We provide a method for generating zero-shot, high recall cultural transmission in artificially intelligent agents.
Our agents succeed at real-time cultural transmission from humans in novel contexts without using any pre-collected human data.
This paves the way for cultural evolution as an algorithm for developing artificial general intelligence.
arXiv Detail & Related papers (2022-03-01T19:32:27Z) - Deception detection in text and its relation to the cultural dimension
of individualism/collectivism [6.17866386107486]
We investigate if differences in the usage of specific linguistic features of deception across cultures can be confirmed and attributed to norms in respect to the individualism/collectivism divide.
We create culture/language-aware classifiers by experimenting with a wide range of n-gram features based on phonology, morphology and syntax.
We conducted our experiments over 11 datasets from 5 languages i.e., English, Dutch, Russian, Spanish and Romanian, from six countries (US, Belgium, India, Russia, Mexico and Romania)
arXiv Detail & Related papers (2021-05-26T13:09:47Z) - Identifying Distributional Perspective Differences from Colingual Groups [41.58939666949895]
A lack of mutual understanding among different groups about their perspectives on specific values or events may lead to uninformed decisions or biased opinions.
We study colingual groups and use language corpora as a proxy to identify their distributional perspectives.
We present a novel computational approach to learn shared understandings, and benchmark our method by building culturally-aware models for the English, Chinese, and Japanese languages.
arXiv Detail & Related papers (2020-04-10T08:13:07Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.