Related papers: Grammatical Gender's Influence on Distributional Semantics: A Causal Perspective

Grammatical Gender's Influence on Distributional Semantics: A Causal Perspective

URL: http://arxiv.org/abs/2311.18567v1
Date: Thu, 30 Nov 2023 13:58:13 GMT
Title: Grammatical Gender's Influence on Distributional Semantics: A Causal Perspective
Authors: Karolina Sta\'nczak, Kevin Du, Adina Williams, Isabelle Augenstein, Ryan Cotterell
Abstract summary: How much meaning influences gender assignment across languages is an active area of research in modern linguistics and cognitive science. We offer a novel, causal graphical model that jointly represents the interactions between a noun's grammatical gender, its meaning, and adjective choice. We find that grammatical gender has a near-zero effect on adjective choice, thereby calling the neo-Whorfian hypothesis into question.
Score: 100.47362690469669
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: How much meaning influences gender assignment across languages is an active area of research in modern linguistics and cognitive science. We can view current approaches as aiming to determine where gender assignment falls on a spectrum, from being fully arbitrarily determined to being largely semantically determined. For the latter case, there is a formulation of the neo-Whorfian hypothesis, which claims that even inanimate noun gender influences how people conceive of and talk about objects (using the choice of adjective used to modify inanimate nouns as a proxy for meaning). We offer a novel, causal graphical model that jointly represents the interactions between a noun's grammatical gender, its meaning, and adjective choice. In accordance with past results, we find a relationship between the gender of nouns and the adjectives which modify them. However, when we control for the meaning of the noun, we find that grammatical gender has a near-zero effect on adjective choice, thereby calling the neo-Whorfian hypothesis into question.

Related papers

What an Elegant Bridge: Multilingual LLMs are Biased Similarly in Different Languages [51.0349882045866]
This paper investigates biases of Large Language Models (LLMs) through the lens of grammatical gender. We prompt a model to describe nouns with adjectives in various languages, focusing specifically on languages with grammatical gender. We find that a simple classifier can not only predict noun gender above chance but also exhibit cross-language transferability.
arXiv Detail & Related papers (2024-07-12T22:10:16Z)
Neighboring Words Affect Human Interpretation of Saliency Explanations [65.29015910991261]
Word-level saliency explanations are often used to communicate feature-attribution in text-based models. Recent studies found that superficial factors such as word length can distort human interpretation of the communicated saliency scores. We investigate how the marking of a word's neighboring words affect the explainee's perception of the word's importance in the context of a saliency explanation.
arXiv Detail & Related papers (2023-05-04T09:50:25Z)
An exploration of the encoding of grammatical gender in word embeddings [0.6461556265872973]
The study of grammatical gender based on word embeddings can give insight into discussions on how grammatical genders are determined. It is found that there is an overlap in how grammatical gender is encoded in Swedish, Danish, and Dutch embeddings.
arXiv Detail & Related papers (2020-08-05T06:01:46Z)
Grammatical gender associations outweigh topical gender bias in crosslinguistic word embeddings [0.0]
Crosslinguistic word embeddings reveal that topical gender bias interacts with, and is surpassed in magnitude by, the effect of grammatical gender associations. This finding has implications for downstream applications such as machine translation.
arXiv Detail & Related papers (2020-05-18T16:39:16Z)
On the Relationships Between the Grammatical Genders of Inanimate Nouns and Their Co-Occurring Adjectives and Verbs [57.015586483981885]
We use large-scale corpora in six different gendered languages. We find statistically significant relationships between the grammatical genders of inanimate nouns and the verbs that take those nouns as direct objects, indirect objects, and as subjects.
arXiv Detail & Related papers (2020-05-03T22:49:44Z)
Predicting Declension Class from Form and Meaning [70.65971611552871]
Class membership is far from deterministic, but the phonological form of a noun and/or its meaning can often provide imperfect clues. We operationalize this by measuring how much information, in bits, we can glean about declension class from knowing the form and/or meaning of nouns. We find for two Indo-European languages (Czech and German) that form and meaning respectively share significant amounts of information with class.
arXiv Detail & Related papers (2020-05-01T21:48:48Z)
Where New Words Are Born: Distributional Semantic Analysis of Neologisms and Their Semantic Neighborhoods [51.34667808471513]
We investigate the importance of two factors, semantic sparsity and frequency growth rates of semantic neighbors, formalized in the distributional semantics paradigm. We show that both factors are predictive word emergence although we find more support for the latter hypothesis.
arXiv Detail & Related papers (2020-01-21T19:09:49Z)

This list is automatically generated from the titles and abstracts of the papers in this site.

This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.