Multilingual Contextual Affective Analysis of LGBT People Portrayals in
Wikipedia
- URL: http://arxiv.org/abs/2010.10820v2
- Date: Thu, 8 Apr 2021 08:20:12 GMT
- Title: Multilingual Contextual Affective Analysis of LGBT People Portrayals in
Wikipedia
- Authors: Chan Young Park, Xinru Yan, Anjalie Field, Yulia Tsvetkov
- Abstract summary: Specific lexical choices in narrative text reflect both the writer's attitudes towards people in the narrative and influence the audience's reactions.
We show how word connotations differ across languages and cultures, highlighting the difficulty of generalizing existing English datasets.
We then demonstrate the usefulness of our method by analyzing Wikipedia biography pages of members of the LGBT community across three languages.
- Score: 34.183132688084534
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Specific lexical choices in narrative text reflect both the writer's
attitudes towards people in the narrative and influence the audience's
reactions. Prior work has examined descriptions of people in English using
contextual affective analysis, a natural language processing (NLP) technique
that seeks to analyze how people are portrayed along dimensions of power,
agency, and sentiment. Our work presents an extension of this methodology to
multilingual settings, which is enabled by a new corpus that we collect and a
new multilingual model. We additionally show how word connotations differ
across languages and cultures, highlighting the difficulty of generalizing
existing English datasets and methods. We then demonstrate the usefulness of
our method by analyzing Wikipedia biography pages of members of the LGBT
community across three languages: English, Russian, and Spanish. Our results
show systematic differences in how the LGBT community is portrayed across
languages, surfacing cultural differences in narratives and signs of social
biases. Practically, this model can be used to identify Wikipedia articles for
further manual analysis -- articles that might contain content gaps or an
imbalanced representation of particular social groups.
Related papers
- Locating Information Gaps and Narrative Inconsistencies Across Languages: A Case Study of LGBT People Portrayals on Wikipedia [49.80565462746646]
We introduce the InfoGap method -- an efficient and reliable approach to locating information gaps and inconsistencies in articles at the fact level.
We evaluate InfoGap by analyzing LGBT people's portrayals, across 2.7K biography pages on English, Russian, and French Wikipedias.
arXiv Detail & Related papers (2024-10-05T20:40:49Z) - What an Elegant Bridge: Multilingual LLMs are Biased Similarly in Different Languages [51.0349882045866]
This paper investigates biases of Large Language Models (LLMs) through the lens of grammatical gender.
We prompt a model to describe nouns with adjectives in various languages, focusing specifically on languages with grammatical gender.
We find that a simple classifier can not only predict noun gender above chance but also exhibit cross-language transferability.
arXiv Detail & Related papers (2024-07-12T22:10:16Z) - Inclusivity in Large Language Models: Personality Traits and Gender Bias in Scientific Abstracts [49.97673761305336]
We evaluate three large language models (LLMs) for their alignment with human narrative styles and potential gender biases.
Our findings indicate that, while these models generally produce text closely resembling human authored content, variations in stylistic features suggest significant gender biases.
arXiv Detail & Related papers (2024-06-27T19:26:11Z) - Cordyceps@LT-EDI: Patching Language-Specific Homophobia/Transphobia
Classifiers with a Multilingual Understanding [0.0]
We present a joint multilingual (M-L) and language-specific (L-S) approach to homophobia and transphobic hate speech detection.
M-L models are needed to catch words, phrases, and concepts that are less common or missing in a particular language.
L-S models are better situated to understand the cultural and linguistic context of the users who typically write in a particular language.
arXiv Detail & Related papers (2023-09-24T06:37:54Z) - Analyzing Gender Representation in Multilingual Models [59.21915055702203]
We focus on the representation of gender distinctions as a practical case study.
We examine the extent to which the gender concept is encoded in shared subspaces across different languages.
arXiv Detail & Related papers (2022-04-20T00:13:01Z) - Whose Opinions Matter? Perspective-aware Models to Identify Opinions of
Hate Speech Victims in Abusive Language Detection [6.167830237917662]
We present an in-depth study to model polarized opinions coming from different communities.
We believe that by relying on this information, we can divide the annotators into groups sharing similar perspectives.
We propose a novel resource, a multi-perspective English language dataset annotated according to different sub-categories relevant for characterising online abuse.
arXiv Detail & Related papers (2021-06-30T08:35:49Z) - Pick a Fight or Bite your Tongue: Investigation of Gender Differences in
Idiomatic Language Usage [9.892162266128306]
We compile a novel, large and diverse corpus of spontaneous linguistic productions annotated with speakers' gender.
We perform a first large-scale empirical study of distinctions in the usage of textitfigurative language between male and female authors.
arXiv Detail & Related papers (2020-10-31T18:44:07Z) - Gender Bias in Multilingual Embeddings and Cross-Lingual Transfer [101.58431011820755]
We study gender bias in multilingual embeddings and how it affects transfer learning for NLP applications.
We create a multilingual dataset for bias analysis and propose several ways for quantifying bias in multilingual representations.
arXiv Detail & Related papers (2020-05-02T04:34:37Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.