Evaluating Gender Bias in the Translation of Gender-Neutral Languages
  into English
        - URL: http://arxiv.org/abs/2311.08836v2
- Date: Wed, 13 Dec 2023 04:15:26 GMT
- Title: Evaluating Gender Bias in the Translation of Gender-Neutral Languages
  into English
- Authors: Spencer Rarrick, Ranjita Naik, Sundar Poudel, Vishal Chowdhary
- Abstract summary: We introduce GATE X-E, an extension to the GATE corpus, that consists of human translations from Turkish, Hungarian, Finnish, and Persian into English.
The dataset features natural sentences with a wide range of sentence lengths and domains, challenging translation rewriters on various linguistic phenomena.
We present an English gender rewriting solution built on GPT-3.5 Turbo and use GATE X-E to evaluate it.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract:   Machine Translation (MT) continues to improve in quality and adoption, yet
the inadvertent perpetuation of gender bias remains a significant concern.
Despite numerous studies into gender bias in translations from gender-neutral
languages such as Turkish into more strongly gendered languages like English,
there are no benchmarks for evaluating this phenomenon or for assessing
mitigation strategies. To address this gap, we introduce GATE X-E, an extension
to the GATE (Rarrick et al., 2023) corpus, that consists of human translations
from Turkish, Hungarian, Finnish, and Persian into English. Each translation is
accompanied by feminine, masculine, and neutral variants for each possible
gender interpretation. The dataset, which contains between 1250 and 1850
instances for each of the four language pairs, features natural sentences with
a wide range of sentence lengths and domains, challenging translation rewriters
on various linguistic phenomena. Additionally, we present an English gender
rewriting solution built on GPT-3.5 Turbo and use GATE X-E to evaluate it. We
open source our contributions to encourage further research on gender
debiasing.
 
      
        Related papers
        - EuroGEST: Investigating gender stereotypes in multilingual language   models [53.88459905621724]
 Large language models increasingly support multiple languages, yet most benchmarks for gender bias remain English-centric.<n>We introduce EuroGEST, a dataset designed to measure gender-stereotypical reasoning in LLMs across English and 29 European languages.
 arXiv  Detail & Related papers  (2025-06-04T11:58:18Z)
- The Lou Dataset -- Exploring the Impact of Gender-Fair Language in   German Text Classification [57.06913662622832]
 Gender-fair language fosters inclusion by addressing all genders or using neutral forms.
Gender-fair language substantially impacts predictions by flipping labels, reducing certainty, and altering attention patterns.
While we offer initial insights on the effect on German text classification, the findings likely apply to other languages.
 arXiv  Detail & Related papers  (2024-09-26T15:08:17Z)
- GenderCARE: A Comprehensive Framework for Assessing and Reducing Gender   Bias in Large Language Models [73.23743278545321]
 Large language models (LLMs) have exhibited remarkable capabilities in natural language generation, but have also been observed to magnify societal biases.
GenderCARE is a comprehensive framework that encompasses innovative Criteria, bias Assessment, Reduction techniques, and Evaluation metrics.
 arXiv  Detail & Related papers  (2024-08-22T15:35:46Z)
- Beyond Binary Gender: Evaluating Gender-Inclusive Machine Translation   with Ambiguous Attitude Words [85.48043537327258]
 Existing machine translation gender bias evaluations are primarily focused on male and female genders.
This study presents a benchmark AmbGIMT (Gender-Inclusive Machine Translation with Ambiguous attitude words)
We propose a novel process to evaluate gender bias based on the Emotional Attitude Score (EAS), which is used to quantify ambiguous attitude words.
 arXiv  Detail & Related papers  (2024-07-23T08:13:51Z)
- GATE X-E : A Challenge Set for Gender-Fair Translations from
  Weakly-Gendered Languages [0.0]
 We introduce GATE X-E, an extension to the GATE corpus, that consists of human translations from Turkish, Hungarian, Finnish, and Persian into English.
The dataset features natural sentences with a wide range of sentence lengths and domains, challenging translation rewriters on various linguistic phenomena.
We present a translation gender rewriting solution built with GPT-4 and use GATE X-E to evaluate it.
 arXiv  Detail & Related papers  (2024-02-22T04:36:14Z)
- The Gender-GAP Pipeline: A Gender-Aware Polyglot Pipeline for Gender
  Characterisation in 55 Languages [51.2321117760104]
 This paper describes the Gender-GAP Pipeline, an automatic pipeline to characterize gender representation in large-scale datasets for 55 languages.
The pipeline uses a multilingual lexicon of gendered person-nouns to quantify the gender representation in text.
We showcase it to report gender representation in WMT training data and development data for the News task, confirming that current data is skewed towards masculine representation.
 arXiv  Detail & Related papers  (2023-08-31T17:20:50Z)
- Gender Lost In Translation: How Bridging The Gap Between Languages
  Affects Gender Bias in Zero-Shot Multilingual Translation [12.376309678270275]
 bridging the gap between languages for which parallel data is not available affects gender bias in multilingual NMT.
We study the effect of encouraging language-agnostic hidden representations on models' ability to preserve gender.
We find that language-agnostic representations mitigate zero-shot models' masculine bias, and with increased levels of gender inflection in the bridge language, pivoting surpasses zero-shot translation regarding fairer gender preservation for speaker-related gender agreement.
 arXiv  Detail & Related papers  (2023-05-26T13:51:50Z)
- Multilingual Holistic Bias: Extending Descriptors and Patterns to Unveil
  Demographic Biases in Languages at Scale [0.21079694661943604]
 This extension consists of 20,459 sentences in 50 languages distributed across all 13 demographic axes.
Our benchmark is intended to uncover demographic imbalances and be the tool to quantify mitigations towards them.
 arXiv  Detail & Related papers  (2023-05-22T16:29:04Z)
- GATE: A Challenge Set for Gender-Ambiguous Translation Examples [0.31498833540989407]
 When source gender is ambiguous, machine translation models typically default to stereotypical gender roles, perpetuating harmful bias.
Recent work has led to the development of "gender rewriters" that generate alternative gender translations on such ambiguous inputs, but such systems are plagued by poor linguistic coverage.
We present and release GATE, a linguistically diverse corpus of gender-ambiguous source sentences along with multiple alternative target language translations.
 arXiv  Detail & Related papers  (2023-03-07T15:23:38Z)
- Towards Understanding Gender-Seniority Compound Bias in Natural Language
  Generation [64.65911758042914]
 We investigate how seniority impacts the degree of gender bias exhibited in pretrained neural generation models.
Our results show that GPT-2 amplifies bias by considering women as junior and men as senior more often than the ground truth in both domains.
These results suggest that NLP applications built using GPT-2 may harm women in professional capacities.
 arXiv  Detail & Related papers  (2022-05-19T20:05:02Z)
- Multi-Dimensional Gender Bias Classification [67.65551687580552]
 Machine learning models can inadvertently learn socially undesirable patterns when training on gender biased text.
We propose a general framework that decomposes gender bias in text along several pragmatic and semantic dimensions.
Using this fine-grained framework, we automatically annotate eight large scale datasets with gender information.
 arXiv  Detail & Related papers  (2020-05-01T21:23:20Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
       
     
           This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.