Gender Neutralization for an Inclusive Machine Translation: from
Theoretical Foundations to Open Challenges
- URL: http://arxiv.org/abs/2301.10075v3
- Date: Tue, 4 Jul 2023 09:01:41 GMT
- Title: Gender Neutralization for an Inclusive Machine Translation: from
Theoretical Foundations to Open Challenges
- Authors: Andrea Piergentili, Dennis Fucci, Beatrice Savoldi, Luisa Bentivogli,
Matteo Negri
- Abstract summary: We explore gender-neutral translation (GNT) as a form of gender inclusivity and a goal to be achieved by machine translation (MT) models.
Specifically, we focus on translation from English into Italian, a language pair representative of salient gender-related linguistic transfer problems.
- Score: 11.37307883423629
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: Gender inclusivity in language technologies has become a prominent research
topic. In this study, we explore gender-neutral translation (GNT) as a form of
gender inclusivity and a goal to be achieved by machine translation (MT)
models, which have been found to perpetuate gender bias and discrimination.
Specifically, we focus on translation from English into Italian, a language
pair representative of salient gender-related linguistic transfer problems. To
define GNT, we review a selection of relevant institutional guidelines for
gender-inclusive language, discuss its scenarios of use, and examine the
technical challenges of performing GNT in MT, concluding with a discussion of
potential solutions to encourage advancements toward greater inclusivity in MT.
Related papers
- mGeNTE: A Multilingual Resource for Gender-Neutral Language and Translation [21.461095625903504]
mGeNTE is a dataset of English-Italian/German/Spanish language pairs.
It enables research in both automatic Gender-Neutral Translation (GNT) and language modelling for three grammatical gender languages.
arXiv Detail & Related papers (2025-01-16T09:35:15Z) - Beyond Binary Gender: Evaluating Gender-Inclusive Machine Translation with Ambiguous Attitude Words [85.48043537327258]
Existing machine translation gender bias evaluations are primarily focused on male and female genders.
This study presents a benchmark AmbGIMT (Gender-Inclusive Machine Translation with Ambiguous attitude words)
We propose a novel process to evaluate gender bias based on the Emotional Attitude Score (EAS), which is used to quantify ambiguous attitude words.
arXiv Detail & Related papers (2024-07-23T08:13:51Z) - Gender Bias in Machine Translation and The Era of Large Language Models [0.8702432681310399]
This chapter examines the role of Machine Translation in perpetuating gender bias, highlighting the challenges posed by cross-linguistic settings and statistical dependencies.
A comprehensive overview of relevant existing work related to gender bias in both conventional Neural Machine Translation approaches and Generative Pretrained Transformer models employed as Machine Translation systems is provided.
arXiv Detail & Related papers (2024-01-18T14:34:49Z) - Evaluating Gender Bias in the Translation of Gender-Neutral Languages
into English [0.0]
We introduce GATE X-E, an extension to the GATE corpus, that consists of human translations from Turkish, Hungarian, Finnish, and Persian into English.
The dataset features natural sentences with a wide range of sentence lengths and domains, challenging translation rewriters on various linguistic phenomena.
We present an English gender rewriting solution built on GPT-3.5 Turbo and use GATE X-E to evaluate it.
arXiv Detail & Related papers (2023-11-15T10:25:14Z) - Hi Guys or Hi Folks? Benchmarking Gender-Neutral Machine Translation
with the GeNTE Corpus [15.388894407006852]
Machine translation (MT) often defaults to masculine and stereotypical representations by making undue binary gender assumptions.
Our work addresses the rising demand for inclusive language by focusing head-on on gender-neutral translation from English to Italian.
arXiv Detail & Related papers (2023-10-08T21:44:00Z) - Participatory Research as a Path to Community-Informed, Gender-Fair
Machine Translation [19.098548371499678]
We propose a method and case study building on participatory action research to include queer and non-binary people, translators, and MT experts.
The case study focuses on German, where central findings are the importance of context dependency to avoid identity invalidation.
arXiv Detail & Related papers (2023-06-15T07:20:14Z) - "I'm fully who I am": Towards Centering Transgender and Non-Binary
Voices to Measure Biases in Open Language Generation [69.25368160338043]
Transgender and non-binary (TGNB) individuals disproportionately experience discrimination and exclusion from daily life.
We assess how the social reality surrounding experienced marginalization of TGNB persons contributes to and persists within Open Language Generation.
We introduce TANGO, a dataset of template-based real-world text curated from a TGNB-oriented community.
arXiv Detail & Related papers (2023-05-17T04:21:45Z) - Analyzing Gender Representation in Multilingual Models [59.21915055702203]
We focus on the representation of gender distinctions as a practical case study.
We examine the extent to which the gender concept is encoded in shared subspaces across different languages.
arXiv Detail & Related papers (2022-04-20T00:13:01Z) - GENder-IT: An Annotated English-Italian Parallel Challenge Set for
Cross-Linguistic Natural Gender Phenomena [2.4366811507669124]
gENder-IT is an English--Italian challenge set focusing on the resolution of natural gender phenomena.
It provides word-level gender tags on the English source side and multiple gender alternative, where translations needed, on the Italian target side.
arXiv Detail & Related papers (2021-08-05T21:08:45Z) - They, Them, Theirs: Rewriting with Gender-Neutral English [56.14842450974887]
We perform a case study on the singular they, a common way to promote gender inclusion in English.
We show how a model can be trained to produce gender-neutral English with 1% word error rate with no human-labeled data.
arXiv Detail & Related papers (2021-02-12T21:47:48Z) - Decoding and Diversity in Machine Translation [90.33636694717954]
We characterize differences between cost diversity paid for the BLEU scores enjoyed by NMT.
Our study implicates search as a salient source of known bias when translating gender pronouns.
arXiv Detail & Related papers (2020-11-26T21:09:38Z) - Gender Bias in Multilingual Embeddings and Cross-Lingual Transfer [101.58431011820755]
We study gender bias in multilingual embeddings and how it affects transfer learning for NLP applications.
We create a multilingual dataset for bias analysis and propose several ways for quantifying bias in multilingual representations.
arXiv Detail & Related papers (2020-05-02T04:34:37Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.