The User-Aware Arabic Gender Rewriter
- URL: http://arxiv.org/abs/2210.07538v1
- Date: Fri, 14 Oct 2022 05:34:57 GMT
- Title: The User-Aware Arabic Gender Rewriter
- Authors: Bashar Alhafni, Ossama Obeid, Nizar Habash
- Abstract summary: We introduce the User-Aware Arabic Gender Rewriter, a user-centric web-based system for Arabic gender rewriting.
The system takes either Arabic or English sentences as input, and provides users with the ability to specify their desired first and/or second person target genders.
- Score: 12.26152421175485
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: We introduce the User-Aware Arabic Gender Rewriter, a user-centric web-based
system for Arabic gender rewriting in contexts involving two users. The system
takes either Arabic or English sentences as input, and provides users with the
ability to specify their desired first and/or second person target genders. The
system outputs gender rewritten alternatives of the Arabic input sentences (or
their Arabic translations in case of English input) to match the target users'
gender preferences.
Related papers
- Beyond Binary Gender: Evaluating Gender-Inclusive Machine Translation with Ambiguous Attitude Words [85.48043537327258]
Existing machine translation gender bias evaluations are primarily focused on male and female genders.
This study presents a benchmark AmbGIMT (Gender-Inclusive Machine Translation with Ambiguous attitude words)
We propose a novel process to evaluate gender bias based on the Emotional Attitude Score (EAS), which is used to quantify ambiguous attitude words.
arXiv Detail & Related papers (2024-07-23T08:13:51Z) - VisoGender: A dataset for benchmarking gender bias in image-text pronoun
resolution [80.57383975987676]
VisoGender is a novel dataset for benchmarking gender bias in vision-language models.
We focus on occupation-related biases within a hegemonic system of binary gender, inspired by Winograd and Winogender schemas.
We benchmark several state-of-the-art vision-language models and find that they demonstrate bias in resolving binary gender in complex scenes.
arXiv Detail & Related papers (2023-06-21T17:59:51Z) - Beyond Arabic: Software for Perso-Arabic Script Manipulation [67.31374614549237]
We provide a set of finite-state transducer (FST) components and corresponding utilities for manipulating the writing systems of languages that use the Perso-Arabic script.
The library also provides simple FST-based romanization and transliteration.
arXiv Detail & Related papers (2023-01-26T20:37:03Z) - The Shared Task on Gender Rewriting [7.6676670534261175]
The task of gender rewriting refers to generating alternatives of a given sentence to match different target user gender contexts.
This requires changing the grammatical gender of certain words referring to the users.
A total of five teams from four countries participated in the shared task.
arXiv Detail & Related papers (2022-10-22T10:27:53Z) - User-Centric Gender Rewriting [12.519348416773553]
We define the task of gender rewriting in contexts involving two users (I and/or You)
We develop a multi-step system that combines the positive aspects of both rule-based and neural rewriting models.
Our results successfully demonstrate the viability of this approach on a recently created corpus for Arabic gender rewriting.
arXiv Detail & Related papers (2022-05-04T17:46:17Z) - The Arabic Parallel Gender Corpus 2.0: Extensions and Analyses [17.253633576291897]
We introduce a new corpus for gender identification and rewriting in contexts involving one or two target users.
We focus on Arabic, a gender-marking morphologically rich language.
arXiv Detail & Related papers (2021-10-18T12:06:17Z) - User Factor Adaptation for User Embedding via Multitask Learning [45.56193775870044]
We treat the user interest as domains and empirically examine how the user language can vary across the user factor.
We propose a user embedding model to account for the language variability of user interests via a multitask learning framework.
The model learns user language and its variations without human supervision.
arXiv Detail & Related papers (2021-02-22T15:21:01Z) - They, Them, Theirs: Rewriting with Gender-Neutral English [56.14842450974887]
We perform a case study on the singular they, a common way to promote gender inclusion in English.
We show how a model can be trained to produce gender-neutral English with 1% word error rate with no human-labeled data.
arXiv Detail & Related papers (2021-02-12T21:47:48Z) - Neural Machine Translation Doesn't Translate Gender Coreference Right
Unless You Make It [18.148675498274866]
We propose schemes for incorporating explicit word-level gender inflection tags into Neural Machine Translation.
We find that simple existing approaches can over-generalize a gender-feature to multiple entities in a sentence.
We also propose an extension to assess translations of gender-neutral entities from English given a corresponding linguistic convention.
arXiv Detail & Related papers (2020-10-11T20:05:42Z) - Gender in Danger? Evaluating Speech Translation Technology on the
MuST-SHE Corpus [20.766890957411132]
Translating from languages without productive grammatical gender like English into gender-marked languages is a well-known difficulty for machines.
Can audio provide additional information to reduce gender bias?
We present the first thorough investigation of gender bias in speech translation, contributing with the release of a benchmark useful for future studies.
arXiv Detail & Related papers (2020-06-10T09:55:38Z) - Multi-Dimensional Gender Bias Classification [67.65551687580552]
Machine learning models can inadvertently learn socially undesirable patterns when training on gender biased text.
We propose a general framework that decomposes gender bias in text along several pragmatic and semantic dimensions.
Using this fine-grained framework, we automatically annotate eight large scale datasets with gender information.
arXiv Detail & Related papers (2020-05-01T21:23:20Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.