GenderedNews: Une approche computationnelle des \'ecarts de
repr\'esentation des genres dans la presse fran\c{c}aise
- URL: http://arxiv.org/abs/2202.05682v2
- Date: Mon, 7 Mar 2022 10:07:59 GMT
- Title: GenderedNews: Une approche computationnelle des \'ecarts de
repr\'esentation des genres dans la presse fran\c{c}aise
- Authors: Ange Richard and Gilles Bastin and Fran\c{c}ois Portet
- Abstract summary: We present it GenderedNews (urlhttps://gendered-news.imag.fr), an online dashboard which gives weekly measures of gender imbalance in French online press.
We use Natural Language Processing (NLP) methods to quantify gender inequalities in the media.
We describe the data collected daily (seven main titles of French online news media) and the methodology behind our metrics.
- Score: 0.0
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: In this article, we present {\it GenderedNews}
(\url{https://gendered-news.imag.fr}), an online dashboard which gives weekly
measures of gender imbalance in French online press. We use Natural Language
Processing (NLP) methods to quantify gender inequalities in the media, in the
wake of global projects like the Global Media Monitoring Project. Such projects
are instrumental in highlighting gender imbalance in the media and its very
slow evolution. However, their generalisation is limited by their sampling and
cost in terms of time, data and staff. Automation allows us to offer
complementary measures to quantify inequalities in gender representation. We
understand representation as the presence and distribution of men and women
mentioned and quoted in the news -- as opposed to representation as
stereotypification. In this paper, we first review different means adopted by
previous studies on gender inequality in the media : qualitative content
analysis, quantitative content analysis and computational methods. We then
detail the methods adopted by {\it GenderedNews} and the two metrics
implemented: the masculinity rate of mentions and the proportion of men quoted
in online news. We describe the data collected daily (seven main titles of
French online news media) and the methodology behind our metrics, as well as a
few visualisations. We finally propose to illustrate possible analysis of our
data by conducting an in-depth observation of a sample of two months of our
database.
Related papers
- Beyond Binary Gender: Evaluating Gender-Inclusive Machine Translation with Ambiguous Attitude Words [85.48043537327258]
Existing machine translation gender bias evaluations are primarily focused on male and female genders.
This study presents a benchmark AmbGIMT (Gender-Inclusive Machine Translation with Ambiguous attitude words)
We propose a novel process to evaluate gender bias based on the Emotional Attitude Score (EAS), which is used to quantify ambiguous attitude words.
arXiv Detail & Related papers (2024-07-23T08:13:51Z) - GenderBias-\emph{VL}: Benchmarking Gender Bias in Vision Language Models via Counterfactual Probing [72.0343083866144]
This paper introduces the GenderBias-emphVL benchmark to evaluate occupation-related gender bias in Large Vision-Language Models.
Using our benchmark, we extensively evaluate 15 commonly used open-source LVLMs and state-of-the-art commercial APIs.
Our findings reveal widespread gender biases in existing LVLMs.
arXiv Detail & Related papers (2024-06-30T05:55:15Z) - What is Your Favorite Gender, MLM? Gender Bias Evaluation in Multilingual Masked Language Models [8.618945530676614]
This paper proposes an approach to estimate gender bias in multilingual lexicons from 5 languages: Chinese, English, German, Portuguese, and Spanish.
A novel model-based method is presented to generate sentence pairs for a more robust analysis of gender bias.
Our results suggest that gender bias should be studied on a large dataset using multiple evaluation metrics for best practice.
arXiv Detail & Related papers (2024-04-09T21:12:08Z) - Hi Guys or Hi Folks? Benchmarking Gender-Neutral Machine Translation
with the GeNTE Corpus [15.388894407006852]
Machine translation (MT) often defaults to masculine and stereotypical representations by making undue binary gender assumptions.
Our work addresses the rising demand for inclusive language by focusing head-on on gender-neutral translation from English to Italian.
arXiv Detail & Related papers (2023-10-08T21:44:00Z) - The Gender-GAP Pipeline: A Gender-Aware Polyglot Pipeline for Gender
Characterisation in 55 Languages [51.2321117760104]
This paper describes the Gender-GAP Pipeline, an automatic pipeline to characterize gender representation in large-scale datasets for 55 languages.
The pipeline uses a multilingual lexicon of gendered person-nouns to quantify the gender representation in text.
We showcase it to report gender representation in WMT training data and development data for the News task, confirming that current data is skewed towards masculine representation.
arXiv Detail & Related papers (2023-08-31T17:20:50Z) - VisoGender: A dataset for benchmarking gender bias in image-text pronoun
resolution [80.57383975987676]
VisoGender is a novel dataset for benchmarking gender bias in vision-language models.
We focus on occupation-related biases within a hegemonic system of binary gender, inspired by Winograd and Winogender schemas.
We benchmark several state-of-the-art vision-language models and find that they demonstrate bias in resolving binary gender in complex scenes.
arXiv Detail & Related papers (2023-06-21T17:59:51Z) - Radar de Parit\'e: An NLP system to measure gender representation in
French news stories [0.05735035463793007]
Radar de Parit'e measures the proportion of women and men quoted daily in six Canadian French-language media outlets.
We outline the system's architecture and detail the challenges we overcame to address French-specific issues.
arXiv Detail & Related papers (2023-04-19T21:33:59Z) - Auditing Gender Presentation Differences in Text-to-Image Models [54.16959473093973]
We study how gender is presented differently in text-to-image models.
By probing gender indicators in the input text, we quantify the frequency differences of presentation-centric attributes.
We propose an automatic method to estimate such differences.
arXiv Detail & Related papers (2023-02-07T18:52:22Z) - Inference of Media Bias and Content Quality Using Natural-Language
Processing [6.092956184948962]
We present a framework to infer both political bias and content quality of media outlets from text.
We apply a bidirectional long short-term memory (LSTM) neural network to a data set of more than 1 million tweets.
Our results illustrate the importance of leveraging word order into machine-learning methods in text analysis.
arXiv Detail & Related papers (2022-12-01T03:04:55Z) - Gender bias in magazines oriented to men and women: a computational
approach [58.720142291102135]
We compare the content of a women-oriented magazine with that of a men-oriented one, both produced by the same editorial group over a decade.
With Topic Modelling techniques we identify the main themes discussed in the magazines and quantify how much the presence of these topics differs between magazines over time.
Our results show that the frequency of appearance of the topics Family, Business and Women as sex objects, present an initial bias that tends to disappear over time.
arXiv Detail & Related papers (2020-11-24T14:02:49Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.