Quantifying Gender Stereotypes in Japan between 1900 and 1999 with Word Embeddings
- URL: http://arxiv.org/abs/2510.03905v1
- Date: Sat, 04 Oct 2025 19:03:34 GMT
- Title: Quantifying Gender Stereotypes in Japan between 1900 and 1999 with Word Embeddings
- Authors: Shintaro Sakai, Haewoon Kwak, Jisun An, Akira Matsui,
- Abstract summary: We quantify the evolution of gender stereotypes in Japan from 1900 to 1999 using a series of 100 word embeddings.<n>We examine trajectories of gender stereotype across three traditionally gendered domains: Home, Work, and Politics, as well as occupations.
- Score: 3.539535591235625
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: We quantify the evolution of gender stereotypes in Japan from 1900 to 1999 using a series of 100 word embeddings, each trained on a corpus from a specific year. We define the gender stereotype value to measure the strength of a word's gender association by computing the difference in cosine similarity of the word to female- versus male-related attribute words. We examine trajectories of gender stereotype across three traditionally gendered domains: Home, Work, and Politics, as well as occupations. The results indicate that language-based gender stereotypes partially evolved to reflect women's increasing participation in the workplace and politics: Work and Politics domains become more strongly female-stereotyped over the years. Yet, Home also became more female-stereotyped, suggesting that women were increasingly viewed as fulfilling multiple roles such as homemakers, workers, and politicians, rather than having one role replace another. Furthermore, the strength of female stereotype for occupations positively correlate with the proportion of women in each occupation, indicating that word-embedding-based measures of gender stereotype mirrored demographic shifts to a considerable extent.
Related papers
- A Large Scale Analysis of Gender Biases in Text-to-Image Generative Models [47.16682882493828]
This paper presents a large-scale study on gender bias in text-to-image (T2I) models, focusing on everyday situations.<n>We create a dataset of 3,217 gender-neutral prompts and generate 200 images over 5 prompt variations per prompt from five leading T2I models.<n>We automatically detect the perceived gender of people in the generated images and filter out images with no person or multiple people of different genders.<n>Our analysis shows that T2I models reinforce traditional gender roles and reflect common gender stereotypes in household roles.
arXiv Detail & Related papers (2025-03-30T11:11:51Z) - Evaluating Gender Bias in Large Language Models [0.8636148452563583]
The study examines the extent to which Large Language Models (LLMs) exhibit gender bias in pronoun selection in occupational contexts.
The jobs considered include a range of occupations, from those with a significant male presence to those with a notable female concentration.
The results show a positive correlation between the models' pronoun choices and the gender distribution present in U.S. labor force data.
arXiv Detail & Related papers (2024-11-14T22:23:13Z) - Gender Bias in Decision-Making with Large Language Models: A Study of Relationship Conflicts [15.676219253088211]
We study gender equity within large language models (LLMs) through a decision-making lens.
We explore nine relationship configurations through name pairs across three name lists (men, women, neutral)
arXiv Detail & Related papers (2024-10-14T20:50:11Z) - Beyond Binary Gender: Evaluating Gender-Inclusive Machine Translation with Ambiguous Attitude Words [85.48043537327258]
Existing machine translation gender bias evaluations are primarily focused on male and female genders.
This study presents a benchmark AmbGIMT (Gender-Inclusive Machine Translation with Ambiguous attitude words)
We propose a novel process to evaluate gender bias based on the Emotional Attitude Score (EAS), which is used to quantify ambiguous attitude words.
arXiv Detail & Related papers (2024-07-23T08:13:51Z) - The Resume Paradox: Greater Language Differences, Smaller Pay Gaps [0.0]
We analyze the language in millions of US workers' resumes to investigate how differences in workers' self-representation by gender compare to differences in earnings.
Across US occupations, language differences between male and female resumes correspond to 11% of the variation in gender pay gap.
A doubling of the language difference between female and male resumes results in an annual wage increase of $2,797 for the average female worker.
arXiv Detail & Related papers (2023-07-17T15:49:35Z) - VisoGender: A dataset for benchmarking gender bias in image-text pronoun
resolution [80.57383975987676]
VisoGender is a novel dataset for benchmarking gender bias in vision-language models.
We focus on occupation-related biases within a hegemonic system of binary gender, inspired by Winograd and Winogender schemas.
We benchmark several state-of-the-art vision-language models and find that they demonstrate bias in resolving binary gender in complex scenes.
arXiv Detail & Related papers (2023-06-21T17:59:51Z) - Towards Understanding Gender-Seniority Compound Bias in Natural Language
Generation [64.65911758042914]
We investigate how seniority impacts the degree of gender bias exhibited in pretrained neural generation models.
Our results show that GPT-2 amplifies bias by considering women as junior and men as senior more often than the ground truth in both domains.
These results suggest that NLP applications built using GPT-2 may harm women in professional capacities.
arXiv Detail & Related papers (2022-05-19T20:05:02Z) - Gender stereotypes in the mediated personalization of politics:
Empirical evidence from a lexical, syntactic and sentiment analysis [2.7071541526963805]
We show that the political personalization in Italy is more detrimental for women than men.
Women politicians are covered with a more negative tone than their men counterpart when personal details are reported.
The major contribution to the observed gender differences comes from online news rather than print news.
arXiv Detail & Related papers (2022-02-07T11:40:44Z) - Quantifying Gender Bias Towards Politicians in Cross-Lingual Language
Models [104.41668491794974]
We quantify the usage of adjectives and verbs generated by language models surrounding the names of politicians as a function of their gender.
We find that while some words such as dead, and designated are associated with both male and female politicians, a few specific words such as beautiful and divorced are predominantly associated with female politicians.
arXiv Detail & Related papers (2021-04-15T15:03:26Z) - Language, communication and society: a gender based linguistics analysis [0.0]
The purpose of this study is to find evidence for supporting the hypothesis that language is the mirror of our thinking.
The answers have been analysed to see if gender stereotypes were present such as the attribution of psychological and behavioural characteristics.
arXiv Detail & Related papers (2020-07-14T08:38:37Z) - Multi-Dimensional Gender Bias Classification [67.65551687580552]
Machine learning models can inadvertently learn socially undesirable patterns when training on gender biased text.
We propose a general framework that decomposes gender bias in text along several pragmatic and semantic dimensions.
Using this fine-grained framework, we automatically annotate eight large scale datasets with gender information.
arXiv Detail & Related papers (2020-05-01T21:23:20Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.