Fugu-MT 論文翻訳(概要): Is She Even Relevant? When BERT Ignores Explicit Gender Cues

論文の概要: Is She Even Relevant? When BERT Ignores Explicit Gender Cues

arxiv url: http://arxiv.org/abs/2605.07622v1
Date: Fri, 08 May 2026 11:48:22 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-11 19:43:39.022118
Title: Is She Even Relevant? When BERT Ignores Explicit Gender Cues
Title（参考訳）: 彼女は関係あるのか? BERTがジェンダークイズを無視する時
Authors: Jonas Klein, Chiara Manna, Eva Vanmassenhove,
Abstract要約: 本稿では,オランダのBERTモデルにおいて,スクラッチから学習した性別情報がどのように出現するかについて検討する。これはトランスフォーマーアーキテクチャにおいて、過剰な形態的性別マーキングとジェネリックフォームを組み合わせた言語のための最初のチェックポイントレベルのバイアス生成分析の1つである。
参考スコア（独自算出の注目度）: 0.962211505591938
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Gender bias in large language models has primarily been investigated for English, while languages with grammatical or morphological gender remain comparatively understudied. This paper investigates how and when gender information emerges in a Dutch BERT model trained from scratch, offering one of the first checkpoint-level analyses of bias formation in a Transformer architecture for a language combining overt morphological gender marking and generic forms. By extracting contextual embeddings throughout training, we construct dynamic gender subspaces using linear SVMs to trace when gender becomes linearly encoded and how this encoding evolves over time. Contextual embeddings are often assumed to integrate contextual cues robustly, allowing models to adjust the representation of a word depending on its more local usage. We therefore test whether explicit gender cues in controlled sentence templates (e.g., Zij is een loodgieter ('She is a plumber')) can override learned statistical associations (plumber -> male). Our findings challenge this assumption: although gender becomes clearly linearly separable around epoch 20 and is distributed across multiple embedding dimensions, the model struggles to update its internal gender representation in light of explicit contextual cues in short sentence templates. Stereotypical gender-profession pairings are predicted far more accurately than anti-stereotypical ones, and generic forms in Dutch systematically default to a male interpretation, even when the context explicitly denotes a female referent. Together, our results seem to indicate that contextualization in the representations learned by our Dutch BERT model is not sufficiently dynamic along the probed gender direction: explicit gender cues in anti-stereotypical contexts are not reliably reflected in the resulting representations, resulting in persistent male-default behaviour.
Abstract（参考訳）: 大きな言語モデルにおけるジェンダーバイアスは主に英語で研究されているが、文法的または形態学的性別を持つ言語は比較的研究が進んでいない。本稿では,オランダのBERTモデルにおいて,スクラッチから学習した性別情報がどのように出現するかを考察し,過度な形態的性別マーキングとジェネリックフォームを組み合わせた言語用トランスフォーマーアーキテクチャにおけるバイアス生成のチェックポイントレベル解析の1つを提供する。学習を通してコンテキスト埋め込みを抽出することにより、線形SVMを用いて動的ジェンダー部分空間を構築し、ジェンダーが線形にエンコードされ、このエンコーディングが時間とともにどのように進化するかを追跡する。文脈埋め込みはしばしば文脈的手がかりをしっかりと統合し、より局所的な使用量に応じて単語の表現を調整することができると仮定される。したがって、制御された文テンプレート(例: Zij is een loodgieter("She is a plumber"))が学習された統計的関連(plumber -> male)をオーバーライドできるかどうかを検証した。性別はエポック20前後で明確に線形分離可能となり,複数の埋め込み次元に分散するが,短い文章テンプレートの文脈的手がかりを考慮に入れた内部性表現の更新に苦慮する。ステレオタイプとジェンダーとプロのペアリングは、反ステレオタイプよりもはるかに正確に予測され、オランダのジェネリックフォームは、文脈が女性参照を明示的に示している場合でも、男性解釈に体系的に既定である。以上の結果から,オランダのBERTモデルで学習した表現の文脈化は,調査対象の性別方向に沿って十分に動的ではないことが示唆された。

論文の概要: Is She Even Relevant? When BERT Ignores Explicit Gender Cues

関連論文リスト