Fugu-MT 論文翻訳(概要): Beyond the Prompt: Gender Bias in Text-to-Image Models, with a Case Study on Hospital Professions

論文の概要: Beyond the Prompt: Gender Bias in Text-to-Image Models, with a Case Study on Hospital Professions

arxiv url: http://arxiv.org/abs/2510.00045v1
Date: Sat, 27 Sep 2025 11:18:46 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-03 16:59:20.132226
Title: Beyond the Prompt: Gender Bias in Text-to-Image Models, with a Case Study on Hospital Professions
Title（参考訳）: テキスト・ツー・イメージモデルにおけるジェンダーバイアスの枠を超えて : 病院の職業を事例として
Authors: Franck Vandewiele, Remi Synave, Samuel Delepoulle, Remi Cozot,
Abstract要約: 本稿では,Hunyuan Image 2.1,HiDream-I1-dev,Qwen-Image,FLUX.1-dev,Stable-Diffusion 3.5 Large,Stable-Diffusion-XLの6つの最先端オープンウェイトモデルにおける性別表現について検討する。全てのモデルで女性や外科医が主に男性として看護婦を生産しています。この結果は、バイアス認識設計、バランスの取れたデフォルト、およびAI生成における作業ステレオタイプの強化を防ぐためのユーザガイダンスの必要性を浮き彫りにしている。
参考スコア（独自算出の注目度）: 0.0380593514849736
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Text-to-image (TTI) models are increasingly used in professional, educational, and creative contexts, yet their outputs often embed and amplify social biases. This paper investigates gender representation in six state-of-the-art open-weight models: HunyuanImage 2.1, HiDream-I1-dev, Qwen-Image, FLUX.1-dev, Stable-Diffusion 3.5 Large, and Stable-Diffusion-XL. Using carefully designed prompts, we generated 100 images for each combination of five hospital-related professions (cardiologist, hospital director, nurse, paramedic, surgeon) and five portrait qualifiers ("", corporate, neutral, aesthetic, beautiful). Our analysis reveals systematic occupational stereotypes: all models produced nurses exclusively as women and surgeons predominantly as men. However, differences emerge across models: Qwen-Image and SDXL enforce rigid male dominance, HiDream-I1-dev shows mixed outcomes, and FLUX.1-dev skews female in most roles. HunyuanImage 2.1 and Stable-Diffusion 3.5 Large also reproduce gender stereotypes but with varying degrees of sensitivity to prompt formulation. Portrait qualifiers further modulate gender balance, with terms like corporate reinforcing male depictions and beautiful favoring female ones. Sensitivity varies widely: Qwen-Image remains nearly unaffected, while FLUX.1-dev, SDXL, and SD3.5 show strong prompt dependence. These findings demonstrate that gender bias in TTI models is both systematic and model-specific. Beyond documenting disparities, we argue that prompt wording plays a critical role in shaping demographic outcomes. The results underscore the need for bias-aware design, balanced defaults, and user guidance to prevent the reinforcement of occupational stereotypes in generative AI.
Abstract（参考訳）: テキスト・トゥ・イメージ(英語版)(TTI)モデルは、プロ、教育、創造的な文脈でますます使われているが、そのアウトプットはしばしば社会的偏見を埋め込んで増幅する。本稿では,Hunyuan Image 2.1,HiDream-I1-dev,Qwen-Image,FLUX.1-dev,Stable-Diffusion 3.5 Large,Stable-Diffusion-XLの6つの最先端オープンウェイトモデルにおける性別表現について検討する。慎重に設計したプロンプトを用いて,5つの病院関連専門職(心臓科医,病院院長,看護師,救急医,外科医)と5つの肖像画等級(コーポレート,中立,美容,美容,美容)の組み合わせについて,100枚の画像を生成した。全てのモデルで女性や外科医が主に男性として看護婦を生産しています。しかし、Qwen-Image と SDXL は厳格な男性優位を強制し、HiDream-I1-dev は様々な結果を示し、FLUX.1-dev は多くの役割において女性を歪ませる。 HunyuanImage 2.1 と Stable-Diffusion 3.5 Large も性別のステレオタイプを再現するが、様々な感度を持つ。ポートレートの等式はさらに男女のバランスを調節し、企業による男性の描写の強化や、女性の美しい好みの表現といった用語が用いられる。 Qwen-Imageはほとんど影響を受けていないが、FLUX.1-dev、SDXL、SD3.5は強い迅速な依存を示す。これらの結果は、TTIモデルにおける性別バイアスは、体系的およびモデル固有であることを示している。格差の文書化以外にも、人口統計学的結果を形成する上で、早急な言い回しが重要な役割を担っていると論じる。この結果は、バイアス認識設計、バランスの取れたデフォルト、およびAI生成における作業ステレオタイプの強化を防ぐためのユーザガイダンスの必要性を浮き彫りにしている。

論文の概要: Beyond the Prompt: Gender Bias in Text-to-Image Models, with a Case Study on Hospital Professions

関連論文リスト