Fugu-MT 論文翻訳(概要): Harsher on Male? Evaluating LLMs on Gender-Asymmetric Moral Framing Across Diverse Conflict Scenarios

論文の概要: Harsher on Male? Evaluating LLMs on Gender-Asymmetric Moral Framing Across Diverse Conflict Scenarios

arxiv url: http://arxiv.org/abs/2606.14068v1
Date: Fri, 12 Jun 2026 03:29:34 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-15 16:00:42.730247
Title: Harsher on Male? Evaluating LLMs on Gender-Asymmetric Moral Framing Across Diverse Conflict Scenarios
Title（参考訳）: Harsher on Male? : LLMs on Gender-Asymmetric Moral Framing Across Diverse Confenarios
Authors: Guangzong Si, Dong Wang, Zhenhao Li, Yifan Yu, Panwang Pan, Wentao Zhu,
Abstract要約: GAMA-Benchは、親密な関係と社会的対立を対象とする1,298のシナリオの性別ミラー付きベンチマークである。コントロールグリッドとクロスモデルレビューを通じて性中立の誤ったテンプレートを構築し、マッチしたアクター-ジェンダーとロール-参照のバリエーションでペア化された1対のプロンプトにコンパイルする。男性俳優は、より刺激的で、エスカレーション的で、非難中心のフレーミングを受け、女性俳優は、同じ不正行為に対してより治療的で共感指向のフレーミングを受ける。
参考スコア（独自算出の注目度）: 14.290719241304012
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Existing studies on gender bias in LLMs have largely focused on stereotypes, occupational associations, or explicit harmful outputs. In this work, we ask whether LLMs apply consistent response standards to the same negative behavior under matched male-actor and female-actor conditions. We introduce GAMA-Bench, a gender-mirrored benchmark of 1,298 scenarios covering intimate relationship and public social conflicts. It constructs gender-neutral misconduct templates through controlled grids and cross-model review, then compiles them into paired first-person prompts with matched actor-gender and role-reference variations. We further design a structured response-framing protocol to measure how models allocate punishment, empathy, escalation, instruction, and blame. Experiments on 10 representative LLMs reveal a consistent male-disadvantaging asymmetry: male actors receive more punitive, escalatory, and blame-centered framing, whereas female actors receive more therapeutic and empathy-oriented framing for the same misconduct. Further analyses show that this pattern persists across model families, scenario tracks, model scale, and explicit thinking-style reasoning. The official code is available at https://github.com/xufeiqiong/GAMA-Bench.
Abstract（参考訳）: LLMにおけるジェンダーバイアスに関する既存の研究は、主にステレオタイプ、職業的関連、または明らかな有害なアウトプットに焦点を当てている。本研究では,LLMが一致した男性・女性・俳優条件下で同じ負の挙動に一貫した応答標準を適用しているかどうかを問う。 GAMA-Benchは、親密な関係と社会的対立を対象とする1,298のシナリオの性別ミラー付きベンチマークである。コントロールグリッドとクロスモデルレビューを通じて性中立の誤ったテンプレートを構築し、マッチしたアクター-ジェンダーとロール-参照のバリエーションでペア化された1対のプロンプトにコンパイルする。さらに、モデルがどのように罰、共感、エスカレーション、命令、非難を割り当てるかを測定するために、構造化された応答フレーミングプロトコルを設計する。男性俳優は、より刺激的で、エスカレーション的で、非難中心のフレーミングを受け、女性俳優は、同じ不正行為に対してより治療的で共感指向のフレーミングを受ける。さらに分析したところ、このパターンはモデルファミリー、シナリオトラック、モデルスケール、明示的な思考スタイルの推論にまたがって持続することが示された。公式コードはhttps://github.com/xufeiqiong/GAMA-Bench.comで公開されている。

論文の概要: Harsher on Male? Evaluating LLMs on Gender-Asymmetric Moral Framing Across Diverse Conflict Scenarios

関連論文リスト