Fugu-MT 論文翻訳(概要): Do Gender Cues Affect LLM Value Trade-offs? Evidence from a Controlled Decision Benchmark

論文の概要: Do Gender Cues Affect LLM Value Trade-offs? Evidence from a Controlled Decision Benchmark

arxiv url: http://arxiv.org/abs/2606.02214v1
Date: Mon, 01 Jun 2026 13:14:10 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-02 21:34:32.087769
Title: Do Gender Cues Affect LLM Value Trade-offs? Evidence from a Controlled Decision Benchmark
Title（参考訳）: ジェンダー・キューはLLM価値のトレードオフに影響を及ぼすか? -制御決定ベンチマークによる証拠-
Authors: Yangyang Liu, Dong Yu, Pengyuan Liu,
Abstract要約: シナリオを保持しながらロールジェンダーの設定だけを変えるベンチマークを構築します。明示的なジェンダーの手がかりは、有界だが体系的な意思決定のフリップを引き起こす。ジェンダー効果は、決定的な値境界の近くに集中しており、ジェンダーの手がかりが局所的な境界シフト要因として働くことを示唆している。
参考スコア（独自算出の注目度）: 33.25199452418043
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Large language models are increasingly used in value-sensitive decision settings, where irrelevant demographic cues should not alter judgments. We construct the Realistic Value Decision Benchmark (RVDB), a controlled benchmark that varies only the role-gender configuration while holding the scenario, ordered value pair, roles, candidate decisions, Value Distance, and Decision Severity fixed. Using a position-balanced evaluation across seven models, we test whether models preserve decision invariance under gender perturbations and whether their self-attributions reflect observed behavioral changes. We find that explicit gender cues induce bounded but systematic decision flips, including under an explicit gender-attribution prompt that asks models to report whether gender influenced their choice. Cross-gender role swaps reveal a consistent female-proposed-decision asymmetry, while models often attribute flipped decisions to No Influence or other non-gender factors. Further analysis shows that gender effects concentrate near less determinate value boundaries and under more severe decision contexts, suggesting that gender cues act as local boundary-shifting factors rather than global overrides of value reasoning. Value rankings remain largely stable, but ordered value-pair trade-offs shift unevenly across role-gender configurations. These results show that gender can enter LLM value trade-offs behaviorally while remaining obscured in self-attribution, motivating controlled behavioral audits beyond explanation-based evaluation.
Abstract（参考訳）: 大規模な言語モデルは、無関係な人口統計学的手がかりが判断を変えるべきではないような、価値に敏感な意思決定設定において、ますます使われるようになっている。我々は、シナリオ、順序付けられた値ペア、役割、候補決定、値距離、決定重症度を固定しながら、ロールジェンダー構成だけを変える制御されたベンチマークであるRealistic Value Decision Benchmark(RVDB)を構築した。 7つのモデルにまたがる位置バランス評価を用いて、モデルが性摂動下での意思決定の不分散を保ち、その自己帰属が観察された行動変化を反映するかどうかを検証した。明示的なジェンダー・キューは、性別が選択に影響を及ぼしたかどうかをモデルに報告するよう求める明示的なジェンダー属性・プロンプトを含む、有界だが体系的な意思決定のフリップを誘発する。異性間ロールスワップは、一貫した女性決定非対称性を示す一方、モデルはしばしば、No Influenceや他の非性的要因に反転した決定に帰着する。さらなる分析により、性別効果は、決定的な値境界に近づき、より厳しい決定コンテキスト下に集中していることが示され、男女の手がかりは、価値推論のグローバルなオーバーライドよりも、局所的な境界シフト要因として働くことが示唆された。価値ランキングは依然として安定しているが、順序付けられた価値対価値のトレードオフはロール-ジェンダー構成に不均一に移行している。これらの結果から,性別は自己帰属に不明瞭なままで,かつ,説明に基づく評価以上の行動監査を動機付けることができることがわかった。

論文の概要: Do Gender Cues Affect LLM Value Trade-offs? Evidence from a Controlled Decision Benchmark

関連論文リスト