Fugu-MT 論文翻訳(概要): Identifying and Mitigating Gender Cues in Academic Recommendation Letters: An Interpretability Case Study

論文の概要: Identifying and Mitigating Gender Cues in Academic Recommendation Letters: An Interpretability Case Study

arxiv url: http://arxiv.org/abs/2604.12337v1
Date: Tue, 14 Apr 2026 06:19:35 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-15 19:11:32.279306
Title: Identifying and Mitigating Gender Cues in Academic Recommendation Letters: An Interpretability Case Study
Title（参考訳）: 学術レコメンデーションレターにおけるジェンダーキュウリの同定と誤用 : 解釈可能性に関する事例研究
Authors: Charlotte S. Alexander, Shane Storks, Souradip Pal, Sayak Chakrabarty, Arushi Sharma, Mlen-Too Wesley, Bailey Russo,
Abstract要約: レコメンデーションのレター(LoR)は、下流の決定に影響を及ぼす暗黙的に性別が付けられた言語のパターンを運ぶことができる。本研究では,トランスフォーマーをベースとしたエンコーダモデルが,米国医療保険プログラムに提出された学術LoRの応募者の性別を推測できる範囲について検討する。
参考スコア（独自算出の注目度）: 3.7181065265645814
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Letters of recommendation (LoRs) can carry patterns of implicitly gendered language that can inadvertently influence downstream decisions, e.g. in hiring and admissions. In this work, we investigate the extent to which Transformer-based encoder models as well as Large Language Models (LLMs) can infer the gender of applicants in academic LoRs submitted to an U.S. medical-residency program after explicit identifiers like names and pronouns are de-gendered. While using three models (DistilBERT, RoBERTa, and Llama 2) to classify the gender of anonymized and de-gendered LoRs, significant gender leakage was observed as evident from up to 68% classification accuracy. Text interpretation methods, like TF-IDF and SHAP, demonstrate that certain linguistic patterns are strong proxies for gender, e.g. "emotional'' and "humanitarian'' are commonly associated with LoRs from female applicants. As an experiment in creating truly gender-neutral LoRs, these implicit gender cues were remove resulting in a drop of up to 5.5% accuracy and 2.7% macro $F_1$ score on re-training the classifiers. However, applicant gender prediction still remains better than chance. In this case study, our findings highlight that 1) LoRs contain gender-identifying cues that are hard to remove and may activate bias in decision-making and 2) while our technical framework may be a concrete step toward fairer academic and professional evaluations, future work is needed to interrogate the role that gender plays in LoR review. Taken together, our findings motivate upstream auditing of evaluative text in real-world academic letters of recommendation as a necessary complement to model-level fairness interventions.
Abstract（参考訳）: レコメンデーション(LoR)のレコメンデーション(レコメンデーションのレコメンデーション)は、インタヴューやインタヴューなど、下流の判断に不注意に影響を及ぼす可能性のある、暗黙的に性的な言語パターンを持てる。本研究では,トランスフォーマーをベースとしたエンコーダモデルとLarge Language Models (LLMs) が,名称や代名詞などの明示的な識別子がデジェンダー化された後,米国医学研修プログラムに提出された学術LoRの応募者の性別を推定できる範囲について検討する。使用中(DistilBERT,RoBERTa,Llama) 2) 匿名化・非性別化LoRの性別を分類するために, 68%の分類精度から有意な性別リークが観察された。 TF-IDFやSHAPのようなテキスト解釈法は、特定の言語パターンがジェンダーの強力なプロキシであることを示す。真に性中立なLoRを作成する実験として、これらの暗黙の性別の手がかりは取り除かれ、その結果、最大5.5%の精度と2.7%のマクロ$F_1$スコアが、分類器を再訓練した。しかし、応募者の性別予測は依然として偶然よりも優れている。今回のケーススタディでは,この知見が注目されている。 1)LoRは、削除が困難で意思決定の偏見を活性化し得る性別識別手段を含む。 2)我々の技術枠組みは、より公正な学術的・専門的な評価に向けての具体的な一歩かもしれないが、今後は、LoRレビューにおいてジェンダーが果たす役割を問う必要がある。本研究は, モデルレベルのフェアネス介入を補完する上で必要となる, 実世界の学術論文推薦書における評価テキストの上流監査を動機づけるものである。

論文の概要: Identifying and Mitigating Gender Cues in Academic Recommendation Letters: An Interpretability Case Study

関連論文リスト