Fugu-MT 論文翻訳(概要): Are Non-English Papers Reviewed Fairly? Language-of-Study Bias in NLP Peer Reviews

論文の概要: Are Non-English Papers Reviewed Fairly? Language-of-Study Bias in NLP Peer Reviews

arxiv url: http://arxiv.org/abs/2604.07119v1
Date: Wed, 08 Apr 2026 14:14:36 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-09 17:30:51.573118
Title: Are Non-English Papers Reviewed Fairly? Language-of-Study Bias in NLP Peer Reviews
Title（参考訳）: 非英語論文は公正にレビューされるか? : NLP Peer Reviewsにおける言語と学習バイアス
Authors: Ehsan Barkhordar, Abdulfattah Safa, Verena Blaschke, Erika Lombart, Marie-Catherine de Marneffe, Gözde Gül Şahin,
Abstract要約: 言語・オブ・スタディ(LoS)バイアス(Language-of-study)は、レビュアーが研究する言語に基づいて、その科学的メリットではなく、異なる評価を行う傾向である。陰性型と正の型を区別したLoSバイアスを初めて体系的に評価し,人間による注釈付きデータセットLOBSTERを紹介した。
参考スコア（独自算出の注目度）: 6.0093124241390745
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Peer review plays a central role in the NLP publication process, but is susceptible to various biases. Here, we study language-of-study (LoS) bias: the tendency for reviewers to evaluate a paper differently based on the language(s) it studies, rather than its scientific merit. Despite being explicitly flagged in reviewing guidelines, such biases are poorly understood. Prior work treats such comments as part of broader categories of weak or unconstructive reviews without defining them as a distinct form of bias. We present the first systematic characterization of LoS bias, distinguishing negative and positive forms, and introduce the human-annotated dataset LOBSTER (Language-Of-study Bias in ScienTific pEer Review) and a method achieving 87.37 macro F1 for detection. We analyze 15,645 reviews to estimate how negative and positive biases differ with respect to the LoS, and find that non-English papers face substantially higher bias rates than English-only ones, with negative bias consistently outweighing positive bias. Finally, we identify four subcategories of negative bias, and find that demanding unjustified cross-lingual generalization is the most dominant form. We publicly release all resources to support work on fairer reviewing practices in NLP and beyond.
Abstract（参考訳）: ピアレビューはNLP出版プロセスにおいて中心的な役割を果たすが、様々なバイアスに影響を受けやすい。そこで本研究では,研究対象の言語(LoS)バイアスについて検討し,その科学的メリットではなく,研究対象の言語(s)に基づいて,レビュアーが論文を評価する傾向について考察した。ガイドラインに明示的にフラグ付けされているにもかかわらず、そのようなバイアスは理解されていない。それまでの作業では、このようなコメントを、弱い、あるいは建設的でないレビューのより広いカテゴリの一部として扱い、バイアスの明確な形式として定義することはなかった。我々は,LoSバイアスを初めて体系的に評価し,否定型と肯定型を区別し,人間の注釈付きデータセットLOBSTER(Language-Of-study Bias in ScienTific pEer Review)と検出のための87.37マクロF1を実現する方法を紹介する。我々は15,645のレビューを分析し、LoSに対する負のバイアスと正のバイアスの差を推定し、非英語論文が英語のみのものよりもかなり高いバイアス率に直面し、負のバイアスが常に正のバイアスを上回ることを発見した。最後に、負バイアスの4つのサブカテゴリを特定し、不当な言語間一般化を要求することが最も支配的な形式であることを示す。 NLP以降のフェアアレビュープラクティスの開発を支援するため、すべてのリソースを公開しています。

論文の概要: Are Non-English Papers Reviewed Fairly? Language-of-Study Bias in NLP Peer Reviews

関連論文リスト