Fugu-MT Paper Translation (Abstract): IYKYK (But AI Doesn't): Automated Content Moderation Does Not Capture Communities' Heterogeneous Attitudes Towards Reclaimed Language

Paper overview: IYKYK (But AI Doesn't): Automated Content Moderation Does Not Capture Communities' Heterogeneous Attitudes Towards Reclaimed Language

arxiv url: http://arxiv.org/abs/2604.16654v2
Date: Tue, 21 Apr 2026 17:02:41 GMT
Status: Translation complete
Last updated in system: 2026-04-22 14:04:47.799008
Title: IYKYK (But AI Doesn't): Automated Content Moderation Does Not Capture Communities' Heterogeneous Attitudes Towards Reclaimed Language
Title (reference translation): IYKYK (But AI Doesn't): Automated Content Moderation Does Not Capture Communities' Heterogeneous Attitudes Towards Reclaimed Language
Authors: Christina Chance, Rebecca Pattichis, Arjun Subramonian, James He, Shruti Narayanan, Saadia Gabriel, Kai-Wei Chang
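Abstract summary: We examine, quantitatively and qualitatively, the attitudes of social media users in LGBTQIA+, Black, and women communities toward reclaimed slurs. Inter-annotator agreement was low, indicating substantial disagreement among in-group annotators. Alignment between annotator judgments and automated hate speech assessments produced by Perspective API is poor.
Reference score (attention score computed independently by this site): 45.4201325387611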
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Reclaimed slur usage is a common and meaningful practice online for many marginalized communities. It serves as a source of solidarity, identity, and shared experience. However, contemporary automated and AI-based moderation tools for online content largely fail to distinguish between reclaimed and hateful uses of slurs, resulting in the suppression of marginalized voices. In this work, we use quantitative and qualitative methods to examine the attitudes of social media users in LGBTQIA+, Black, and women communities around reclaimed slurs targeting our focus groups including the f-word, n-word, and b-word. With social media users from these communities, we collect and analyze an annotated online slur usage corpus. The corpus includes annotators' perceptions of whether an online text containing a slur should be flagged as hate speech, as well as contextual features of the slur usage. Across all communities and annotation questions, we observe low inter-annotator agreement, indicating substantial disagreement among in-group annotators. This is compounded by the fact that, absent clear contextual signals of identity and intent, even in-group members may disagree on how to interpret reclaimed slur usage online. Semi-structured interviews with annotators suggest that differences in lived experience and personal history contribute to this variation as well. We find poor alignment between annotator judgments and automated hate speech assessments produced by Perspective API. We further observe that certain features of a text such as whether the slur usage was derogatory and if the slur was targeted at oneself are more associated with whether annotators report the text as hate speech. Together, these findings highlight the inherent subjectivity and contextual nature of how marginalized communities interpret slurs online.
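Abstract (reference translation): The use of reclaimed slurs is a common and meaningful practice online for many marginalized communities. It serves as a source of solidarity, identity, and shared experience. However, contemporary automated and AI-based moderation tools for online content largely fail to distinguish reclaimed uses of slurs from hateful ones, which results in the suppression of marginalized voices. This study uses quantitative and qualitative methods to examine the attitudes of social media users in LGBTQIA+, Black, and women communities toward reclaimed slurs that target these groups, including the f-word, n-word, and b-word. Together with social media users from these communities, the authors collect and analyze an annotated corpus of online slur usage. The corpus includes annotators' perceptions of whether an online text containing a slur should be flagged as hate speech, as well as contextual features of the slur usage. Across all communities and annotation questions, inter-annotator agreement is low, indicating substantial disagreement among in-group annotators. This is compounded by the fact that, when clear contextual signals of identity and intent are absent, even in-group members may disagree on how to interpret reclaimed slur usage online. Semi-structured interviews with annotators suggest that differences in lived experience and personal history also contribute to this variation. Alignment between annotator judgments and automated hate speech assessments produced by Perspective API is poor. In addition, certain features of a text, such as whether the slur usage was derogatory and whether the slur was targeted at oneself, are more strongly associated with whether annotators report the text as hate speech. Together, these findings highlight the inherent subjectivity and contextual nature of how marginalized communities interpret slurs online.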

Related papers