Fugu-MT 論文翻訳(概要): More Than Sum of Its Parts: Deciphering Intent Shifts in Multimodal Hate Speech Detection

論文の概要: More Than Sum of Its Parts: Deciphering Intent Shifts in Multimodal Hate Speech Detection

arxiv url: http://arxiv.org/abs/2603.21298v1
Date: Sun, 22 Mar 2026 15:46:23 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-24 19:11:39.340452
Title: More Than Sum of Its Parts: Deciphering Intent Shifts in Multimodal Hate Speech Detection
Title（参考訳）: パートの総和以上:マルチモーダルヘイト音声検出におけるインテントシフトの解読
Authors: Runze Sun, Yu Zheng, Zexuan Xiong, Zhongjin Qu, Lei Chen, Jiwen Lu, Jie Zhou,
Abstract要約: ソーシャルメディア上でのヘイトスピーチは、サイバースペースの確保には不可欠だが、自動検出システムに大きく依存している。我々は、モダリティが相互作用する意味的意図変化を特徴付け、良心的憎悪から暗黙的憎悪を構築するか、意味的逆転を通じて毒性を中和する。本稿では,これらの複雑な手がかりを効果的に解読するために,裁判所エージェントによる非対称推論(Asymmetric Reasoning)フレームワークを提案する。
参考スコア（独自算出の注目度）: 49.07221734365168
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Combating hate speech on social media is critical for securing cyberspace, yet relies heavily on the efficacy of automated detection systems. As content formats evolve, hate speech is transitioning from solely plain text to complex multimodal expressions, making implicit attacks harder to spot. Current systems, however, often falter on these subtle cases, as they struggle with multimodal content where the emergent meaning transcends the aggregation of individual modalities. To bridge this gap, we move beyond binary classification to characterize semantic intent shifts where modalities interact to construct implicit hate from benign cues or neutralize toxicity through semantic inversion. Guided by this fine-grained formulation, we curate the Hate via Vision-Language Interplay (H-VLI) benchmark where the true intent hinges on the intricate interplay of modalities rather than overt visual or textual slurs. To effectively decipher these complex cues, we further propose the Asymmetric Reasoning via Courtroom Agent DEbate (ARCADE) framework. By simulating a judicial process where agents actively argue for accusation and defense, ARCADE forces the model to scrutinize deep semantic cues before reaching a verdict. Extensive experiments demonstrate that ARCADE significantly outperforms state-of-the-art baselines on H-VLI, particularly for challenging implicit cases, while maintaining competitive performance on established benchmarks. Our code and data are available at: https://github.com/Sayur1n/H-VLI
Abstract（参考訳）: ソーシャルメディア上でヘイトスピーチを議論することは、サイバースペースを確保する上で重要であるが、自動検知システムの有効性に大きく依存している。コンテンツフォーマットが進化するにつれて、ヘイトスピーチは単なる平易なテキストから複雑なマルチモーダル表現へと移行し、暗黙の攻撃を見つけるのが難しくなっている。しかしながら、現在のシステムはしばしば、個々のモダリティの集合を超越する創発的意味を持つマルチモーダルコンテンツに苦しむため、これらの微妙なケースに干渉する。このギャップを埋めるために、我々は二項分類を超えて、モダリティが相互作用し、良心からの暗黙の憎悪を構築したり、意味の逆転を通じて毒性を中和する意味意図シフトを特徴づける。この微細な定式化によって導かれたHateは、視覚やテキストのスラーではなく、モダリティの複雑なインタープレイに真の意図が掛かるH-VLI(Vision-Language Interplay)ベンチマークによってキュレートされる。さらに,これらの複雑な手がかりを効果的に解読するために,Courtroom Agent Debate (ARCADE) フレームワークによる非対称推論を提案する。エージェントが非難と防衛を積極的に主張する司法手続きをシミュレートすることで、ARCADEはモデルに判決に達する前に深い意味を精査するよう強制する。大規模な実験により、ARCADEはH-VLIの最先端のベースライン、特に暗黙のケースにおいて、確立したベンチマーク上での競合性能を維持しながら、大幅に性能が向上することが示された。私たちのコードとデータは、https://github.com/Sayur1n/H-VLIで利用可能です。

論文の概要: More Than Sum of Its Parts: Deciphering Intent Shifts in Multimodal Hate Speech Detection

関連論文リスト