Fugu-MT 論文翻訳(概要): Who Gets Flagged? The Pluralistic Evaluation Gap in AI Content Watermarking

論文の概要: Who Gets Flagged? The Pluralistic Evaluation Gap in AI Content Watermarking

arxiv url: http://arxiv.org/abs/2604.13776v1
Date: Wed, 15 Apr 2026 12:06:56 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-16 20:38:32.52086
Title: Who Gets Flagged? The Pluralistic Evaluation Gap in AI Content Watermarking
Title（参考訳）: 誰がフラッグアップされるのか? AIコンテンツのウォーターマーキングにおける複数の評価ギャップ
Authors: Alexander Nemecek, Osama Zafar, Yuqiao Xu, Wenbiao Li, Erman Ayday,
Abstract要約: コンテンツ依存が偏見へのモダリティ固有の経路をどのように生み出すかを検討する。モダリティ全体にわたる主要な透かしベンチマークをレビューすると、ある例外を除いて、言語、文化コンテンツタイプ、あるいは人口グループ間でのパフォーマンスは報告されていないことが分かりました。電子透かしは、それが支配しようとする生成システムよりも低い公正度基準で保持されていることを示す。
参考スコア（独自算出の注目度）: 39.76203258538667
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Watermarking is becoming the default mechanism for AI content authentication, with governance policies and frameworks referencing it as infrastructure for content provenance. Yet across text, image, and audio modalities, watermark signal strength, detectability, and robustness depend on statistical properties of the content itself, properties that vary systematically across languages, cultural visual traditions, and demographic groups. We examine how this content dependence creates modality-specific pathways to bias. Reviewing the major watermarking benchmarks across modalities, we find that, with one exception, none report performance across languages, cultural content types, or population groups. To address this, we propose three concrete evaluation dimensions for pluralistic watermark benchmarking: cross-lingual detection parity, culturally diverse content coverage, and demographic disaggregation of detection metrics. We connect these to the governance frameworks currently mandating watermarking deployment and show that watermarking is held to a lower fairness standard than the generative systems it is meant to govern. Our position is that evaluation must precede deployment, and that the same bias auditing requirements applied to AI models should extend to the verification layer.
Abstract（参考訳）: ウォーターマーキングは、AIコンテンツ認証のデフォルトメカニズムになりつつある。しかし、テキスト、画像、オーディオのモダリティ、透かし信号の強さ、検出可能性、堅牢性は、コンテンツ自体の統計的特性、言語、文化的な視覚的伝統、人口統計学的グループによって異なる。この内容依存が偏見へのモダリティ固有の経路をいかに生み出すかを検討する。モダリティ全体にわたる主要な透かしベンチマークをレビューすると、ある例外を除いて、言語、文化コンテンツタイプ、あるいは人口グループ間でのパフォーマンスは報告されていないことが分かりました。そこで本研究では,多言語型透かしベンチマークのための3つの具体的評価次元として,言語横断的検出パリティ,文化的に多様なコンテンツカバレッジ,および検出指標の人口集団化について提案する。私たちはこれらを、現在ウォーターマーキングデプロイメントを義務付けているガバナンスフレームワークに接続し、ウォーターマーキングが、それが管理しようとするジェネレーティブシステムよりも低い公平性標準に保持されていることを示す。我々の立場では、評価はデプロイメントに先立ち、AIモデルに適用されるのと同じバイアス監査要件が検証層に拡張されるべきである。

論文の概要: Who Gets Flagged? The Pluralistic Evaluation Gap in AI Content Watermarking

関連論文リスト