Fugu-MT 論文翻訳(概要): OpenFake: An Open Dataset and Platform Toward Real-World Deepfake Detection

論文の概要: OpenFake: An Open Dataset and Platform Toward Real-World Deepfake Detection

arxiv url: http://arxiv.org/abs/2509.09495v2
Date: Mon, 06 Oct 2025 21:24:19 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-08 15:38:21.472742
Title: OpenFake: An Open Dataset and Platform Toward Real-World Deepfake Detection
Title（参考訳）: OpenFake: リアルタイムディープフェイク検出に向けたオープンデータセットとプラットフォーム
Authors: Victor Livernoche, Akshatha Arodi, Andreea Musulan, Zachary Yang, Adam Salvail, Gaétan Marceau Caron, Jean-François Godbout, Reihaneh Rabbany,
Abstract要約: 高度なAI技術を使った合成メディアであるDeepfakesは、情報完全性に対する脅威が増大している。提案するOpenFakeは,高リアリズムを持つ現代的な生成モデルに対してベンチマークを行うためのデータセットである。 OpenFakeでトレーニングされた検出器は、ほぼ完全な分散性能、目に見えないジェネレータへの強力な一般化、および編集済みのソーシャルメディアテストセットでの高精度を実現する。
参考スコア（独自算出の注目度）: 6.949215801100937
License: http://creativecommons.org/licenses/by-nc-sa/4.0/
Abstract: Deepfakes, synthetic media created using advanced AI techniques, pose a growing threat to information integrity, particularly in politically sensitive contexts. This challenge is amplified by the increasing realism of modern generative models, which our human perception study confirms are often indistinguishable from real images. Yet, existing deepfake detection benchmarks rely on outdated generators or narrowly scoped datasets (e.g., single-face imagery), limiting their utility for real-world detection. To address these gaps, we present OpenFake, a large politically grounded dataset specifically crafted for benchmarking against modern generative models with high realism, and designed to remain extensible through an innovative crowdsourced adversarial platform that continually integrates new hard examples. OpenFake comprises nearly four million total images: three million real images paired with descriptive captions and almost one million synthetic counterparts from state-of-the-art proprietary and open-source models. Detectors trained on OpenFake achieve near-perfect in-distribution performance, strong generalization to unseen generators, and high accuracy on a curated in-the-wild social media test set, significantly outperforming models trained on existing datasets. Overall, we demonstrate that with high-quality and continually updated benchmarks, automatic deepfake detection is both feasible and effective in real-world settings.
Abstract（参考訳）: 高度なAI技術を使った合成メディアであるDeepfakesは、特に政治的に敏感な状況において、情報の完全性に対する脅威が増大している。この課題は、現代の生成モデルにおける現実主義の増大によって増幅され、人間の知覚研究は、実際の画像と区別できないことがしばしば確認されている。しかし、既存のディープフェイク検出ベンチマークは、時代遅れのジェネレータや狭い範囲のデータセット(例えば、シングルフェイスイメージ)に依存しており、実世界の検出に使用を制限している。これらのギャップに対処するため、我々は、高いリアリズムを持つ近代的な生成モデルに対してベンチマークを行うために特別に作られた、政治的基盤を持つ大規模なデータセットであるOpenFakeを紹介し、新しいハードケースを継続的に統合する革新的なクラウドソースの敵プラットフォームを通じて拡張性を維持するように設計されている。 OpenFakeは400万近い画像で構成されており、300万枚の実画像と説明文のキャプションが組み合わされ、最先端のプロプライエタリなオープンソースモデルから100万近い合成画像が合成されている。 OpenFakeでトレーニングされた検出器は、ほぼ完璧な分散性能、目に見えないジェネレータへの強力な一般化、既存のデータセットでトレーニングされたモデルよりもはるかに高い精度を実現している。全体として、高品質で継続的に更新されたベンチマークでは、自動ディープフェイク検出が現実の環境で実現可能であり、効果的であることを示す。

論文の概要: OpenFake: An Open Dataset and Platform Toward Real-World Deepfake Detection

関連論文リスト