Fugu-MT 論文翻訳(概要): Omni-Fake: Benchmarking Unified Multimodal Social Media Deepfake Detection

論文の概要: Omni-Fake: Benchmarking Unified Multimodal Social Media Deepfake Detection

arxiv url: http://arxiv.org/abs/2605.01638v1
Date: Sat, 02 May 2026 22:56:17 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-05 20:33:49.861677
Title: Omni-Fake: Benchmarking Unified Multimodal Social Media Deepfake Detection
Title（参考訳）: Omni-Fake: マルチモーダルなソーシャルメディアディープフェイク検出のベンチマーク
Authors: Tianxiao Li, Zhenglin Huang, Haiquan Wen, Yiwei He, Xinze Li, Bingyu Zhu, Wuhui Duan, Congang Chen, Zeyu Fu, Yi Dong, Baoyuan Wu, Jason Li, Guangliang Cheng,
Abstract要約: ソーシャルメディアでは、マルチモーダルなディープフェイクが急増している。既存のベンチマークは、単一のモダリティの範囲、単純化された操作、非現実的な分布によって制約される。ソーシャルメディア設定における総合的なマルチモーダルディープフェイク検出のためのオムニ・フェイク(Omni-Fake)を提案する。
参考スコア（独自算出の注目度）: 42.81849035175321
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Multimodal deepfakes are proliferating on social media and threaten authenticity, information integrity, and digital forensics. Existing benchmarks are constrained by their single-modality scope, simplified manipulations, or unrealistic distributions, which limit their ability to assess real-world robustness. To address these limitations, we present Omni-Fake, a unified omni-dataset for comprehensive multimodal deepfake detection in social-media settings. It comprises Omni-Fake-Set, a large-scale, high-quality dataset with 1M+ samples, and Omni-Fake-OOD, an out-of-distribution benchmark with 200k+ samples intentionally excluded from training to evaluate generalization. Omni-Fake spans four modalities (image, audio, video, and audio-video talking head) and supports a joint detection-localization-explanation protocol. On top of Omni-Fake, we further propose Omni-Fake-R1, a reinforcement-learning-driven multimodal detector that adaptively integrates visual and auditory cues and outputs structured decisions, localization, and natural-language explanations. Extensive experiments show significant gains in detection accuracy, cross-modal generalization, and explainability over state-of-the-art baselines. Project page: https://tianxiao1201.github.io/omni-fake-project-page/
Abstract（参考訳）: マルチモーダルディープフェイクはソーシャルメディア上で増加しており、真正性、情報の完全性、およびデジタル法医学を脅かしている。既存のベンチマークは、単一のモダリティの範囲、単純化された操作、非現実的な分布によって制約され、現実世界の堅牢性を評価する能力を制限する。これらの制約に対処するために、ソーシャルメディア設定における総合的なマルチモーダルディープフェイク検出のためのオムニ・フェイク(Omni-Fake)を統一したオムニ・フェイク(Omni-Fake)を提案する。 1M以上のサンプルを持つ大規模で高品質なデータセットであるOmni-Fake-Setと、200k以上のサンプルを意図的にトレーニングから除外して一般化を評価するアウト・オブ・ディストリビューションのベンチマークであるOmni-Fake-OODで構成されている。 Omni-Fakeは4つのモード(画像、音声、ビデオ、音声-ビデオ)にまたがり、共同検出-局所化-説明プロトコルをサポートする。さらに,Omni-Fake-R1を提案する。Omni-Fake-R1は,視覚的および聴覚的手がかりを適応的に統合し,構造化決定,局所化,自然言語的説明を出力する強化学習駆動型マルチモーダル検出器である。大規模な実験は、検出精度、クロスモーダル一般化、および最先端のベースラインに対する説明可能性において有意な向上を示した。プロジェクトページ:https://tianxiao1201.github.io/omni-fake-project-page/

論文の概要: Omni-Fake: Benchmarking Unified Multimodal Social Media Deepfake Detection

関連論文リスト