Fugu-MT 論文翻訳(概要): A Multi-Domain Feature Fusion Framework for Generalizable Deepfake Detection Across Different Generators

論文の概要: A Multi-Domain Feature Fusion Framework for Generalizable Deepfake Detection Across Different Generators

arxiv url: http://arxiv.org/abs/2606.14230v1
Date: Fri, 12 Jun 2026 08:12:03 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-15 16:00:42.81558
Title: A Multi-Domain Feature Fusion Framework for Generalizable Deepfake Detection Across Different Generators
Title（参考訳）: 異なる発電機間での一般化可能なディープフェイク検出のためのマルチドメイン特徴融合フレームワーク
Authors: Amna Amjid, Sana Qadir, Mehwish Fatima, Raja Khurram Shahzad,
Abstract要約: 本稿では,SGFF-Netと呼ばれるマルチドメインのディープフェイク検出フレームワークを提案する。空間、勾配、DWTに基づく周波数表現を二重残差学習アーキテクチャに統合する。データセット内評価において98.95%の精度を実現し、クロスモデルとクロスパラダイムの両方での性能を向上させる。
参考スコア（独自算出の注目度）: 1.8499314936771558
License: http://creativecommons.org/licenses/by-nc-sa/4.0/
Abstract: Deepfakes are artificially generated images, audio, or videos that threaten privacy, security, and information integrity. Detecting such content is crucial for countering disinformation, as the latest models generate highly realistic content. While spatial- or frequency-based approaches achieve good detection rates on Generative Adversarial Networks (GANs)-based generated deepfakes, they often struggle with recent diffusion model-generated images. In particular, existing approaches rarely exploit complementary multi-domain representations or systematically evaluate cross-generator robustness. To address these challenges, we propose a multi-domain deepfake detection framework called SGFF-Net (Spatial-Gradient-Frequency Fusion Network) that integrates spatial, gradient, and DWT (Discrete Wavelet Transform)-based frequency representations within a dual residual learning architecture. Experimental results show that the SGFF-Net achieves 98.95\% accuracy in intra-dataset evaluation and improves performance in both cross-model (70.46\%) and cross-paradigm (69.94\%) settings. Incorporating multi-source training and data augmentation further enhances robustness, increasing accuracy from 70.46\% to 79.80\% in cross-model evaluation, from 69\% to 78\% in cross-paradigm evaluation, and from 61.50\% to 75.80\% on real-world data. Unlike single-domain detectors, the SGFF-Net learns complementary forensic cues across spatial, gradient, and wavelet-frequency domains, resulting in greater robustness under cross-generator and cross-paradigm evaluation. The results further show that combining multi-domain representations with data diversity and augmentation substantially improves generalization, providing practical insights for developing more reliable deepfake detection systems.
Abstract（参考訳）: Deepfakesは、プライバシ、セキュリティ、情報の整合性を脅かす、人工的に生成された画像、オーディオ、ビデオだ。このようなコンテンツを検出することは、最新のモデルが非常にリアルなコンテンツを生成するため、偽情報に対抗するために不可欠である。空間的あるいは周波数的アプローチは、GAN(Generative Adversarial Networks)ベースの生成したディープフェイクに対して良好な検出率を達成するが、近年の拡散モデル生成画像と競合することが多い。特に、既存のアプローチでは、補完的なマルチドメイン表現を利用することはめったにない。これらの課題に対処するために,空間,勾配,DWT(Discrete Wavelet Transform)に基づく周波数表現を統合したSGFF-Net(Spatial-Gradient-Frequency Fusion Network)と呼ばれるマルチドメインディープフェイク検出フレームワークを提案する。実験結果から,SGFF-Netはデータセット内評価において98.95 %の精度を実現し,クロスモデル (70.46 %) とクロスパラダイム (69.94 %) の両方の性能向上を図っている。マルチソーストレーニングとデータ拡張の併用により、より堅牢性が向上し、クロスモデル評価では70.46\%から79.80\%、クロスパラダイム評価では69\%から78\%、実世界のデータでは61.50\%から75.80\%に精度が向上する。単一ドメイン検出器とは異なり、SGFF-Netは空間、勾配、ウェーブレット周波数領域の相補的な法医学的手がかりを学習し、クロスジェネレータおよびクロスパラダイム評価の下でより堅牢性を高める。さらに,マルチドメイン表現とデータ多様性と拡張性を組み合わせることで,一般化が大幅に向上し,より信頼性の高い深度検出システムを構築するための実用的な知見が得られた。

論文の概要: A Multi-Domain Feature Fusion Framework for Generalizable Deepfake Detection Across Different Generators

関連論文リスト