Learning Pairwise Interaction for Generalizable DeepFake Detection
- URL: http://arxiv.org/abs/2302.13288v1
- Date: Sun, 26 Feb 2023 10:39:08 GMT
- Title: Learning Pairwise Interaction for Generalizable DeepFake Detection
- Authors: Ying Xu, Kiran Raja, Luisa Verdoliva, Marius Pedersen
- Abstract summary: A fast-paced development of DeepFake generation techniques challenge the detection schemes designed for known type DeepFakes.
We propose a new approach, Multi-Channel Xception Attention Pairwise Interaction (MCX-API), that exploits the power of pairwise learning and complementary information from different color space representations.
Our experiments indicate that our proposed method can generalize better than the state-of-the-art Deepfakes detectors.
- Score: 20.723277551489186
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: A fast-paced development of DeepFake generation techniques challenge the
detection schemes designed for known type DeepFakes. A reliable Deepfake
detection approach must be agnostic to generation types, which can present
diverse quality and appearance. Limited generalizability across different
generation schemes will restrict the wide-scale deployment of detectors if they
fail to handle unseen attacks in an open set scenario. We propose a new
approach, Multi-Channel Xception Attention Pairwise Interaction (MCX-API), that
exploits the power of pairwise learning and complementary information from
different color space representations in a fine-grained manner. We first
validate our idea on a publicly available dataset in a intra-class setting
(closed set) with four different Deepfake schemes. Further, we report all the
results using balanced-open-set-classification (BOSC) accuracy in an
inter-class setting (open-set) using three public datasets. Our experiments
indicate that our proposed method can generalize better than the
state-of-the-art Deepfakes detectors. We obtain 98.48% BOSC accuracy on the
FF++ dataset and 90.87% BOSC accuracy on the CelebDF dataset suggesting a
promising direction for generalization of DeepFake detection. We further
utilize t-SNE and attention maps to interpret and visualize the decision-making
process of our proposed network. https://github.com/xuyingzhongguo/MCX-API
Related papers
- Leveraging Mixture of Experts for Improved Speech Deepfake Detection [53.69740463004446]
Speech deepfakes pose a significant threat to personal security and content authenticity.
We introduce a novel approach for enhancing speech deepfake detection performance using a Mixture of Experts architecture.
arXiv Detail & Related papers (2024-09-24T13:24:03Z) - DF40: Toward Next-Generation Deepfake Detection [62.073997142001424]
existing works identify top-notch detection algorithms and models by adhering to the common practice: training detectors on one specific dataset and testing them on other prevalent deepfake datasets.
But can these stand-out "winners" be truly applied to tackle the myriad of realistic and diverse deepfakes lurking in the real world?
We construct a highly diverse deepfake detection dataset called DF40, which comprises 40 distinct deepfake techniques.
arXiv Detail & Related papers (2024-06-19T12:35:02Z) - Explicit Correlation Learning for Generalizable Cross-Modal Deepfake Detection [33.20064862916194]
This paper aims to learn potential cross-modal correlation to enhance deepfake detection towards various generation scenarios.
Our approach introduces a correlation distillation task, which models the inherent cross-modal correlation based on content information.
We present the Cross-Modal Deepfake dataset with four generation methods to evaluate the detection of diverse cross-modal deepfakes.
arXiv Detail & Related papers (2024-04-30T00:25:44Z) - CrossDF: Improving Cross-Domain Deepfake Detection with Deep Information Decomposition [53.860796916196634]
We propose a Deep Information Decomposition (DID) framework to enhance the performance of Cross-dataset Deepfake Detection (CrossDF)
Unlike most existing deepfake detection methods, our framework prioritizes high-level semantic features over specific visual artifacts.
It adaptively decomposes facial features into deepfake-related and irrelevant information, only using the intrinsic deepfake-related information for real/fake discrimination.
arXiv Detail & Related papers (2023-09-30T12:30:25Z) - Towards General Visual-Linguistic Face Forgery Detection [95.73987327101143]
Deepfakes are realistic face manipulations that can pose serious threats to security, privacy, and trust.
Existing methods mostly treat this task as binary classification, which uses digital labels or mask signals to train the detection model.
We propose a novel paradigm named Visual-Linguistic Face Forgery Detection(VLFFD), which uses fine-grained sentence-level prompts as the annotation.
arXiv Detail & Related papers (2023-07-31T10:22:33Z) - Prompt-driven efficient Open-set Semi-supervised Learning [52.30303262499391]
Open-set semi-supervised learning (OSSL) has attracted growing interest, which investigates a more practical scenario where out-of-distribution (OOD) samples are only contained in unlabeled data.
We propose a prompt-driven efficient OSSL framework, called OpenPrompt, which can propagate class information from labeled to unlabeled data with only a small number of trainable parameters.
arXiv Detail & Related papers (2022-09-28T16:25:08Z) - MD-CSDNetwork: Multi-Domain Cross Stitched Network for Deepfake
Detection [80.83725644958633]
Current deepfake generation methods leave discriminative artifacts in the frequency spectrum of fake images and videos.
We present a novel approach, termed as MD-CSDNetwork, for combining the features in the spatial and frequency domains to mine a shared discriminative representation.
arXiv Detail & Related papers (2021-09-15T14:11:53Z) - TAR: Generalized Forensic Framework to Detect Deepfakes using Weakly
Supervised Learning [17.40885531847159]
Deepfakes have become a critical social problem, and detecting them is of utmost importance.
In this work, we introduce a practical digital forensic tool to detect different types of deepfakes simultaneously.
We develop an autoencoder-based detection model with Residual blocks and sequentially perform transfer learning to detect different types of deepfakes simultaneously.
arXiv Detail & Related papers (2021-05-13T07:31:08Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.