Spatial-Phase Shallow Learning: Rethinking Face Forgery Detection in
Frequency Domain
- URL: http://arxiv.org/abs/2103.01856v1
- Date: Tue, 2 Mar 2021 16:45:08 GMT
- Title: Spatial-Phase Shallow Learning: Rethinking Face Forgery Detection in
Frequency Domain
- Authors: Honggu Liu, Xiaodan Li, Wenbo Zhou, Yuefeng Chen, Yuan He, Hui Xue,
Weiming Zhang and Nenghai Yu
- Abstract summary: We present a novel Spatial-Phase Shallow Learning (SPSL) method, which combines spatial image and phase spectrum to capture the up-sampling artifacts of face forgery.
SPSL can achieve the state-of-the-art performance on cross-datasets evaluation as well as multi-class classification and obtain comparable results on single dataset evaluation.
- Score: 88.7339322596758
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The remarkable success in face forgery techniques has received considerable
attention in computer vision due to security concerns. We observe that
up-sampling is a necessary step of most face forgery techniques, and cumulative
up-sampling will result in obvious changes in the frequency domain, especially
in the phase spectrum. According to the property of natural images, the phase
spectrum preserves abundant frequency components that provide extra information
and complement the loss of the amplitude spectrum. To this end, we present a
novel Spatial-Phase Shallow Learning (SPSL) method, which combines spatial
image and phase spectrum to capture the up-sampling artifacts of face forgery
to improve the transferability, for face forgery detection. And we also
theoretically analyze the validity of utilizing the phase spectrum. Moreover,
we notice that local texture information is more crucial than high-level
semantic information for the face forgery detection task. So we reduce the
receptive fields by shallowing the network to suppress high-level features and
focus on the local region. Extensive experiments show that SPSL can achieve the
state-of-the-art performance on cross-datasets evaluation as well as
multi-class classification and obtain comparable results on single dataset
evaluation.
Related papers
- Wavelet-based Bi-dimensional Aggregation Network for SAR Image Change Detection [53.842568573251214]
Experimental results on three SAR datasets demonstrate that our WBANet significantly outperforms contemporary state-of-the-art methods.
Our WBANet achieves 98.33%, 96.65%, and 96.62% of percentage of correct classification (PCC) on the respective datasets.
arXiv Detail & Related papers (2024-07-18T04:36:10Z) - Frequency-Aware Deepfake Detection: Improving Generalizability through
Frequency Space Learning [81.98675881423131]
This research addresses the challenge of developing a universal deepfake detector that can effectively identify unseen deepfake images.
Existing frequency-based paradigms have relied on frequency-level artifacts introduced during the up-sampling in GAN pipelines to detect forgeries.
We introduce a novel frequency-aware approach called FreqNet, centered around frequency domain learning, specifically designed to enhance the generalizability of deepfake detectors.
arXiv Detail & Related papers (2024-03-12T01:28:00Z) - Diffusion Facial Forgery Detection [56.69763252655695]
This paper introduces DiFF, a comprehensive dataset dedicated to face-focused diffusion-generated images.
We conduct extensive experiments on the DiFF dataset via a human test and several representative forgery detection methods.
The results demonstrate that the binary detection accuracy of both human observers and automated detectors often falls below 30%.
arXiv Detail & Related papers (2024-01-29T03:20:19Z) - FS-BAND: A Frequency-Sensitive Banding Detector [55.59101150019851]
Banding artifact, as known as staircase-like contour, is a common quality annoyance that happens in compression, transmission, etc.
We propose a no-reference banding detection model to capture and evaluate banding artifacts, called the Frequency-Sensitive BANding Detector (FS-BAND)
Experimental results show that the proposed FS-BAND method outperforms state-of-the-art image quality assessment (IQA) approaches with higher accuracy in banding classification task.
arXiv Detail & Related papers (2023-11-30T03:20:42Z) - A Survey of Graph and Attention Based Hyperspectral Image Classification
Methods for Remote Sensing Data [5.1901440366375855]
The use of Deep Learning techniques for classification in Hyperspectral Imaging (HSI) is rapidly growing.
Recent methods have also explored the usage of Graph Convolution Networks and their unique ability to use node features in prediction.
arXiv Detail & Related papers (2023-10-16T00:42:25Z) - Attention Consistency Refined Masked Frequency Forgery Representation
for Generalizing Face Forgery Detection [96.539862328788]
Existing forgery detection methods suffer from unsatisfactory generalization ability to determine the authenticity in the unseen domain.
We propose a novel Attention Consistency Refined masked frequency forgery representation model toward generalizing face forgery detection algorithm (ACMF)
Experiment results on several public face forgery datasets demonstrate the superior performance of the proposed method compared with the state-of-the-art methods.
arXiv Detail & Related papers (2023-07-21T08:58:49Z) - MC-LCR: Multi-modal contrastive classification by locally correlated
representations for effective face forgery detection [11.124150983521158]
We propose a novel framework named Multi-modal Contrastive Classification by Locally Correlated Representations.
Our MC-LCR aims to amplify implicit local discrepancies between authentic and forged faces from both spatial and frequency domains.
We achieve state-of-the-art performance and demonstrate the robustness and generalization of our method.
arXiv Detail & Related papers (2021-10-07T09:24:12Z) - Amplitude-Phase Recombination: Rethinking Robustness of Convolutional
Neural Networks in Frequency Domain [31.182376196295365]
CNN tends to converge at the local optimum which is closely related to the high-frequency components of the training images.
A new perspective on data augmentation designed by re-combing the phase spectrum of the current image and the amplitude spectrum of the distracter image.
arXiv Detail & Related papers (2021-08-19T04:04:41Z) - SSCAN: A Spatial-spectral Cross Attention Network for Hyperspectral
Image Denoising [12.873607414761093]
We propose a novel HSI denoising network, termed SSCAN, that combines group convolutions and attention modules.
We propose a spectral-spatial attention block (SSAB) to exploit the spatial and spectral information in hyperspectral images in an effective manner.
The experimental results indicate that the proposed SSCAN outperforms several state-of-the-art HSI denoising algorithms.
arXiv Detail & Related papers (2021-05-23T14:36:17Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.