Exploiting Fine-grained Face Forgery Clues via Progressive Enhancement
Learning
- URL: http://arxiv.org/abs/2112.13977v1
- Date: Tue, 28 Dec 2021 03:18:53 GMT
- Title: Exploiting Fine-grained Face Forgery Clues via Progressive Enhancement
Learning
- Authors: Qiqi Gu, Shen Chen, Taiping Yao, Yang Chen, Shouhong Ding, Ran Yi
- Abstract summary: forgery detection has attracted more and more attention due to security concerns.
Existing approaches attempt to use frequency information to mine subtle artifacts under high-quality forged faces.
We propose a progressive enhancement learning framework to exploit both the RGB and fine-grained frequency clues.
- Score: 12.585152735152937
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: With the rapid development of facial forgery techniques, forgery detection
has attracted more and more attention due to security concerns. Existing
approaches attempt to use frequency information to mine subtle artifacts under
high-quality forged faces. However, the exploitation of frequency information
is coarse-grained, and more importantly, their vanilla learning process
struggles to extract fine-grained forgery traces. To address this issue, we
propose a progressive enhancement learning framework to exploit both the RGB
and fine-grained frequency clues. Specifically, we perform a fine-grained
decomposition of RGB images to completely decouple the real and fake traces in
the frequency space. Subsequently, we propose a progressive enhancement
learning framework based on a two-branch network, combined with
self-enhancement and mutual-enhancement modules. The self-enhancement module
captures the traces in different input spaces based on spatial noise
enhancement and channel attention. The Mutual-enhancement module concurrently
enhances RGB and frequency features by communicating in the shared spatial
dimension. The progressive enhancement process facilitates the learning of
discriminative features with fine-grained face forgery clues. Extensive
experiments on several datasets show that our method outperforms the
state-of-the-art face forgery detection methods.
Related papers
- DA-HFNet: Progressive Fine-Grained Forgery Image Detection and Localization Based on Dual Attention [12.36906630199689]
We construct a DA-HFNet forged image dataset guided by text or image-assisted GAN and Diffusion model.
Our goal is to utilize a hierarchical progressive network to capture forged artifacts at different scales for detection and localization.
arXiv Detail & Related papers (2024-06-03T16:13:33Z) - Unified Frequency-Assisted Transformer Framework for Detecting and
Grounding Multi-Modal Manipulation [109.1912721224697]
We present the Unified Frequency-Assisted transFormer framework, named UFAFormer, to address the DGM4 problem.
By leveraging the discrete wavelet transform, we decompose images into several frequency sub-bands, capturing rich face forgery artifacts.
Our proposed frequency encoder, incorporating intra-band and inter-band self-attentions, explicitly aggregates forgery features within and across diverse sub-bands.
arXiv Detail & Related papers (2023-09-18T11:06:42Z) - Adaptive Frequency Learning in Two-branch Face Forgery Detection [66.91715092251258]
We propose Adaptively learn Frequency information in the two-branch Detection framework, dubbed AFD.
We liberate our network from the fixed frequency transforms, and achieve better performance with our data- and task-dependent transform layers.
arXiv Detail & Related papers (2022-03-27T14:25:52Z) - MC-LCR: Multi-modal contrastive classification by locally correlated
representations for effective face forgery detection [11.124150983521158]
We propose a novel framework named Multi-modal Contrastive Classification by Locally Correlated Representations.
Our MC-LCR aims to amplify implicit local discrepancies between authentic and forged faces from both spatial and frequency domains.
We achieve state-of-the-art performance and demonstrate the robustness and generalization of our method.
arXiv Detail & Related papers (2021-10-07T09:24:12Z) - SSCAN: A Spatial-spectral Cross Attention Network for Hyperspectral
Image Denoising [12.873607414761093]
We propose a novel HSI denoising network, termed SSCAN, that combines group convolutions and attention modules.
We propose a spectral-spatial attention block (SSAB) to exploit the spatial and spectral information in hyperspectral images in an effective manner.
The experimental results indicate that the proposed SSCAN outperforms several state-of-the-art HSI denoising algorithms.
arXiv Detail & Related papers (2021-05-23T14:36:17Z) - Local Relation Learning for Face Forgery Detection [73.73130683091154]
We propose a novel perspective of face forgery detection via local relation learning.
Specifically, we propose a Multi-scale Patch Similarity Module (MPSM), which measures the similarity between features of local regions.
We also propose an RGB-Frequency Attention Module (RFAM) to fuse information in both RGB and frequency domains for more comprehensive local feature representation.
arXiv Detail & Related papers (2021-05-06T10:44:32Z) - Generalizing Face Forgery Detection with High-frequency Features [63.33397573649408]
Current CNN-based detectors tend to overfit to method-specific color textures and thus fail to generalize.
We propose to utilize the high-frequency noises for face forgery detection.
The first is the multi-scale high-frequency feature extraction module that extracts high-frequency noises at multiple scales.
The second is the residual-guided spatial attention module that guides the low-level RGB feature extractor to concentrate more on forgery traces from a new perspective.
arXiv Detail & Related papers (2021-03-23T08:19:21Z) - Spatial-Phase Shallow Learning: Rethinking Face Forgery Detection in
Frequency Domain [88.7339322596758]
We present a novel Spatial-Phase Shallow Learning (SPSL) method, which combines spatial image and phase spectrum to capture the up-sampling artifacts of face forgery.
SPSL can achieve the state-of-the-art performance on cross-datasets evaluation as well as multi-class classification and obtain comparable results on single dataset evaluation.
arXiv Detail & Related papers (2021-03-02T16:45:08Z) - Robust Data Hiding Using Inverse Gradient Attention [82.73143630466629]
In the data hiding task, each pixel of cover images should be treated differently since they have divergent tolerabilities.
We propose a novel deep data hiding scheme with Inverse Gradient Attention (IGA), combing the ideas of adversarial learning and attention mechanism.
Empirically, extensive experiments show that the proposed model outperforms the state-of-the-art methods on two prevalent datasets.
arXiv Detail & Related papers (2020-11-21T19:08:23Z) - Two-Level Attention-based Fusion Learning for RGB-D Face Recognition [21.735238213921804]
A novel attention aware method is proposed to fuse two image modalities, RGB and depth, for enhanced RGB-D facial recognition.
The proposed method first extracts features from both modalities using a convolutional feature extractor.
These features are then fused using a two-layer attention mechanism.
arXiv Detail & Related papers (2020-02-29T03:18:52Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.