Generalizing Face Forgery Detection with High-frequency Features
- URL: http://arxiv.org/abs/2103.12376v1
- Date: Tue, 23 Mar 2021 08:19:21 GMT
- Title: Generalizing Face Forgery Detection with High-frequency Features
- Authors: Yuchen Luo, Yong Zhang, Junchi Yan, Wei Liu
- Abstract summary: Current CNN-based detectors tend to overfit to method-specific color textures and thus fail to generalize.
We propose to utilize the high-frequency noises for face forgery detection.
The first is the multi-scale high-frequency feature extraction module that extracts high-frequency noises at multiple scales.
The second is the residual-guided spatial attention module that guides the low-level RGB feature extractor to concentrate more on forgery traces from a new perspective.
- Score: 63.33397573649408
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Current face forgery detection methods achieve high accuracy under the
within-database scenario where training and testing forgeries are synthesized
by the same algorithm. However, few of them gain satisfying performance under
the cross-database scenario where training and testing forgeries are
synthesized by different algorithms. In this paper, we find that current
CNN-based detectors tend to overfit to method-specific color textures and thus
fail to generalize. Observing that image noises remove color textures and
expose discrepancies between authentic and tampered regions, we propose to
utilize the high-frequency noises for face forgery detection. We carefully
devise three functional modules to take full advantage of the high-frequency
features. The first is the multi-scale high-frequency feature extraction module
that extracts high-frequency noises at multiple scales and composes a novel
modality. The second is the residual-guided spatial attention module that
guides the low-level RGB feature extractor to concentrate more on forgery
traces from a new perspective. The last is the cross-modality attention module
that leverages the correlation between the two complementary modalities to
promote feature learning for each other. Comprehensive evaluations on several
benchmark databases corroborate the superior generalization performance of our
proposed method.
Related papers
- Multiple Contexts and Frequencies Aggregation Network forDeepfake Detection [5.65128683992597]
Deepfake detection faces increasing challenges since the fast growth of generative models in developing massive and diverse Deepfake technologies.
Recent advances rely on introducing features from spatial or frequency domains rather than modeling general forgery features within backbones.
We propose an efficient network for face forgery detection named MkfaNet, which consists of two core modules.
arXiv Detail & Related papers (2024-08-03T05:34:53Z) - UniForensics: Face Forgery Detection via General Facial Representation [60.5421627990707]
High-level semantic features are less susceptible to perturbations and not limited to forgery-specific artifacts, thus having stronger generalization.
We introduce UniForensics, a novel deepfake detection framework that leverages a transformer-based video network, with a meta-functional face classification for enriched facial representation.
arXiv Detail & Related papers (2024-07-26T20:51:54Z) - Statistics-aware Audio-visual Deepfake Detector [11.671275975119089]
Methods in audio-visualfake detection mostly assess the synchronization between audio and visual features.
We propose a statistical feature loss to enhance the discrimination capability of the model.
Experiments on the DFDC and FakeAVCeleb datasets demonstrate the relevance of the proposed method.
arXiv Detail & Related papers (2024-07-16T12:15:41Z) - Frequency-Aware Deepfake Detection: Improving Generalizability through
Frequency Space Learning [81.98675881423131]
This research addresses the challenge of developing a universal deepfake detector that can effectively identify unseen deepfake images.
Existing frequency-based paradigms have relied on frequency-level artifacts introduced during the up-sampling in GAN pipelines to detect forgeries.
We introduce a novel frequency-aware approach called FreqNet, centered around frequency domain learning, specifically designed to enhance the generalizability of deepfake detectors.
arXiv Detail & Related papers (2024-03-12T01:28:00Z) - Fully Differentiable Correlation-driven 2D/3D Registration for X-ray to CT Image Fusion [3.868072865207522]
Image-based rigid 2D/3D registration is a critical technique for fluoroscopic guided surgical interventions.
We propose a novel fully differentiable correlation-driven network using a dual-branch CNN-transformer encoder.
A correlation-driven loss is proposed for low-frequency feature and high-frequency feature decomposition based on embedded information.
arXiv Detail & Related papers (2024-02-04T14:12:51Z) - Frequency Perception Network for Camouflaged Object Detection [51.26386921922031]
We propose a novel learnable and separable frequency perception mechanism driven by the semantic hierarchy in the frequency domain.
Our entire network adopts a two-stage model, including a frequency-guided coarse localization stage and a detail-preserving fine localization stage.
Compared with the currently existing models, our proposed method achieves competitive performance in three popular benchmark datasets.
arXiv Detail & Related papers (2023-08-17T11:30:46Z) - ReDFeat: Recoupling Detection and Description for Multimodal Feature
Learning [51.07496081296863]
We recouple independent constraints of detection and description of multimodal feature learning with a mutual weighting strategy.
We propose a detector that possesses a large receptive field and is equipped with learnable non-maximum suppression layers.
We build a benchmark that contains cross visible, infrared, near-infrared and synthetic aperture radar image pairs for evaluating the performance of features in feature matching and image registration tasks.
arXiv Detail & Related papers (2022-05-16T04:24:22Z) - Exploiting Fine-grained Face Forgery Clues via Progressive Enhancement
Learning [12.585152735152937]
forgery detection has attracted more and more attention due to security concerns.
Existing approaches attempt to use frequency information to mine subtle artifacts under high-quality forged faces.
We propose a progressive enhancement learning framework to exploit both the RGB and fine-grained frequency clues.
arXiv Detail & Related papers (2021-12-28T03:18:53Z) - Multi-scale Interactive Network for Salient Object Detection [91.43066633305662]
We propose the aggregate interaction modules to integrate the features from adjacent levels.
To obtain more efficient multi-scale features, the self-interaction modules are embedded in each decoder unit.
Experimental results on five benchmark datasets demonstrate that the proposed method without any post-processing performs favorably against 23 state-of-the-art approaches.
arXiv Detail & Related papers (2020-07-17T15:41:37Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.