Learning Second Order Local Anomaly for General Face Forgery Detection
- URL: http://arxiv.org/abs/2209.15490v1
- Date: Fri, 30 Sep 2022 14:27:11 GMT
- Title: Learning Second Order Local Anomaly for General Face Forgery Detection
- Authors: Jianwei Fei, Yunshu Dai, Peipeng Yu, Tianrun Shen, Zhihua Xia, Jian
Weng
- Abstract summary: We propose a novel method to improve the generalization ability of CNN-based face forgery detectors.
Specifically, we propose a weakly supervised Second Order Local Anomaly (SOLA) learning module to mine anomalies in local regions.
We also propose a Local Enhancement Module (LEM) to improve the discrimination between local features of real and forged regions.
- Score: 7.6896977871741985
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: In this work, we propose a novel method to improve the generalization ability
of CNN-based face forgery detectors. Our method considers the feature anomalies
of forged faces caused by the prevalent blending operations in face forgery
algorithms. Specifically, we propose a weakly supervised Second Order Local
Anomaly (SOLA) learning module to mine anomalies in local regions using deep
feature maps. SOLA first decomposes the neighborhood of local features by
different directions and distances and then calculates the first and second
order local anomaly maps which provide more general forgery traces for the
classifier. We also propose a Local Enhancement Module (LEM) to improve the
discrimination between local features of real and forged regions, so as to
ensure accuracy in calculating anomalies. Besides, an improved Adaptive Spatial
Rich Model (ASRM) is introduced to help mine subtle noise features via
learnable high pass filters. With neither pixel level annotations nor external
synthetic data, our method using a simple ResNet18 backbone achieves
competitive performances compared with state-of-the-art works when evaluated on
unseen forgeries.
Related papers
- Mixture-of-Noises Enhanced Forgery-Aware Predictor for Multi-Face Manipulation Detection and Localization [52.87635234206178]
This paper proposes a new framework, namely MoNFAP, specifically tailored for multi-face manipulation detection and localization.
The framework incorporates two novel modules: the Forgery-aware Unified Predictor (FUP) Module and the Mixture-of-Noises Module (MNM)
arXiv Detail & Related papers (2024-08-05T08:35:59Z) - DETR Doesn't Need Multi-Scale or Locality Design [69.56292005230185]
This paper presents an improved DETR detector that maintains a "plain" nature.
It uses a single-scale feature map and global cross-attention calculations without specific locality constraints.
We show that two simple technologies are surprisingly effective within a plain design to compensate for the lack of multi-scale feature maps and locality constraints.
arXiv Detail & Related papers (2023-08-03T17:59:04Z) - Region-Based Semantic Factorization in GANs [67.90498535507106]
We present a highly efficient algorithm to factorize the latent semantics learned by Generative Adversarial Networks (GANs) concerning an arbitrary image region.
Through an appropriately defined generalized Rayleigh quotient, we solve such a problem without any annotations or training.
Experimental results on various state-of-the-art GAN models demonstrate the effectiveness of our approach.
arXiv Detail & Related papers (2022-02-19T17:46:02Z) - Denoised Non-Local Neural Network for Semantic Segmentation [18.84185406522064]
We propose a Denoised Non-Local Network (Denoised NL) to eliminate the inter-class and intra-class noises respectively.
Our proposed NL can achieve the state-of-the-art performance of 83.5% and 46.69% mIoU on Cityscapes and ADE20K, respectively.
arXiv Detail & Related papers (2021-10-27T06:16:31Z) - Local Relation Learning for Face Forgery Detection [73.73130683091154]
We propose a novel perspective of face forgery detection via local relation learning.
Specifically, we propose a Multi-scale Patch Similarity Module (MPSM), which measures the similarity between features of local regions.
We also propose an RGB-Frequency Attention Module (RFAM) to fuse information in both RGB and frequency domains for more comprehensive local feature representation.
arXiv Detail & Related papers (2021-05-06T10:44:32Z) - Generalizing Face Forgery Detection with High-frequency Features [63.33397573649408]
Current CNN-based detectors tend to overfit to method-specific color textures and thus fail to generalize.
We propose to utilize the high-frequency noises for face forgery detection.
The first is the multi-scale high-frequency feature extraction module that extracts high-frequency noises at multiple scales.
The second is the residual-guided spatial attention module that guides the low-level RGB feature extractor to concentrate more on forgery traces from a new perspective.
arXiv Detail & Related papers (2021-03-23T08:19:21Z) - Making Affine Correspondences Work in Camera Geometry Computation [62.7633180470428]
Local features provide region-to-region rather than point-to-point correspondences.
We propose guidelines for effective use of region-to-region matches in the course of a full model estimation pipeline.
Experiments show that affine solvers can achieve accuracy comparable to point-based solvers at faster run-times.
arXiv Detail & Related papers (2020-07-20T12:07:48Z) - ASLFeat: Learning Local Features of Accurate Shape and Localization [42.70030492742363]
We present ASLFeat, with three light-weight yet effective modifications to mitigate above issues.
First, we resort to deformable convolutional networks to densely estimate and apply local transformation.
Second, we take advantage of the inherent feature hierarchy to restore spatial resolution and low-level details for accurate keypoint localization.
arXiv Detail & Related papers (2020-03-23T04:03:03Z) - Deep Fusion of Local and Non-Local Features for Precision Landslide
Recognition [17.896249114628336]
This paper proposes an effective approach to fuse both local and non-local features to surmount the contextual problem.
Built upon the U-Net architecture that is widely adopted in the remote sensing community, we utilize two additional modules.
Experimental evaluations revealed that the proposed method outperformed state-of-the-art general-purpose semantic segmentation approaches.
arXiv Detail & Related papers (2020-02-20T03:18:59Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.