Investigating the Impact of Pre-processing and Prediction Aggregation on
the DeepFake Detection Task
- URL: http://arxiv.org/abs/2006.07084v3
- Date: Mon, 19 Oct 2020 10:22:15 GMT
- Title: Investigating the Impact of Pre-processing and Prediction Aggregation on
the DeepFake Detection Task
- Authors: Polychronis Charitidis, Giorgos Kordopatis-Zilos, Symeon Papadopoulos,
Ioannis Kompatsiaris
- Abstract summary: We propose a pre-processing step to improve the training data quality and examine its effect on the performance of DeepFake detection.
We also propose and evaluate the effect of video-level prediction aggregation approaches.
- Score: 20.21594285488186
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Recent advances in content generation technologies (widely known as
DeepFakes) along with the online proliferation of manipulated media content
render the detection of such manipulations a task of increasing importance.
Even though there are many DeepFake detection methods, only a few focus on the
impact of dataset preprocessing and the aggregation of frame-level to
video-level prediction on model performance. In this paper, we propose a
pre-processing step to improve the training data quality and examine its effect
on the performance of DeepFake detection. We also propose and evaluate the
effect of video-level prediction aggregation approaches. Experimental results
show that the proposed pre-processing approach leads to considerable
improvements in the performance of detection models, and the proposed
prediction aggregation scheme further boosts the detection efficiency in cases
where there are multiple faces in a video.
Related papers
- Leveraging Mixture of Experts for Improved Speech Deepfake Detection [53.69740463004446]
Speech deepfakes pose a significant threat to personal security and content authenticity.
We introduce a novel approach for enhancing speech deepfake detection performance using a Mixture of Experts architecture.
arXiv Detail & Related papers (2024-09-24T13:24:03Z) - Data Augmentation via Latent Diffusion for Saliency Prediction [67.88936624546076]
Saliency prediction models are constrained by the limited diversity and quantity of labeled data.
We propose a novel data augmentation method for deep saliency prediction that edits natural images while preserving the complexity and variability of real-world scenes.
arXiv Detail & Related papers (2024-09-11T14:36:24Z) - Towards More General Video-based Deepfake Detection through Facial Feature Guided Adaptation for Foundation Model [15.61920157541529]
We propose a novel Deepfake detection approach by adapting the Foundation Models with rich information encoded inside.
Inspired by the recent advances of parameter efficient fine-tuning, we propose a novel side-network-based decoder.
Our approach exhibits superior effectiveness in identifying unseen Deepfake samples, achieving notable performance improvement.
arXiv Detail & Related papers (2024-04-08T14:58:52Z) - DetDiffusion: Synergizing Generative and Perceptive Models for Enhanced Data Generation and Perception [78.26734070960886]
Current perceptive models heavily depend on resource-intensive datasets.
We introduce perception-aware loss (P.A. loss) through segmentation, improving both quality and controllability.
Our method customizes data augmentation by extracting and utilizing perception-aware attribute (P.A. Attr) during generation.
arXiv Detail & Related papers (2024-03-20T04:58:03Z) - Self-Supervised Graph Transformer for Deepfake Detection [1.8133635752982105]
Deepfake detection methods have shown promising results in recognizing forgeries within a given dataset.
Deepfake detection system must remain impartial to forgery types, appearance, and quality for guaranteed generalizable detection performance.
This study introduces a deepfake detection framework, leveraging a self-supervised pre-training model that delivers exceptional generalization ability.
arXiv Detail & Related papers (2023-07-27T17:22:41Z) - CamoDiffusion: Camouflaged Object Detection via Conditional Diffusion
Models [72.93652777646233]
Camouflaged Object Detection (COD) is a challenging task in computer vision due to the high similarity between camouflaged objects and their surroundings.
We propose a new paradigm that treats COD as a conditional mask-generation task leveraging diffusion models.
Our method, dubbed CamoDiffusion, employs the denoising process of diffusion models to iteratively reduce the noise of the mask.
arXiv Detail & Related papers (2023-05-29T07:49:44Z) - A positive feedback method based on F-measure value for Salient Object
Detection [1.9249287163937976]
This paper proposes a positive feedback method based on F-measure value for salient object detection (SOD)
Our proposed method takes an image to be detected and inputs it into several existing models to obtain their respective prediction maps.
Experimental results on five publicly available datasets show that our proposed positive feedback method outperforms the latest 12 methods in five evaluation metrics for saliency map prediction.
arXiv Detail & Related papers (2023-04-28T04:05:13Z) - Impact of Video Processing Operations in Deepfake Detection [13.334500258498798]
Digital face manipulation in video has attracted extensive attention due to the increased risk to public trust.
Deep learning-based deepfake detection methods have been developed and have shown impressive results.
The performance of these detectors is often evaluated using benchmarks that hardly reflect real-world situations.
arXiv Detail & Related papers (2023-03-30T09:24:17Z) - AntPivot: Livestream Highlight Detection via Hierarchical Attention
Mechanism [64.70568612993416]
We formulate a new task Livestream Highlight Detection, discuss and analyze the difficulties listed above and propose a novel architecture AntPivot to solve this problem.
We construct a fully-annotated dataset AntHighlight to instantiate this task and evaluate the performance of our model.
arXiv Detail & Related papers (2022-06-10T05:58:11Z) - A New Approach to Improve Learning-based Deepfake Detection in Realistic
Conditions [13.334500258498798]
Deep convolutional neural networks have achieved exceptional results on multiple detection and recognition tasks.
The impact of conventional distortions and processing operations found in imaging such as compression, noise, and enhancement are not sufficiently studied.
This paper proposes a more effective data augmentation scheme based on real-world image degradation process.
arXiv Detail & Related papers (2022-03-22T15:16:54Z) - Learning Monocular Dense Depth from Events [53.078665310545745]
Event cameras produce brightness changes in the form of a stream of asynchronous events instead of intensity frames.
Recent learning-based approaches have been applied to event-based data, such as monocular depth prediction.
We propose a recurrent architecture to solve this task and show significant improvement over standard feed-forward methods.
arXiv Detail & Related papers (2020-10-16T12:36:23Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.