Investigating the Impact of Pre-processing and Prediction Aggregation on
the DeepFake Detection Task
- URL: http://arxiv.org/abs/2006.07084v3
- Date: Mon, 19 Oct 2020 10:22:15 GMT
- Title: Investigating the Impact of Pre-processing and Prediction Aggregation on
the DeepFake Detection Task
- Authors: Polychronis Charitidis, Giorgos Kordopatis-Zilos, Symeon Papadopoulos,
Ioannis Kompatsiaris
- Abstract summary: We propose a pre-processing step to improve the training data quality and examine its effect on the performance of DeepFake detection.
We also propose and evaluate the effect of video-level prediction aggregation approaches.
- Score: 20.21594285488186
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Recent advances in content generation technologies (widely known as
DeepFakes) along with the online proliferation of manipulated media content
render the detection of such manipulations a task of increasing importance.
Even though there are many DeepFake detection methods, only a few focus on the
impact of dataset preprocessing and the aggregation of frame-level to
video-level prediction on model performance. In this paper, we propose a
pre-processing step to improve the training data quality and examine its effect
on the performance of DeepFake detection. We also propose and evaluate the
effect of video-level prediction aggregation approaches. Experimental results
show that the proposed pre-processing approach leads to considerable
improvements in the performance of detection models, and the proposed
prediction aggregation scheme further boosts the detection efficiency in cases
where there are multiple faces in a video.
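As a rough illustration of what aggregating frame-level predictions into a video-level prediction can look like when a video contains several faces, here is a minimal sketch. The abstract does not specify the aggregation scheme, so the rule used here (average the scores of each face over its frames, then take the highest face score as the video score) is an assumption, as are the function name `aggregate_video_score`, the `(face_id, probability)` input layout, and the sample scores.

```python
from collections import defaultdict
from statistics import mean


def aggregate_video_score(frame_predictions):
    """Combine frame-level fake probabilities into one video-level score.

    frame_predictions: iterable of (face_id, probability) pairs, one per
    detected face per frame. Both the input layout and the mean-then-max
    rule are assumptions made for illustration, not the paper's method.
    """
    scores_by_face = defaultdict(list)
    for face_id, probability in frame_predictions:
        scores_by_face[face_id].append(probability)
    if not scores_by_face:
        raise ValueError("no face predictions to aggregate")
    # Average over frames for each face, so a single noisy frame has limited effect.
    per_face = {face: mean(scores) for face, scores in scores_by_face.items()}
    # Treat the video as manipulated if any single face looks manipulated.
    return max(per_face.values()), per_face


# Hypothetical scores: face "a" looks real, face "b" looks manipulated.
predictions = [("a", 0.1), ("b", 0.9), ("a", 0.2), ("b", 0.8), ("a", 0.3)]
video_score, per_face = aggregate_video_score(predictions)
print(round(video_score, 2))
```

Averaging over all faces and frames at once would dilute the manipulated face in this example, which is why the sketch groups scores by face before combining them.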
Related papers
- Leveraging Mixture of Experts for Improved Speech Deepfake Detection [53.69740463004446]
Speech deepfakes pose a significant threat to personal security and content authenticity.
We introduce a novel approach for enhancing speech deepfake detection performance using a Mixture of Experts architecture.
arXiv Detail & Related papers (2024-09-24T13:24:03Z)
- Data Augmentation via Latent Diffusion for Saliency Prediction [67.88936624546076]
Saliency prediction models are constrained by the limited diversity and quantity of labeled data.
We propose a novel data augmentation method for deep saliency prediction that edits natural images while preserving the complexity and variability of real-world scenes.
arXiv Detail & Related papers (2024-09-11T14:36:24Z)
- Standing on the Shoulders of Giants: Reprogramming Visual-Language Model for General Deepfake Detection [16.21235742118949]
We propose a novel approach that repurposes well-trained Vision-Language Models (VLMs) for general deepfake detection.
Motivated by the model reprogramming paradigm, which manipulates the model prediction via input perturbations, our method can reprogram a pre-trained VLM.
Experiments on several popular benchmark datasets demonstrate that the cross-dataset and cross-manipulation performances of deepfake detection can be significantly and consistently improved.
arXiv Detail & Related papers (2024-09-04T12:46:30Z)
- DetDiffusion: Synergizing Generative and Perceptive Models for Enhanced Data Generation and Perception [78.26734070960886]
Current perceptive models heavily depend on resource-intensive datasets.
We introduce perception-aware loss (P.A. loss) through segmentation, improving both quality and controllability.
Our method customizes data augmentation by extracting and utilizing perception-aware attribute (P.A. Attr) during generation.
arXiv Detail & Related papers (2024-03-20T04:58:03Z)
- Self-Supervised Graph Transformer for Deepfake Detection [1.8133635752982105]
Deepfake detection methods have shown promising results in recognizing forgeries within a given dataset.
A deepfake detection system must remain impartial to forgery types, appearance, and quality to guarantee generalizable detection performance.
This study introduces a deepfake detection framework, leveraging a self-supervised pre-training model that delivers exceptional generalization ability.
arXiv Detail & Related papers (2023-07-27T17:22:41Z)
- CamoDiffusion: Camouflaged Object Detection via Conditional Diffusion Models [72.93652777646233]
Camouflaged Object Detection (COD) is a challenging task in computer vision due to the high similarity between camouflaged objects and their surroundings.
We propose a new paradigm that treats COD as a conditional mask-generation task leveraging diffusion models.
Our method, dubbed CamoDiffusion, employs the denoising process of diffusion models to iteratively reduce the noise of the mask.
arXiv Detail & Related papers (2023-05-29T07:49:44Z)
- A positive feedback method based on F-measure value for Salient Object Detection [1.9249287163937976]
This paper proposes a positive feedback method based on F-measure value for salient object detection (SOD).
Our proposed method takes an image to be detected and inputs it into several existing models to obtain their respective prediction maps.
Experimental results on five publicly available datasets show that our proposed positive feedback method outperforms the latest 12 methods in five evaluation metrics for saliency map prediction.
arXiv Detail & Related papers (2023-04-28T04:05:13Z)
- Impact of Video Processing Operations in Deepfake Detection [13.334500258498798]
Digital face manipulation in video has attracted extensive attention due to the increased risk to public trust.
Deep learning-based deepfake detection methods have been developed and have shown impressive results.
The performance of these detectors is often evaluated using benchmarks that hardly reflect real-world situations.
arXiv Detail & Related papers (2023-03-30T09:24:17Z)
- AntPivot: Livestream Highlight Detection via Hierarchical Attention Mechanism [64.70568612993416]
We formulate a new task, Livestream Highlight Detection, discuss and analyze its difficulties, and propose a novel architecture, AntPivot, to solve this problem.
We construct a fully annotated dataset, AntHighlight, to instantiate this task and evaluate the performance of our model.
arXiv Detail & Related papers (2022-06-10T05:58:11Z)
- A New Approach to Improve Learning-based Deepfake Detection in Realistic Conditions [13.334500258498798]
Deep convolutional neural networks have achieved exceptional results on multiple detection and recognition tasks.
The impact of conventional distortions and processing operations found in imaging, such as compression, noise, and enhancement, is not sufficiently studied.
This paper proposes a more effective data augmentation scheme based on a real-world image degradation process.
arXiv Detail & Related papers (2022-03-22T15:16:54Z)
- Learning Monocular Dense Depth from Events [53.078665310545745]
Event cameras output brightness changes in the form of a stream of asynchronous events instead of intensity frames.
Recent learning-based approaches have been applied to event-based data for tasks such as monocular depth prediction.
We propose a recurrent architecture to solve this task and show significant improvement over standard feed-forward methods.
arXiv Detail & Related papers (2020-10-16T12:36:23Z)
This list is automatically generated from the titles and abstracts of the papers on this site.