How Deep Are the Fakes? Focusing on Audio Deepfake: A Survey
- URL: http://arxiv.org/abs/2111.14203v1
- Date: Sun, 28 Nov 2021 18:28:30 GMT
- Title: How Deep Are the Fakes? Focusing on Audio Deepfake: A Survey
- Authors: Zahra Khanjani, Gabrielle Watson, and Vandana P. Janeja
- Abstract summary: This paper critically analyzes and provides a unique source of audio deepfake research, mostly ranging from 2016 to 2020.
This survey provides readers with a summary of 1) different deepfake categories 2) how they could be created and detected 3) the most recent trends in this domain and shortcomings in detection methods.
We found that Generative Adversarial Networks(GAN), Convolutional Neural Networks (CNN), and Deep Neural Networks (DNN) are common ways of creating and detecting deepfakes.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Deepfake is content or material that is synthetically generated or
manipulated using artificial intelligence (AI) methods, to be passed off as
real and can include audio, video, image, and text synthesis. This survey has
been conducted with a different perspective compared to existing survey papers,
that mostly focus on just video and image deepfakes. This survey not only
evaluates generation and detection methods in the different deepfake
categories, but mainly focuses on audio deepfakes that are overlooked in most
of the existing surveys. This paper critically analyzes and provides a unique
source of audio deepfake research, mostly ranging from 2016 to 2020. To the
best of our knowledge, this is the first survey focusing on audio deepfakes in
English. This survey provides readers with a summary of 1) different deepfake
categories 2) how they could be created and detected 3) the most recent trends
in this domain and shortcomings in detection methods 4) audio deepfakes, how
they are created and detected in more detail which is the main focus of this
paper. We found that Generative Adversarial Networks(GAN), Convolutional Neural
Networks (CNN), and Deep Neural Networks (DNN) are common ways of creating and
detecting deepfakes. In our evaluation of over 140 methods we found that the
majority of the focus is on video deepfakes and in particular in the generation
of video deepfakes. We found that for text deepfakes there are more generation
methods but very few robust methods for detection, including fake news
detection, which has become a controversial area of research because of the
potential of heavy overlaps with human generation of fake content. This paper
is an abbreviated version of the full survey and reveals a clear need to
research audio deepfakes and particularly detection of audio deepfakes.
Related papers
- Deepfake detection in videos with multiple faces using geometric-fakeness features [79.16635054977068]
Deepfakes of victims or public figures can be used by fraudsters for blackmailing, extorsion and financial fraud.
In our research we propose to use geometric-fakeness features (GFF) that characterize a dynamic degree of a face presence in a video.
We employ our approach to analyze videos with multiple faces that are simultaneously present in a video.
arXiv Detail & Related papers (2024-10-10T13:10:34Z) - DF40: Toward Next-Generation Deepfake Detection [62.073997142001424]
existing works identify top-notch detection algorithms and models by adhering to the common practice: training detectors on one specific dataset and testing them on other prevalent deepfake datasets.
But can these stand-out "winners" be truly applied to tackle the myriad of realistic and diverse deepfakes lurking in the real world?
We construct a highly diverse deepfake detection dataset called DF40, which comprises 40 distinct deepfake techniques.
arXiv Detail & Related papers (2024-06-19T12:35:02Z) - Unmasking Illusions: Understanding Human Perception of Audiovisual Deepfakes [49.81915942821647]
This paper aims to evaluate the human ability to discern deepfake videos through a subjective study.
We present our findings by comparing human observers to five state-ofthe-art audiovisual deepfake detection models.
We found that all AI models performed better than humans when evaluated on the same 40 videos.
arXiv Detail & Related papers (2024-05-07T07:57:15Z) - Recent Advancements In The Field Of Deepfake Detection [0.0]
A deepfake is a photo or video of a person whose image has been digitally altered or partially replaced with an image of someone else.
Deepfakes have the potential to cause a variety of problems and are often used maliciously.
Our objective is to survey and analyze a variety of current methods and advances in the field of deepfake detection.
arXiv Detail & Related papers (2023-08-10T13:24:27Z) - System Fingerprint Recognition for Deepfake Audio: An Initial Dataset
and Investigation [51.06875680387692]
We present the first deepfake audio dataset for system fingerprint recognition (SFR)
We collected the dataset from the speech synthesis systems of seven Chinese vendors that use the latest state-of-the-art deep learning technologies.
arXiv Detail & Related papers (2022-08-21T05:15:40Z) - Audio Deepfake Perceptions in College Going Populations [0.0]
This study tries to assess audio deepfake perceptions among college students from different majors.
We also analyzed the results based on different aspects of: grade level, complexity of the grammar used in the audio clips, length of the audio clips, those who knew the term deepfakes and those who did not.
arXiv Detail & Related papers (2021-12-06T20:53:41Z) - Evaluation of an Audio-Video Multimodal Deepfake Dataset using Unimodal
and Multimodal Detectors [18.862258543488355]
Deepfakes can cause security and privacy issues.
New domain of cloning human voices using deep-learning technologies is also emerging.
To develop a good deepfake detector, we need a detector that can detect deepfakes of multiple modalities.
arXiv Detail & Related papers (2021-09-07T11:00:20Z) - FakeAVCeleb: A Novel Audio-Video Multimodal Deepfake Dataset [21.199288324085444]
Recently, a new problem of generating cloned or synthesized human voice of a person is emerging.
With the emerging threat of impersonation attacks using deepfake videos and audios, new deepfake detectors are need that focuses on both, video and audio.
We propose a novel Audio-Video Deepfake dataset (FakeAVCeleb) that not only contains deepfake videos but respective synthesized cloned audios as well.
arXiv Detail & Related papers (2021-08-11T07:49:36Z) - WildDeepfake: A Challenging Real-World Dataset for Deepfake Detection [82.42495493102805]
We introduce a new dataset WildDeepfake which consists of 7,314 face sequences extracted from 707 deepfake videos collected completely from the internet.
We conduct a systematic evaluation of a set of baseline detection networks on both existing and our WildDeepfake datasets, and show that WildDeepfake is indeed a more challenging dataset, where the detection performance can decrease drastically.
arXiv Detail & Related papers (2021-01-05T11:10:32Z) - The Creation and Detection of Deepfakes: A Survey [32.04375809239154]
Generative deep learning algorithms have progressed to a point where it is difficult to tell the difference between what is real and what is fake.
In this paper, we explore the creation and detection of deepfakes and provide an in-depth view of how these architectures work.
arXiv Detail & Related papers (2020-04-23T13:35:49Z) - Emotions Don't Lie: An Audio-Visual Deepfake Detection Method Using
Affective Cues [75.1731999380562]
We present a learning-based method for detecting real and fake deepfake multimedia content.
We extract and analyze the similarity between the two audio and visual modalities from within the same video.
We compare our approach with several SOTA deepfake detection methods and report per-video AUC of 84.4% on the DFDC and 96.6% on the DF-TIMIT datasets.
arXiv Detail & Related papers (2020-03-14T22:07:26Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.