SVDD Challenge 2024: A Singing Voice Deepfake Detection Challenge Evaluation Plan
- URL: http://arxiv.org/abs/2405.05244v1
- Date: Wed, 8 May 2024 17:40:12 GMT
- Title: SVDD Challenge 2024: A Singing Voice Deepfake Detection Challenge Evaluation Plan
- Authors: You Zhang, Yongyi Zang, Jiatong Shi, Ryuichi Yamamoto, Jionghao Han, Yuxun Tang, Tomoki Toda, Zhiyao Duan,
- Abstract summary: "SVDD Challenge" is the first research challenge focusing on SVDD for lab-controlled and in-the-wild bonafide and deepfake singing voice recordings.
The challenge will be held in conjunction with the 2024 IEEE Spoken Language Technology Workshop (SLT 2024)
- Score: 44.260755521474735
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The rapid advancement of AI-generated singing voices, which now closely mimic natural human singing and align seamlessly with musical scores, has led to heightened concerns for artists and the music industry. Unlike spoken voice, singing voice presents unique challenges due to its musical nature and the presence of strong background music, making singing voice deepfake detection (SVDD) a specialized field requiring focused attention. To promote SVDD research, we recently proposed the "SVDD Challenge," the very first research challenge focusing on SVDD for lab-controlled and in-the-wild bonafide and deepfake singing voice recordings. The challenge will be held in conjunction with the 2024 IEEE Spoken Language Technology Workshop (SLT 2024).
Related papers
- Speech Foundation Model Ensembles for the Controlled Singing Voice Deepfake Detection (CtrSVDD) Challenge 2024 [8.940008511570207]
This work details our approach to achieving a leading system with a 1.79% pooled equal error rate (EER)
The rapid advancement of generative AI models presents significant challenges for detecting AI-generated deepfake singing voices.
The Singing Voice Deepfake Detection (SVDD) Challenge 2024 aims to address this complex task.
arXiv Detail & Related papers (2024-09-03T21:28:45Z) - ICAGC 2024: Inspirational and Convincing Audio Generation Challenge 2024 [32.96984318966757]
TheICAGC 2024 challenge aims to enhance the persuasiveness and acceptability of synthesized audio.
A total of 19 teams have registered for the challenge, and the results of the competition and the competition are described in this paper.
arXiv Detail & Related papers (2024-07-01T13:15:16Z) - Event-Based Eye Tracking. AIS 2024 Challenge Survey [73.41168445057629]
This survey reviews the AIS 2024 Event-Based Eye Tracking (EET) Challenge.
The task of the challenge focuses on processing eye movement recorded with event cameras and predicting the pupil center of the eye.
The challenge emphasizes efficient eye tracking with event cameras to achieve good task accuracy and efficiency trade-off.
arXiv Detail & Related papers (2024-04-17T21:53:01Z) - More than words: Advancements and challenges in speech recognition for singing [1.4589372436314496]
I will describe some of my own experiences when performing research on these tasks just as they were starting to gain traction.
My goal is to illuminate the complexities of applying speech recognition to singing, evaluate current capabilities, and outline future research directions.
arXiv Detail & Related papers (2024-03-14T11:37:02Z) - SingFake: Singing Voice Deepfake Detection [16.82140520915859]
Singing voices present different acoustic and linguistic characteristics from speech utterances.
We first present SingFake, the first curated in-the-wild dataset consisting of 28.93 hours of bonafide.
We then use SingFake to evaluate four state-of-the-art speech countermeasure systems trained on speech utterances.
arXiv Detail & Related papers (2023-09-14T08:49:05Z) - The Singing Voice Conversion Challenge 2023 [35.270322663776646]
This year we shifted our focus to singing voice conversion (SVC)
A new database was constructed for two tasks, namely in-domain and cross-domain SVC.
We observed that for both tasks, although human-level naturalness was achieved by the top system, no team was able to obtain a similarity score as high as the target speakers.
arXiv Detail & Related papers (2023-06-26T05:04:58Z) - VoxSRC 2022: The Fourth VoxCeleb Speaker Recognition Challenge [95.6159736804855]
The VoxCeleb Speaker Recognition Challenge 2022 (VoxSRC-22) was held in conjunction with INTERSPEECH 2022.
The goal of this challenge was to evaluate how well state-of-the-art speaker recognition systems can diarise and recognise speakers from speech obtained "in the wild"
arXiv Detail & Related papers (2023-02-20T19:27:14Z) - Learning the Beauty in Songs: Neural Singing Voice Beautifier [69.21263011242907]
We are interested in a novel task, singing voice beautifying (SVB)
Given the singing voice of an amateur singer, SVB aims to improve the intonation and vocal tone of the voice, while keeping the content and vocal timbre.
We introduce Neural Singing Voice Beautifier (NSVB), the first generative model to solve the SVB task.
arXiv Detail & Related papers (2022-02-27T03:10:12Z) - ADD 2022: the First Audio Deep Synthesis Detection Challenge [92.41777858637556]
The first Audio Deep synthesis Detection challenge (ADD) was motivated to fill in the gap.
The ADD 2022 includes three tracks: low-quality fake audio detection (LF), partially fake audio detection (PF) and audio fake game (FG)
arXiv Detail & Related papers (2022-02-17T03:29:20Z) - ASVspoof 2021: accelerating progress in spoofed and deepfake speech
detection [70.45884214674057]
ASVspoof 2021 is the forth edition in the series of bi-annual challenges which aim to promote the study of spoofing.
This paper describes all three tasks, the new databases for each of them, the evaluation metrics, four challenge baselines, the evaluation platform and a summary of challenge results.
arXiv Detail & Related papers (2021-09-01T16:17:31Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.