Voice Anonymization for All -- Bias Evaluation of the Voice Privacy
Challenge Baseline System
- URL: http://arxiv.org/abs/2311.15804v1
- Date: Mon, 27 Nov 2023 13:26:49 GMT
- Title: Voice Anonymization for All -- Bias Evaluation of the Voice Privacy
Challenge Baseline System
- Authors: Anna Leschanowsky, \"Unal Ege Gaznepoglu, Nils Peters
- Abstract summary: This study investigates bias in voice anonymization systems within the context of the Voice Privacy Challenge.
We curate a novel benchmark dataset to assess performance disparities among speaker subgroups based on sex and dialect.
- Score: 0.48342038441006807
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: In an age of voice-enabled technology, voice anonymization offers a solution
to protect people's privacy, provided these systems work equally well across
subgroups. This study investigates bias in voice anonymization systems within
the context of the Voice Privacy Challenge. We curate a novel benchmark dataset
to assess performance disparities among speaker subgroups based on sex and
dialect. We analyze the impact of three anonymization systems and attack models
on speaker subgroup bias and reveal significant performance variations.
Notably, subgroup bias intensifies with advanced attacker capabilities,
emphasizing the challenge of achieving equal performance across all subgroups.
Our study highlights the need for inclusive benchmark datasets and
comprehensive evaluation strategies that address subgroup bias in voice
anonymization.
Related papers
- Houston we have a Divergence: A Subgroup Performance Analysis of ASR Models [2.5168553347063862]
The Fearless Steps APOLLO Community Resource provides unparalleled opportunities to explore the potential of multi-speaker team communications.
This study focuses on discovering the characteristics that make Apollo recordings more or less intelligible to Automatic Speech Recognition (ASR) methods.
arXiv Detail & Related papers (2024-03-31T10:06:19Z) - Anonymizing Speech: Evaluating and Designing Speaker Anonymization
Techniques [1.2691047660244337]
The growing use of voice user interfaces has led to a surge in the collection and storage of speech data.
This thesis proposes solutions for anonymizing speech and evaluating the degree of the anonymization.
arXiv Detail & Related papers (2023-08-05T16:14:17Z) - Improving Fairness and Robustness in End-to-End Speech Recognition
through unsupervised clustering [49.069298478971696]
We present a privacy preserving approach to improve fairness and robustness of end-to-end ASR.
We extract utterance level embeddings using a speaker ID model trained on a public dataset.
We use cluster IDs instead of speaker utterance embeddings as extra features during model training.
arXiv Detail & Related papers (2023-06-06T21:13:08Z) - Controllable speech synthesis by learning discrete phoneme-level
prosodic representations [53.926969174260705]
We present a novel method for phoneme-level prosody control of F0 and duration using intuitive discrete labels.
We propose an unsupervised prosodic clustering process which is used to discretize phoneme-level F0 and duration features from a multispeaker speech dataset.
arXiv Detail & Related papers (2022-11-29T15:43:36Z) - SAMO: Speaker Attractor Multi-Center One-Class Learning for Voice
Anti-Spoofing [22.47152800242178]
Anti-spoofing systems are crucial auxiliaries for automatic speaker verification (ASV) systems.
We propose speaker attractor multi-center one-class learning (SAMO), which clusters bona fide speech around a number of speaker attractors.
Our proposed system outperforms existing state-of-the-art single systems with a relative improvement of 38% on equal error rate (EER) on the ASVspoof 2019 LA evaluation set.
arXiv Detail & Related papers (2022-11-04T19:31:33Z) - The VoicePrivacy 2022 Challenge Evaluation Plan [46.807999940446294]
Training, development and evaluation datasets are provided.
Participants apply their developed anonymization systems.
Results will be presented at a workshop held in conjunction with INTERSPEECH 2022.
arXiv Detail & Related papers (2022-03-23T15:05:18Z) - The VoicePrivacy 2020 Challenge: Results and findings [60.13468541150838]
The first VoicePrivacy 2020 Challenge focuses on developing anonymization solutions for speech technology.
We provide a systematic overview of the challenge design with an analysis of submitted systems and evaluation results.
arXiv Detail & Related papers (2021-09-01T23:40:38Z) - Self-supervised Text-independent Speaker Verification using Prototypical
Momentum Contrastive Learning [58.14807331265752]
We show that better speaker embeddings can be learned by momentum contrastive learning.
We generalize the self-supervised framework to a semi-supervised scenario where only a small portion of the data is labeled.
arXiv Detail & Related papers (2020-12-13T23:23:39Z) - Introducing the VoicePrivacy Initiative [53.14981205333593]
The VoicePrivacy initiative aims to promote the development of privacy preservation tools for speech technology.
We formulate the voice anonymization task selected for the VoicePrivacy 2020 Challenge and describe the datasets used for system development and evaluation.
arXiv Detail & Related papers (2020-05-04T11:07:52Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.