Optimizing Tandem Speaker Verification and Anti-Spoofing Systems
        - URL: http://arxiv.org/abs/2201.09709v1
 - Date: Mon, 24 Jan 2022 14:27:28 GMT
 - Title: Optimizing Tandem Speaker Verification and Anti-Spoofing Systems
 - Authors: Anssi Kanervisto, Ville Hautam\"aki, Tomi Kinnunen, Junichi Yamagishi
 - Abstract summary: We propose to optimize the tandem system directly by creating a differentiable version of t-DCF and employing techniques from reinforcement learning.
Results indicate that these approaches offer better outcomes than finetuning, with our method providing a 20% relative improvement in the t-DCF in the ASVSpoof19 dataset.
 - Score: 45.66319648049384
 - License: http://creativecommons.org/licenses/by/4.0/
 - Abstract:   As automatic speaker verification (ASV) systems are vulnerable to spoofing
attacks, they are typically used in conjunction with spoofing countermeasure
(CM) systems to improve security. For example, the CM can first determine
whether the input is human speech, then the ASV can determine whether this
speech matches the speaker's identity. The performance of such a tandem system
can be measured with a tandem detection cost function (t-DCF). However, ASV and
CM systems are usually trained separately, using different metrics and data,
which does not optimize their combined performance. In this work, we propose to
optimize the tandem system directly by creating a differentiable version of
t-DCF and employing techniques from reinforcement learning. The results
indicate that these approaches offer better outcomes than finetuning, with our
method providing a 20% relative improvement in the t-DCF in the ASVSpoof19
dataset in a constrained setting.
 
       
      
        Related papers
        - Review, Refine, Repeat: Understanding Iterative Decoding of AI Agents   with Dynamic Evaluation and Selection [71.92083784393418]
Inference-time methods such as Best-of-N (BON) sampling offer a simple yet effective alternative to improve performance.
We propose Iterative Agent Decoding (IAD) which combines iterative refinement with dynamic candidate evaluation and selection guided by a verifier.
arXiv  Detail & Related papers  (2025-04-02T17:40:47Z) - Toward Improving Synthetic Audio Spoofing Detection Robustness via   Meta-Learning and Disentangled Training With Adversarial Examples [33.445126880876415]
We propose a reliable and robust spoofing detection system to filter out spoofing attacks instead of having them reach the automatic speaker verification system.
A weighted additive angular margin loss is proposed to address the data imbalance issue, and different margins has been assigned to improve generalization to unseen spoofing attacks.
We craft adversarial examples by adding imperceptible perturbations to spoofing speech as a data augmentation strategy, then we use an auxiliary batch normalization to guarantee that corresponding normalization statistics are performed exclusively on the adversarial examples.
arXiv  Detail & Related papers  (2024-08-23T19:26:54Z) - Generalizing Speaker Verification for Spoof Awareness in the Embedding
  Space [30.094557217931563]
ASV systems can be spoofed using various types of adversaries.
We propose a novel yet simple backend classifier based on deep neural networks.
Experiments are conducted on the ASVspoof 2019 logical access dataset.
arXiv  Detail & Related papers  (2024-01-20T07:30:22Z) - Audio Anti-spoofing Using a Simple Attention Module and Joint
  Optimization Based on Additive Angular Margin Loss and Meta-learning [43.519717601587864]
This study introduces a simple attention module to infer 3-dim attention weights for the feature map in a convolutional layer.
We propose a joint optimization approach based on the weighted additive angular margin loss for binary classification.
Our proposed approach delivers a competitive result with a pooled EER of 0.99% and min t-DCF of 0.0289.
arXiv  Detail & Related papers  (2022-11-17T21:25:29Z) - Fully Automated End-to-End Fake Audio Detection [57.78459588263812]
This paper proposes a fully automated end-toend fake audio detection method.
We first use wav2vec pre-trained model to obtain a high-level representation of the speech.
For the network structure, we use a modified version of the differentiable architecture search (DARTS) named light-DARTS.
arXiv  Detail & Related papers  (2022-08-20T06:46:55Z) - Tackling Spoofing-Aware Speaker Verification with Multi-Model Fusion [88.34134732217416]
This work focuses on fusion-based SASV solutions and proposes a multi-model fusion framework to leverage the power of multiple state-of-the-art ASV and CM models.
The proposed framework vastly improves the SASV-EER from 8.75% to 1.17%, which is 86% relative improvement compared to the best baseline system in the SASV challenge.
arXiv  Detail & Related papers  (2022-06-18T06:41:06Z) - Anti-Spoofing Using Transfer Learning with Variational Information
  Bottleneck [6.918364447822298]
We propose a transfer learning scheme based on the wav2vec 2.0 pretrained model with variational information bottleneck for speech anti-spoofing task.
Our method improves the performance of distinguishing unseen spoofed and genuine speech, outperforming current state-of-the-art anti-spoofing systems.
arXiv  Detail & Related papers  (2022-04-04T11:08:21Z) - Unsupervised Domain Adaptation for Speech Recognition via Uncertainty
  Driven Self-Training [55.824641135682725]
Domain adaptation experiments using WSJ as a source domain and TED-LIUM 3 as well as SWITCHBOARD show that up to 80% of the performance of a system trained on ground-truth data can be recovered.
arXiv  Detail & Related papers  (2020-11-26T18:51:26Z) - Tandem Assessment of Spoofing Countermeasures and Automatic Speaker
  Verification: Fundamentals [59.34844017757795]
The reliability of spoofing countermeasures (CMs) is gauged using the equal error rate (EER) metric.
This paper presents several new extensions to the tandem detection cost function (t-DCF)
It is hoped that adoption of the t-DCF for the CM assessment will help to foster closer collaboration between the anti-spoofing and ASV research communities.
arXiv  Detail & Related papers  (2020-07-12T12:44:08Z) - An initial investigation on optimizing tandem speaker verification and
  countermeasure systems using reinforcement learning [45.66319648049384]
We study training the ASV and CM components together for a better t-DCF measure by using reinforcement learning.
We demonstrate such training procedure indeed is able to improve the performance of the combined system, and does so with more reliable results than with the standard supervised learning techniques we compare against.
arXiv  Detail & Related papers  (2020-02-06T15:13:49Z) 
        This list is automatically generated from the titles and abstracts of the papers in this site.
       
     
           This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.