Related papers: An initial investigation on optimizing tandem speaker verification and countermeasure systems using reinforcement learning

An initial investigation on optimizing tandem speaker verification and countermeasure systems using reinforcement learning

URL: http://arxiv.org/abs/2002.03801v2
Date: Wed, 8 Apr 2020 11:09:14 GMT
Title: An initial investigation on optimizing tandem speaker verification and countermeasure systems using reinforcement learning
Authors: Anssi Kanervisto, Ville Hautam\"aki, Tomi Kinnunen, Junichi Yamagishi
Abstract summary: We study training the ASV and CM components together for a better t-DCF measure by using reinforcement learning. We demonstrate such training procedure indeed is able to improve the performance of the combined system, and does so with more reliable results than with the standard supervised learning techniques we compare against.
Score: 45.66319648049384
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: The spoofing countermeasure (CM) systems in automatic speaker verification (ASV) are not typically used in isolation of each other. These systems can be combined, for example, into a cascaded system where CM produces first a decision whether the input is synthetic or bona fide speech. In case the CM decides it is a bona fide sample, then the ASV system will consider it for speaker verification. End users of the system are not interested in the performance of the individual sub-modules, but instead are interested in the performance of the combined system. Such combination can be evaluated with tandem detection cost function (t-DCF) measure, yet the individual components are trained separately from each other using their own performance metrics. In this work we study training the ASV and CM components together for a better t-DCF measure by using reinforcement learning. We demonstrate that such training procedure indeed is able to improve the performance of the combined system, and does so with more reliable results than with the standard supervised learning techniques we compare against.

Related papers

A Systematic Examination of Preference Learning through the Lens of Instruction-Following [83.71180850955679]
We use a novel synthetic data generation pipeline to generate 48,000 instruction unique-following prompts. With our synthetic prompts, we use two preference dataset curation methods - rejection sampling (RS) and Monte Carlo Tree Search (MCTS) Experiments reveal that shared prefixes in preference pairs, as generated by MCTS, provide marginal but consistent improvements. High-contrast preference pairs generally outperform low-contrast pairs; however, combining both often yields the best performance.
arXiv Detail & Related papers (2024-12-18T15:38:39Z)
Pretraining End-to-End Keyword Search with Automatically Discovered Acoustic Units [8.86336076082867]
We propose a method for pretraining E2E KWS systems with untranscribed data. We show that finetuning such a model significantly outperforms a model trained from scratch.
arXiv Detail & Related papers (2024-07-05T17:07:58Z)
OAEI Machine Learning Dataset for Online Model Generation [0.6472397166280683]
Ontology and knowledge graph matching systems are evaluated annually by the Ontology Alignment Evaluation Initiative (OAEI) We introduce a dataset that contains training, validation, and test sets for most of the OAEI tracks.
arXiv Detail & Related papers (2024-04-29T09:33:53Z)
Enhancement of a Text-Independent Speaker Verification System by using Feature Combination and Parallel-Structure Classifiers [0.0]
This paper explores the two modules: feature extraction and classification. The choice of the most appropriate acoustic features is a crucial factor for performing robust speaker verification. To enhance the system more in noisy environments, the inclusion of the multiband noise removal technique as a preprocessing stage is proposed.
arXiv Detail & Related papers (2024-01-26T17:19:59Z)
Generalizing Speaker Verification for Spoof Awareness in the Embedding Space [30.094557217931563]
ASV systems can be spoofed using various types of adversaries. We propose a novel yet simple backend classifier based on deep neural networks. Experiments are conducted on the ASVspoof 2019 logical access dataset.
arXiv Detail & Related papers (2024-01-20T07:30:22Z)
Combining multiple matchers for fingerprint verification: A case study in biosecure network of excellence [53.598636960435286]
Two reference systems for fingerprint verification have been tested together with two additional non-reference systems. The experimental results show that the best recognition strategy involves both minutiae-based and correlation-based measurements.
arXiv Detail & Related papers (2022-12-04T19:49:05Z)
Deep Feature Learning for Medical Acoustics [78.56998585396421]
The purpose of this paper is to compare different learnables in medical acoustics tasks. A framework has been implemented to classify human respiratory sounds and heartbeats in two categories, i.e. healthy or affected by pathologies.
arXiv Detail & Related papers (2022-08-05T10:39:37Z)
Optimizing Tandem Speaker Verification and Anti-Spoofing Systems [45.66319648049384]
We propose to optimize the tandem system directly by creating a differentiable version of t-DCF and employing techniques from reinforcement learning. Results indicate that these approaches offer better outcomes than finetuning, with our method providing a 20% relative improvement in the t-DCF in the ASVSpoof19 dataset.
arXiv Detail & Related papers (2022-01-24T14:27:28Z)
Visualizing Classifier Adjacency Relations: A Case Study in Speaker Verification and Voice Anti-Spoofing [72.4445825335561]
We propose a simple method to derive 2D representation from detection scores produced by an arbitrary set of binary classifiers. Based upon rank correlations, our method facilitates a visual comparison of classifiers with arbitrary scores. While the approach is fully versatile and can be applied to any detection task, we demonstrate the method using scores produced by automatic speaker verification and voice anti-spoofing systems.
arXiv Detail & Related papers (2021-06-11T13:03:33Z)
Improving Conversational Question Answering Systems after Deployment using Feedback-Weighted Learning [69.42679922160684]
We propose feedback-weighted learning based on importance sampling to improve upon an initial supervised system using binary user feedback. Our work opens the prospect to exploit interactions with real users and improve conversational systems after deployment.
arXiv Detail & Related papers (2020-11-01T19:50:34Z)

This list is automatically generated from the titles and abstracts of the papers in this site.