A Robust Speaker Clustering Method Based on Discrete Tied Variational
Autoencoder
- URL: http://arxiv.org/abs/2003.01955v1
- Date: Wed, 4 Mar 2020 08:54:38 GMT
- Title: A Robust Speaker Clustering Method Based on Discrete Tied Variational
Autoencoder
- Authors: Chen Feng, Jianzong Wang, Tongxu Li, Junqing Peng, Jing Xiao
- Abstract summary: Traditional speaker clustering method based on aggregation hierarchy cluster (AHC) has the shortcomings of long-time running and remains sensitive to environment noise.
We propose a novel speaker clustering method based on Mutual Information (MI) and a non-linear model with discrete variable, which under the enlightenment of Tied Variational Autoencoder (TVAE) to enhance the robustness against noise.
- Score: 27.211505187332385
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Recently, the speaker clustering model based on aggregation hierarchy cluster
(AHC) is a common method to solve two main problems: no preset category number
clustering and fix category number clustering. In general, model takes features
like i-vectors as input of probability and linear discriminant analysis model
(PLDA) aims to form the distance matric in long voice application scenario, and
then clustering results are obtained through the clustering model. However,
traditional speaker clustering method based on AHC has the shortcomings of
long-time running and remains sensitive to environment noise. In this paper, we
propose a novel speaker clustering method based on Mutual Information (MI) and
a non-linear model with discrete variable, which under the enlightenment of
Tied Variational Autoencoder (TVAE), to enhance the robustness against noise.
The proposed method named Discrete Tied Variational Autoencoder (DTVAE) which
shortens the elapsed time substantially. With experience results, it
outperforms the general model and yields a relative Accuracy (ACC) improvement
and significant time reduction.
Related papers
- Dynamic Clustering and Cluster Contrastive Learning for Unsupervised
Person Re-identification [29.167783500369442]
Unsupervised Re-ID methods aim at learning robust and discriminative features from unlabeled data.
We propose a dynamic clustering and cluster contrastive learning (DCCC) method.
Experiments on several widely used public datasets validate the effectiveness of our proposed DCCC.
arXiv Detail & Related papers (2023-03-13T01:56:53Z) - Highly Efficient Real-Time Streaming and Fully On-Device Speaker
Diarization with Multi-Stage Clustering [18.62774420511154]
A multi-stage clustering strategy that uses different clustering algorithms for input of different lengths can address multi-faceted challenges of speaker diarization applications.
This strategy is critical for streaming on-device speaker diarization systems, where the budgets of CPU, memory and battery are tight.
arXiv Detail & Related papers (2022-10-25T01:20:24Z) - Tight integration of neural- and clustering-based diarization through
deep unfolding of infinite Gaussian mixture model [84.57667267657382]
This paper introduces a it trainable clustering algorithm into the integration framework.
Speaker embeddings are optimized during training such that it better fits iGMM clustering.
Experimental results show that the proposed approach outperforms the conventional approach in terms of diarization error rate.
arXiv Detail & Related papers (2022-02-14T07:45:21Z) - Personalized Federated Learning via Convex Clustering [72.15857783681658]
We propose a family of algorithms for personalized federated learning with locally convex user costs.
The proposed framework is based on a generalization of convex clustering in which the differences between different users' models are penalized.
arXiv Detail & Related papers (2022-02-01T19:25:31Z) - IB-GAN: A Unified Approach for Multivariate Time Series Classification
under Class Imbalance [1.854931308524932]
Non-parametric data augmentation with Generative Adversarial Networks (GANs) offers a promising solution.
We propose Imputation Balanced GAN (IB-GAN), a novel method that joins data augmentation and classification in a one-step process via an imputation-balancing approach.
arXiv Detail & Related papers (2021-10-14T15:31:16Z) - Correlation Clustering Reconstruction in Semi-Adversarial Models [70.11015369368272]
Correlation Clustering is an important clustering problem with many applications.
We study the reconstruction version of this problem in which one is seeking to reconstruct a latent clustering corrupted by random noise and adversarial modifications.
arXiv Detail & Related papers (2021-08-10T14:46:17Z) - Unsupervised Clustered Federated Learning in Complex Multi-source
Acoustic Environments [75.8001929811943]
We introduce a realistic and challenging, multi-source and multi-room acoustic environment.
We present an improved clustering control strategy that takes into account the variability of the acoustic scene.
The proposed approach is optimized using clustering-based measures and validated via a network-wide classification task.
arXiv Detail & Related papers (2021-06-07T14:51:39Z) - Progressive Cluster Purification for Unsupervised Feature Learning [48.87365358296371]
In unsupervised feature learning, sample specificity based methods ignore the inter-class information.
We propose a novel clustering based method, which excludes class inconsistent samples during progressive cluster formation.
Our approach, referred to as Progressive Cluster Purification (PCP), implements progressive clustering by gradually reducing the number of clusters during training.
arXiv Detail & Related papers (2020-07-06T08:11:03Z) - High-Dimensional Quadratic Discriminant Analysis under Spiked Covariance
Model [101.74172837046382]
We propose a novel quadratic classification technique, the parameters of which are chosen such that the fisher-discriminant ratio is maximized.
Numerical simulations show that the proposed classifier not only outperforms the classical R-QDA for both synthetic and real data but also requires lower computational complexity.
arXiv Detail & Related papers (2020-06-25T12:00:26Z) - Autoencoder-based time series clustering with energy applications [0.0]
Time series clustering is a challenging task due to the specific nature of the data.
In this paper we investigate the combination of a convolutional autoencoder and a k-medoids algorithm to perfom time series clustering.
arXiv Detail & Related papers (2020-02-10T10:04:29Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.