Reformulating Speaker Diarization as Community Detection With Emphasis
On Topological Structure
- URL: http://arxiv.org/abs/2204.12112v1
- Date: Tue, 26 Apr 2022 07:18:05 GMT
- Title: Reformulating Speaker Diarization as Community Detection With Emphasis
On Topological Structure
- Authors: Siqi Zheng, Hongbin Suo
- Abstract summary: Clustering-based speaker diarization has stood firm as one of the major approaches in reality.
We propose to view clustering-based diarization as a community detection problem.
- Score: 10.508187462682308
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Clustering-based speaker diarization has stood firm as one of the major
approaches in reality, despite recent development in end-to-end diarization.
However, clustering methods have not been explored extensively for speaker
diarization. Commonly-used methods such as k-means, spectral clustering, and
agglomerative hierarchical clustering only take into account properties such as
proximity and relative densities. In this paper we propose to view
clustering-based diarization as a community detection problem. By doing so the
topological structure is considered. This work has four major contributions.
First it is shown that Leiden community detection algorithm significantly
outperforms the previous methods on the clustering of speaker-segments. Second,
we propose to use uniform manifold approximation to reduce dimension while
retaining global and local topological structure. Third, a masked filtering
approach is introduced to extract "clean" speaker embeddings. Finally, the
community structure is applied to an end-to-end post-processing network to
obtain diarization results. The final system presents a relative DER reduction
of up to 70 percent. The breakdown contribution of each component is analyzed.
Related papers
- Overlap-aware End-to-End Supervised Hierarchical Graph Clustering for
Speaker Diarization [41.24045486520547]
We propose an end-to-end supervised hierarchical clustering algorithm based on graph neural networks (GNN)
The proposed E-SHARC framework improves significantly over the state-of-art diarization systems.
arXiv Detail & Related papers (2024-01-23T15:35:44Z) - Supervised Hierarchical Clustering using Graph Neural Networks for
Speaker Diarization [41.30830281043803]
We propose a novel Supervised HierArchical gRaph Clustering algorithm (SHARC) for speaker diarization.
In this paper, we introduce a hierarchical structure using Graph Neural Network (GNN) to perform supervised clustering.
The supervised clustering is performed using node densities and edge existence probabilities to merge the segments until convergence.
arXiv Detail & Related papers (2023-02-24T16:16:41Z) - DeepCluE: Enhanced Image Clustering via Multi-layer Ensembles in Deep
Neural Networks [53.88811980967342]
This paper presents a Deep Clustering via Ensembles (DeepCluE) approach.
It bridges the gap between deep clustering and ensemble clustering by harnessing the power of multiple layers in deep neural networks.
Experimental results on six image datasets confirm the advantages of DeepCluE over the state-of-the-art deep clustering approaches.
arXiv Detail & Related papers (2022-06-01T09:51:38Z) - Perfect Spectral Clustering with Discrete Covariates [68.8204255655161]
We propose a spectral algorithm that achieves perfect clustering with high probability on a class of large, sparse networks.
Our method is the first to offer a guarantee of consistent latent structure recovery using spectral clustering.
arXiv Detail & Related papers (2022-05-17T01:41:06Z) - Tight integration of neural- and clustering-based diarization through
deep unfolding of infinite Gaussian mixture model [84.57667267657382]
This paper introduces a it trainable clustering algorithm into the integration framework.
Speaker embeddings are optimized during training such that it better fits iGMM clustering.
Experimental results show that the proposed approach outperforms the conventional approach in terms of diarization error rate.
arXiv Detail & Related papers (2022-02-14T07:45:21Z) - Correlation Clustering Reconstruction in Semi-Adversarial Models [70.11015369368272]
Correlation Clustering is an important clustering problem with many applications.
We study the reconstruction version of this problem in which one is seeking to reconstruct a latent clustering corrupted by random noise and adversarial modifications.
arXiv Detail & Related papers (2021-08-10T14:46:17Z) - End-to-End Speaker Diarization as Post-Processing [64.12519350944572]
Clustering-based diarization methods partition frames into clusters of the number of speakers.
Some end-to-end diarization methods can handle overlapping speech by treating the problem as multi-label classification.
We propose to use a two-speaker end-to-end diarization method as post-processing of the results obtained by a clustering-based method.
arXiv Detail & Related papers (2020-12-18T05:31:07Z) - Scalable Hierarchical Agglomerative Clustering [65.66407726145619]
Existing scalable hierarchical clustering methods sacrifice quality for speed.
We present a scalable, agglomerative method for hierarchical clustering that does not sacrifice quality and scales to billions of data points.
arXiv Detail & Related papers (2020-10-22T15:58:35Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.