Exact Recovery of Community Detection in k-Community Gaussian Mixture
Model
- URL: http://arxiv.org/abs/2009.01185v1
- Date: Sat, 29 Aug 2020 19:27:20 GMT
- Title: Exact Recovery of Community Detection in k-Community Gaussian Mixture
Model
- Authors: Zhongyang Li
- Abstract summary: We study the community detection problem on a Gaussian mixture model.
We explicitly find the threshold for the exact recovery of the maximum likelihood estimation.
Applications include the community detection on hypergraphs.
- Score: 5.312472550578279
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: We study the community detection problem on a Gaussian mixture model, in
which vertices are divided into $k\geq 2$ distinct communities. The major
difference in our model is that the intensities for Gaussian perturbations are
different for different entries in the observation matrix, and we do not assume
that every community has the same number of vertices. We explicitly find the
threshold for the exact recovery of the maximum likelihood estimation.
Applications include the community detection on hypergraphs.
Related papers
- Community Detection in the Multi-View Stochastic Block Model [47.43048484980115]
This paper considers the problem of community detection on multiple potentially correlated graphs from antheoretical perspective.
We first put forth a random graph model, called the multi-view information block model (MVSBM), designed to generate correlated graphs on the same set of nodes.
arXiv Detail & Related papers (2024-01-17T13:39:38Z) - Exact Recovery of Community Detection in dependent Gaussian Mixture
Models [5.312472550578279]
We study the community detection problem on a Gaussian mixture model.
We prove necessary and sufficient conditions for the exact recovery of the maximum likelihood estimation.
arXiv Detail & Related papers (2022-09-23T14:26:19Z) - A Robust and Flexible EM Algorithm for Mixtures of Elliptical
Distributions with Missing Data [71.9573352891936]
This paper tackles the problem of missing data imputation for noisy and non-Gaussian data.
A new EM algorithm is investigated for mixtures of elliptical distributions with the property of handling potential missing data.
Experimental results on synthetic data demonstrate that the proposed algorithm is robust to outliers and can be used with non-Gaussian data.
arXiv Detail & Related papers (2022-01-28T10:01:37Z) - Clustering a Mixture of Gaussians with Unknown Covariance [4.821312633849745]
We derive a Max-Cut integer program based on maximum likelihood estimation.
We develop an efficient spectral algorithm that attains the optimal rate but requires a quadratic sample size.
We generalize the Max-Cut program to a $k$-means program that handles multi-component mixtures with possibly unequal weights.
arXiv Detail & Related papers (2021-10-04T17:59:20Z) - Learning Gaussian Mixtures with Generalised Linear Models: Precise
Asymptotics in High-dimensions [79.35722941720734]
Generalised linear models for multi-class classification problems are one of the fundamental building blocks of modern machine learning tasks.
We prove exacts characterising the estimator in high-dimensions via empirical risk minimisation.
We discuss how our theory can be applied beyond the scope of synthetic data.
arXiv Detail & Related papers (2021-06-07T16:53:56Z) - Exact Recovery in the General Hypergraph Stochastic Block Model [92.28929858529679]
This paper investigates fundamental limits of exact recovery in the general d-uniform hypergraph block model (d-HSBM)
We show that there exists a sharp threshold such that exact recovery is achievable above the threshold and impossible below it.
arXiv Detail & Related papers (2021-05-11T03:39:08Z) - Spectral clustering under degree heterogeneity: a case for the random
walk Laplacian [83.79286663107845]
This paper shows that graph spectral embedding using the random walk Laplacian produces vector representations which are completely corrected for node degree.
In the special case of a degree-corrected block model, the embedding concentrates about K distinct points, representing communities.
arXiv Detail & Related papers (2021-05-03T16:36:27Z) - Information Theoretic Limits of Exact Recovery in Sub-hypergraph Models
for Community Detection [29.266116605396814]
We study the information theoretic bounds for exact recovery in sub-hypergraph models for community detection.
Our bounds are tight and pertain to the community detection problems in various models.
arXiv Detail & Related papers (2021-01-29T02:50:34Z) - Robustly Learning Mixtures of $k$ Arbitrary Gaussians [47.40835932474677]
We give a-time algorithm for the problem of robustly estimating a mixture of $k$ arbitrary Gaussians in $mathbbRd$, for any fixed $k$, in the presence of a constant fraction of arbitrary corruptions.
Our main tools are an efficient emphpartial clustering algorithm that relies on the sum-of-squares method, and a novel tensor decomposition algorithm that allows errors in both Frobenius norm and low-rank terms.
arXiv Detail & Related papers (2020-12-03T17:54:03Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.