Consistency of regularized spectral clustering in degree-corrected mixed
membership model
- URL: http://arxiv.org/abs/2011.12239v2
- Date: Fri, 27 Aug 2021 03:08:48 GMT
- Title: Consistency of regularized spectral clustering in degree-corrected mixed
membership model
- Authors: Huan Qing and Jingli Wang
- Abstract summary: We propose an efficient approach called mixed regularized spectral clustering (Mixed-RSC for short) based on the regularized Laplacian matrix.
Mixed-RSC is designed based on an ideal cone structure of the variant for the eigen-decomposition of the population regularized Laplacian matrix.
We show that the algorithm is consistent under mild conditions by providing error bounds for the inferred membership vector of each node.
- Score: 1.0965065178451106
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Community detection in network analysis is an attractive research area
recently. Here, under the degree-corrected mixed membership (DCMM) model, we
propose an efficient approach called mixed regularized spectral clustering
(Mixed-RSC for short) based on the regularized Laplacian matrix. Mixed-RSC is
designed based on an ideal cone structure of the variant for the
eigen-decomposition of the population regularized Laplacian matrix. We show
that the algorithm is asymptotically consistent under mild conditions by
providing error bounds for the inferred membership vector of each node. As a
byproduct of our bound, we provide the theoretical optimal choice for the
regularization parameter {\tau}. To demonstrate the performance of our method,
we apply it with previous benchmark methods on both simulated and real-world
networks. To our knowledge, this is the first work to design spectral
clustering algorithm for mixed membership community detection problem under
DCMM model based on the application of regularized Laplacian matrix.
Related papers
- A Fresh Look at Generalized Category Discovery through Non-negative Matrix Factorization [83.12938977698988]
Generalized Category Discovery (GCD) aims to classify both base and novel images using labeled base data.
Current approaches inadequately address the intrinsic optimization of the co-occurrence matrix $barA$ based on cosine similarity.
We propose a Non-Negative Generalized Category Discovery (NN-GCD) framework to address these deficiencies.
arXiv Detail & Related papers (2024-10-29T07:24:11Z) - Regularized Projection Matrix Approximation with Applications to Community Detection [1.3761665705201904]
This paper introduces a regularized projection matrix approximation framework designed to recover cluster information from the affinity matrix.
We investigate three distinct penalty functions, each specifically tailored to address bounded, positive, and sparse scenarios.
Numerical experiments conducted on both synthetic and real-world datasets reveal that our regularized projection matrix approximation approach significantly outperforms state-of-the-art methods in clustering performance.
arXiv Detail & Related papers (2024-05-26T15:18:22Z) - Adaptive Fuzzy C-Means with Graph Embedding [84.47075244116782]
Fuzzy clustering algorithms can be roughly categorized into two main groups: Fuzzy C-Means (FCM) based methods and mixture model based methods.
We propose a novel FCM based clustering model that is capable of automatically learning an appropriate membership degree hyper- parameter value.
arXiv Detail & Related papers (2024-05-22T08:15:50Z) - Fast Semisupervised Unmixing Using Nonconvex Optimization [80.11512905623417]
We introduce a novel convex convex model for semi/library-based unmixing.
We demonstrate the efficacy of Alternating Methods of sparse unsupervised unmixing.
arXiv Detail & Related papers (2024-01-23T10:07:41Z) - Directed degree corrected mixed membership model and estimating
community memberships in directed networks [0.0]
We build an efficient algorithm called DiMSC to infer the community membership vectors for both row nodes and column nodes.
We show that the proposed algorithm is consistent under mild conditions.
arXiv Detail & Related papers (2021-09-16T09:35:16Z) - Joint Network Topology Inference via Structured Fusion Regularization [70.30364652829164]
Joint network topology inference represents a canonical problem of learning multiple graph Laplacian matrices from heterogeneous graph signals.
We propose a general graph estimator based on a novel structured fusion regularization.
We show that the proposed graph estimator enjoys both high computational efficiency and rigorous theoretical guarantee.
arXiv Detail & Related papers (2021-03-05T04:42:32Z) - Optimal Clustering in Anisotropic Gaussian Mixture Models [3.5590836605011047]
We study the clustering task under anisotropic Gaussian Mixture Models.
We characterize the dependence of signal-to-noise ratios on the cluster centers.
arXiv Detail & Related papers (2021-01-14T00:31:52Z) - Estimating Mixed-Memberships Using the Symmetric Laplacian Inverse Matrix [0.4972323953932129]
Mixed-SLIM is a spectral clustering method on the symmetrized Laplacian inverse matrix under the degree-corrected mixed membership model.
We provide theoretical bounds for the estimation error on the proposed algorithm and its regularized version under mild conditions.
These Mixed-SLIM methods outperform state-of-art methods in simulations and substantial empirical datasets for both community detection and mixed membership community detection problems.
arXiv Detail & Related papers (2020-12-17T13:19:06Z) - A unified framework for spectral clustering in sparse graphs [47.82639003096941]
We show that a conveniently parametrized form of regularized Laplacian matrix can be used to perform spectral clustering in sparse networks.
We also exhibit important connections between this proposed matrix and the now popular non-backtracking matrix, the Bethe-Hessian matrix.
arXiv Detail & Related papers (2020-03-20T10:58:37Z) - Clustering Binary Data by Application of Combinatorial Optimization
Heuristics [52.77024349608834]
We study clustering methods for binary data, first defining aggregation criteria that measure the compactness of clusters.
Five new and original methods are introduced, using neighborhoods and population behavior optimization metaheuristics.
From a set of 16 data tables generated by a quasi-Monte Carlo experiment, a comparison is performed for one of the aggregations using L1 dissimilarity, with hierarchical clustering, and a version of k-means: partitioning around medoids or PAM.
arXiv Detail & Related papers (2020-01-06T23:33:31Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.