On consistency of constrained spectral clustering under
representation-aware stochastic block model
- URL: http://arxiv.org/abs/2203.02005v1
- Date: Thu, 3 Mar 2022 20:41:14 GMT
- Title: On consistency of constrained spectral clustering under
representation-aware stochastic block model
- Authors: Shubham Gupta and Ambedkar Dukkipati
- Abstract summary: We study constrained spectral clustering with the aim of finding balanced clusters in a given textitsimilarity graph $mathcalG$.
We develop unnormalized and normalized variants of spectral clustering in this setting.
These algorithms use $mathcalR$ to find clusters in $mathcalG$ that approximately satisfy the proposed constraint.
- Score: 20.6072287343024
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Spectral clustering is widely used in practice due to its flexibility,
computational efficiency, and well-understood theoretical performance
guarantees. Recently, spectral clustering has been studied to find balanced
clusters under population-level constraints. These constraints are specified by
additional information available in the form of auxiliary categorical node
attributes. In this paper, we consider a scenario where these attributes may
not be observable, but manifest as latent features of an auxiliary graph.
Motivated by this, we study constrained spectral clustering with the aim of
finding balanced clusters in a given \textit{similarity graph} $\mathcal{G}$,
such that each individual is adequately represented with respect to an
auxiliary graph $\mathcal{R}$ (we refer to this as representation graph). We
propose an individual-level balancing constraint that formalizes this idea. Our
work leads to an interesting stochastic block model that not only plants the
given partitions in $\mathcal{G}$ but also plants the auxiliary information
encoded in the representation graph $\mathcal{R}$. We develop unnormalized and
normalized variants of spectral clustering in this setting. These algorithms
use $\mathcal{R}$ to find clusters in $\mathcal{G}$ that approximately satisfy
the proposed constraint. We also establish the first statistical consistency
result for constrained spectral clustering under individual-level constraints
for graphs sampled from the above-mentioned variant of the stochastic block
model. Our experimental results corroborate our theoretical findings.
Related papers
- Interpretable Multi-View Clustering Based on Anchor Graph Tensor Factorization [64.00146569922028]
Multi-view clustering methods based on anchor graph factorization lack adequate cluster interpretability for the decomposed matrix.
We address this limitation by using non-negative tensor factorization to decompose an anchor graph tensor that combines anchor graphs from multiple views.
arXiv Detail & Related papers (2024-04-01T03:23:55Z) - Robustness for Spectral Clustering of General Graphs under Local Differential Privacy [2.4447259440166302]
We argue that social networks do not originate from the block model (SBM)
Our primary focus is the edge flipping method -- a common technique for protecting local differential privacy.
Although clustering outcomes have been stable for dense and well-clustered graphs produced from the SBM, we show that in general, spectral clustering may yield highly erratic results on certain dense and well-clustered graphs.
arXiv Detail & Related papers (2023-09-13T10:23:29Z) - Instance-Optimal Cluster Recovery in the Labeled Stochastic Block Model [79.46465138631592]
We devise an efficient algorithm that recovers clusters using the observed labels.
We present Instance-Adaptive Clustering (IAC), the first algorithm whose performance matches these lower bounds both in expectation and with high probability.
arXiv Detail & Related papers (2023-06-18T08:46:06Z) - Nonbacktracking spectral clustering of nonuniform hypergraphs [2.408714894793063]
We study spectral clustering for nonuniform hypergraphs based on the hypergraph nonbacktracking operator.
We propose an alternating algorithm for inference in a hypergraph blockmodel via linearized belief-propagation.
arXiv Detail & Related papers (2022-04-27T01:14:06Z) - Lattice-Based Methods Surpass Sum-of-Squares in Clustering [98.46302040220395]
Clustering is a fundamental primitive in unsupervised learning.
Recent work has established lower bounds against the class of low-degree methods.
We show that, perhaps surprisingly, this particular clustering model textitdoes not exhibit a statistical-to-computational gap.
arXiv Detail & Related papers (2021-12-07T18:50:17Z) - Spectral-Spatial Global Graph Reasoning for Hyperspectral Image
Classification [50.899576891296235]
Convolutional neural networks have been widely applied to hyperspectral image classification.
Recent methods attempt to address this issue by performing graph convolutions on spatial topologies.
arXiv Detail & Related papers (2021-06-26T06:24:51Z) - Average Sensitivity of Spectral Clustering [31.283432482502278]
We study the stability of spectral clustering against edge perturbations in the input graph.
Our results suggest that spectral clustering is stable against edge perturbations when there is a cluster structure in the input graph.
arXiv Detail & Related papers (2020-06-07T09:14:44Z) - Computationally efficient sparse clustering [67.95910835079825]
We provide a finite sample analysis of a new clustering algorithm based on PCA.
We show that it achieves the minimax optimal misclustering rate in the regime $|theta infty$.
arXiv Detail & Related papers (2020-05-21T17:51:30Z) - Strong Consistency, Graph Laplacians, and the Stochastic Block Model [1.2891210250935143]
We study the performance of classical two-step spectral clustering via the graph Laplacian to learn the block model.
We prove that spectral clustering is able to achieve exact recovery of the planted community structure under conditions that match the information-theoretic limits.
arXiv Detail & Related papers (2020-04-21T07:16:46Z) - Robust spectral clustering using LASSO regularization [0.0]
This paper presents a variant of spectral clustering, called 1-spectral clustering, performed on a new random model closely related to block model.
Its goal is to promote a sparse eigenbasis solution of a 1 minimization problem revealing the natural structure of the graph.
arXiv Detail & Related papers (2020-04-08T07:12:56Z) - Embedding Graph Auto-Encoder for Graph Clustering [90.8576971748142]
Graph auto-encoder (GAE) models are based on semi-supervised graph convolution networks (GCN)
We design a specific GAE-based model for graph clustering to be consistent with the theory, namely Embedding Graph Auto-Encoder (EGAE)
EGAE consists of one encoder and dual decoders.
arXiv Detail & Related papers (2020-02-20T09:53:28Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.