Community Recovery in the Geometric Block Model
- URL: http://arxiv.org/abs/2206.11303v3
- Date: Sat, 18 Nov 2023 03:23:29 GMT
- Title: Community Recovery in the Geometric Block Model
- Authors: Sainyam Galhotra, Arya Mazumdar, Soumyabrata Pal, Barna Saha
- Abstract summary: We show that a simple triangle-counting dataset to detect communities in the geometric block model is near-optimal.
We also show that our algorithm performs extremely well, both theoretically and practically.
- Score: 38.77098549680883
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: To capture the inherent geometric features of many community detection
problems, we propose to use a new random graph model of communities that we
call a Geometric Block Model. The geometric block model builds on the random
geometric graphs (Gilbert, 1961), one of the basic models of random graphs for
spatial networks, in the same way that the well-studied stochastic block model
builds on the Erd\H{o}s-R\'{en}yi random graphs. It is also a natural extension
of random community models inspired by the recent theoretical and practical
advancements in community detection. To analyze the geometric block model, we
first provide new connectivity results for random annulus graphs which are
generalizations of random geometric graphs. The connectivity properties of
geometric graphs have been studied since their introduction, and analyzing them
has been more difficult than their Erd\H{o}s-R\'{en}yi counterparts due to
correlated edge formation.
We then use the connectivity results of random annulus graphs to provide
necessary and sufficient conditions for efficient recovery of communities for
the geometric block model. We show that a simple triangle-counting algorithm to
detect communities in the geometric block model is near-optimal. For this we
consider the following two regimes of graph density.
In the regime where the average degree of the graph grows logarithmically
with the number of vertices, we show that our algorithm performs extremely
well, both theoretically and practically. In contrast, the triangle-counting
algorithm is far from being optimum for the stochastic block model in the
logarithmic degree regime. We simulate our results on both real and synthetic
datasets to show superior performance of both the new model as well as our
algorithm.
Related papers
- A Survey of Geometric Graph Neural Networks: Data Structures, Models and
Applications [67.33002207179923]
This paper presents a survey of data structures, models, and applications related to geometric GNNs.
We provide a unified view of existing models from the geometric message passing perspective.
We also summarize the applications as well as the related datasets to facilitate later research for methodology development and experimental evaluation.
arXiv Detail & Related papers (2024-03-01T12:13:04Z) - Contrastive Graph Clustering in Curvature Spaces [74.03252813800334]
We present a novel end-to-end contrastive graph clustering model named CONGREGATE.
To support geometric clustering, we construct a theoretically grounded Heterogeneous Curvature Space.
We then train the graph clusters by an augmentation-free reweighted contrastive approach.
arXiv Detail & Related papers (2023-05-05T14:04:52Z) - GrannGAN: Graph annotation generative adversarial networks [72.66289932625742]
We consider the problem of modelling high-dimensional distributions and generating new examples of data with complex relational feature structure coherent with a graph skeleton.
The model we propose tackles the problem of generating the data features constrained by the specific graph structure of each data point by splitting the task into two phases.
In the first it models the distribution of features associated with the nodes of the given graph, in the second it complements the edge features conditionally on the node features.
arXiv Detail & Related papers (2022-12-01T11:49:07Z) - Unveiling the Sampling Density in Non-Uniform Geometric Graphs [69.93864101024639]
We consider graphs as geometric graphs: nodes are randomly sampled from an underlying metric space, and any pair of nodes is connected if their distance is less than a specified neighborhood radius.
In a social network communities can be modeled as densely sampled areas, and hubs as nodes with larger neighborhood radius.
We develop methods to estimate the unknown sampling density in a self-supervised fashion.
arXiv Detail & Related papers (2022-10-15T08:01:08Z) - On the Power of Edge Independent Graph Models [22.085932117823738]
We study the limitations of edge independent random graph models, in which each edge is added to the graph independently with some probability.
We prove that subject to a bounded overlap condition, edge independent models are inherently limited in their ability to generate graphs with high triangle and other subgraph densities.
arXiv Detail & Related papers (2021-10-29T19:12:14Z) - T-LoHo: A Bayesian Regularization Model for Structured Sparsity and
Smoothness on Graphs [0.0]
In graph-structured data, structured sparsity and smoothness tend to cluster together.
We propose a new prior for high dimensional parameters with graphical relations.
We use it to detect structured sparsity and smoothness simultaneously.
arXiv Detail & Related papers (2021-07-06T10:10:03Z) - Regularization of Mixture Models for Robust Principal Graph Learning [0.0]
A regularized version of Mixture Models is proposed to learn a principal graph from a distribution of $D$-dimensional data points.
Parameters of the model are iteratively estimated through an Expectation-Maximization procedure.
arXiv Detail & Related papers (2021-06-16T18:00:02Z) - Finding Geometric Models by Clustering in the Consensus Space [61.65661010039768]
We propose a new algorithm for finding an unknown number of geometric models, e.g., homographies.
We present a number of applications where the use of multiple geometric models improves accuracy.
These include pose estimation from multiple generalized homographies; trajectory estimation of fast-moving objects.
arXiv Detail & Related papers (2021-03-25T14:35:07Z) - Community detection and percolation of information in a geometric
setting [5.027571997864707]
We make the first steps towards generalizing the theory of block models, in the sparse regime.
We consider a geometric random graph over a homogeneous metric space where the probability of two vertices to be connected is an arbitrary function of the distance.
We define a geometric counterpart of the model of flow of information on trees, due to Mossel and Peres.
arXiv Detail & Related papers (2020-06-28T11:23:17Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.