Community models for networks observed through edge nominations
- URL: http://arxiv.org/abs/2008.03652v2
- Date: Fri, 19 Mar 2021 03:25:11 GMT
- Title: Community models for networks observed through edge nominations
- Authors: Tianxi Li, Elizaveta Levina, Ji Zhu
- Abstract summary: Communities are a common and widely studied structure in networks, typically under the assumption that the network is fully and correctly observed.
We propose a general model for a class of network sampling mechanisms based on recording edges via querying nodes.
We show community detection can be performed by spectral clustering under this general class of models.
- Score: 6.442024233731203
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Communities are a common and widely studied structure in networks, typically
under the assumption that the network is fully and correctly observed. In
practice, network data are often collected by querying nodes about their
connections. In some settings, all edges of a sampled node will be recorded,
and in others, a node may be asked to name its connections. These sampling
mechanisms introduce noise and bias which can obscure the community structure
and invalidate assumptions underlying standard community detection methods. We
propose a general model for a class of network sampling mechanisms based on
recording edges via querying nodes, designed to improve community detection for
network data collected in this fashion. We model edge sampling probabilities as
a function of both individual preferences and community parameters, and show
community detection can be performed by spectral clustering under this general
class of models. We also propose, as a special case of the general framework, a
parametric model for directed networks we call the nomination stochastic block
model, which allows for meaningful parameter interpretations and can be fitted
by the method of moments. Both spectral clustering and the method of moments in
this case are computationally efficient and come with theoretical guarantees of
consistency. We evaluate the proposed model in simulation studies on both
unweighted and weighted networks and apply it to a faculty hiring dataset,
discovering a meaningful hierarchy of communities among US business schools.
Related papers
- A Dirichlet stochastic block model for composition-weighted networks [0.0]
We propose a block model for composition-weighted networks based on direct modelling of compositional weight vectors.
Inference is implemented via an extension of the classification expectation-maximisation algorithm.
The model is validated using simulation studies, and showcased on network data from the Erasmus exchange program and a bike sharing network for the city of London.
arXiv Detail & Related papers (2024-08-01T15:41:07Z) - Nested stochastic block model for simultaneously clustering networks and
nodes [9.860884833526407]
We introduce the nested block model (NSBM) to cluster a collection of networks while simultaneously detecting communities within each network.
NSBM has several appealing features including the ability to work on unlabeled networks with potentially different node sets.
arXiv Detail & Related papers (2023-07-18T12:46:34Z) - Joint Bayesian Inference of Graphical Structure and Parameters with a
Single Generative Flow Network [59.79008107609297]
We propose in this paper to approximate the joint posterior over the structure of a Bayesian Network.
We use a single GFlowNet whose sampling policy follows a two-phase process.
Since the parameters are included in the posterior distribution, this leaves more flexibility for the local probability models.
arXiv Detail & Related papers (2023-05-30T19:16:44Z) - Numerically assisted determination of local models in network scenarios [55.2480439325792]
We develop a numerical tool for finding explicit local models that reproduce a given statistical behaviour.
We provide conjectures for the critical visibilities of the Greenberger-Horne-Zeilinger (GHZ) and W distributions.
The developed codes and documentation are publicly available at281.com/mariofilho/localmodels.
arXiv Detail & Related papers (2023-03-17T13:24:04Z) - A pseudo-likelihood approach to community detection in weighted networks [4.111899441919165]
We propose a pseudo-likelihood community estimation algorithm for networks with normally distributed edge weights.
We prove that the estimates obtained by the proposed method are consistent under the assumption of homogeneous networks.
We illustrate the method on simulated networks and on a fMRI dataset, where edge weights represent connectivity between brain regions.
arXiv Detail & Related papers (2023-03-10T13:36:10Z) - Bayesian community detection for networks with covariates [16.230648949593153]
"Community detection" has arguably received the most attention in the scientific community.
We propose a block model with a co-dependent random partition prior.
Our model has the ability to learn the number of the communities via posterior inference without having to assume it to be known.
arXiv Detail & Related papers (2022-03-04T01:58:35Z) - Bayesian Structure Learning with Generative Flow Networks [85.84396514570373]
In Bayesian structure learning, we are interested in inferring a distribution over the directed acyclic graph (DAG) from data.
Recently, a class of probabilistic models, called Generative Flow Networks (GFlowNets), have been introduced as a general framework for generative modeling.
We show that our approach, called DAG-GFlowNet, provides an accurate approximation of the posterior over DAGs.
arXiv Detail & Related papers (2022-02-28T15:53:10Z) - Anomaly Detection on Attributed Networks via Contrastive Self-Supervised
Learning [50.24174211654775]
We present a novel contrastive self-supervised learning framework for anomaly detection on attributed networks.
Our framework fully exploits the local information from network data by sampling a novel type of contrastive instance pair.
A graph neural network-based contrastive learning model is proposed to learn informative embedding from high-dimensional attributes and local structure.
arXiv Detail & Related papers (2021-02-27T03:17:20Z) - On the use of local structural properties for improving the efficiency
of hierarchical community detection methods [77.34726150561087]
We study how local structural network properties can be used as proxies to improve the efficiency of hierarchical community detection.
We also check the performance impact of network prunings as an ancillary tactic to make hierarchical community detection more efficient.
arXiv Detail & Related papers (2020-09-15T00:16:12Z) - A Multi-Semantic Metapath Model for Large Scale Heterogeneous Network
Representation Learning [52.83948119677194]
We propose a multi-semantic metapath (MSM) model for large scale heterogeneous representation learning.
Specifically, we generate multi-semantic metapath-based random walks to construct the heterogeneous neighborhood to handle the unbalanced distributions.
We conduct systematical evaluations for the proposed framework on two challenging datasets: Amazon and Alibaba.
arXiv Detail & Related papers (2020-07-19T22:50:20Z) - Network Clustering Via Kernel-ARMA Modeling and the Grassmannian The
Brain-Network Case [6.78543866474958]
This paper introduces a clustering framework for networks with nodes annotated with time-series data.
The framework addresses all types of network-clustering problems: state clustering, node clustering within states, and even subnetwork-state-sequence identification/tracking.
arXiv Detail & Related papers (2020-02-18T19:48:38Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.