Testing correlation of unlabeled random graphs
- URL: http://arxiv.org/abs/2008.10097v2
- Date: Mon, 8 Feb 2021 01:57:36 GMT
- Title: Testing correlation of unlabeled random graphs
- Authors: Yihong Wu, Jiaming Xu, Sophie H. Yu
- Abstract summary: We study the problem of detecting the edge correlation between two random graphs with $n$ unlabeled nodes.
This is formalized as a hypothesis testing problem, where under the null hypothesis, the two graphs are independently generated.
Under the alternative, the two graphs are edge-correlated under some latent node correspondence, but have the same marginal distributions as the null.
- Score: 18.08210501570919
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: We study the problem of detecting the edge correlation between two random
graphs with $n$ unlabeled nodes. This is formalized as a hypothesis testing
problem, where under the null hypothesis, the two graphs are independently
generated; under the alternative, the two graphs are edge-correlated under some
latent node correspondence, but have the same marginal distributions as the
null. For both Gaussian-weighted complete graphs and dense Erd\H{o}s-R\'enyi
graphs (with edge probability $n^{-o(1)}$), we determine the sharp threshold at
which the optimal testing error probability exhibits a phase transition from
zero to one as $n\to \infty$. For sparse Erd\H{o}s-R\'enyi graphs with edge
probability $n^{-\Omega(1)}$, we determine the threshold within a constant
factor.
The proof of the impossibility results is an application of the conditional
second-moment method, where we bound the truncated second moment of the
likelihood ratio by carefully conditioning on the typical behavior of the
intersection graph (consisting of edges in both observed graphs) and taking
into account the cycle structure of the induced random permutation on the
edges. Notably, in the sparse regime, this is accomplished by leveraging the
pseudoforest structure of subcritical Erd\H{o}s-R\'enyi graphs and a careful
enumeration of subpseudoforests that can be assembled from short orbits of the
edge permutation.
Related papers
- Testing Dependency of Weighted Random Graphs [4.0554893636822]
We study the task of detecting the edge dependency between two random graphs.
For general edge-weight distributions, we establish thresholds at which optimal testing becomes information-theoretically possible or impossible.
arXiv Detail & Related papers (2024-09-23T10:07:41Z) - Graph Fourier MMD for Signals on Graphs [67.68356461123219]
We propose a novel distance between distributions and signals on graphs.
GFMMD is defined via an optimal witness function that is both smooth on the graph and maximizes difference in expectation.
We showcase it on graph benchmark datasets as well as on single cell RNA-sequencing data analysis.
arXiv Detail & Related papers (2023-06-05T00:01:17Z) - Online Centralized Non-parametric Change-point Detection via Graph-based
Likelihood-ratio Estimation [77.81487285123147]
Consider each node of a graph to be generating a data stream that is synchronized and observed at near real-time.
At a change-point $tau$, a change occurs at a subset of nodes $C$, which affects the probability distribution of their associated node streams.
We propose a novel kernel-based method to both detect $tau$ and localize $C$, based on the direct estimation of the likelihood-ratio between the post-change and the pre-change distributions of the node streams.
arXiv Detail & Related papers (2023-01-08T10:15:24Z) - Causal Bandits for Linear Structural Equation Models [58.2875460517691]
This paper studies the problem of designing an optimal sequence of interventions in a causal graphical model.
It is assumed that the graph's structure is known and has $N$ nodes.
Two algorithms are proposed for the frequentist (UCB-based) and Bayesian settings.
arXiv Detail & Related papers (2022-08-26T16:21:31Z) - Bi-stochastically normalized graph Laplacian: convergence to manifold Laplacian and robustness to outlier noise [10.418647759223965]
Bi-stochastic normalization provides an alternative normalization of graph Laplacians in graph-based data analysis.
We prove the convergence of bi-stochastically normalized graph Laplacian to manifold (weighted-)Laplacian with rates.
When the manifold data are corrupted by outlier noise, we theoretically prove the graph Laplacian point-wise consistency.
arXiv Detail & Related papers (2022-06-22T21:08:24Z) - Collaborative likelihood-ratio estimation over graphs [55.98760097296213]
Graph-based Relative Unconstrained Least-squares Importance Fitting (GRULSIF)
We develop this idea in a concrete non-parametric method that we call Graph-based Relative Unconstrained Least-squares Importance Fitting (GRULSIF)
We derive convergence rates for our collaborative approach that highlights the role played by variables such as the number of available observations per node, the size of the graph, and how accurately the graph structure encodes the similarity between tasks.
arXiv Detail & Related papers (2022-05-28T15:37:03Z) - Detection threshold for correlated Erd\H{o}s-R\'enyi graphs via densest
subgraphs [9.12788494573002]
We solve the problem of detecting edge correlation between two ErdHos-R'enyi random graphs on $n$ unlabeled nodes.
A key novelty in our work is an interesting connection between the detection problem and the densest subgraph of an ErdHos-R'enyi graph.
arXiv Detail & Related papers (2022-03-28T08:32:43Z) - Correlation detection in trees for partial graph alignment [3.5880535198436156]
We consider alignment of graphs, which consists in finding a mapping between the nodes of two graphs which preserves most of the edges.
Our approach is to compare local structures in the two graphs, matching two nodes if their neighborhoods are 'close enough' for correlated graphs.
This problem can be rephrased in terms of testing whether a pair of branching trees is drawn from either a product distribution, or a correlated distribution.
arXiv Detail & Related papers (2021-07-15T22:02:27Z) - Spectral clustering under degree heterogeneity: a case for the random
walk Laplacian [83.79286663107845]
This paper shows that graph spectral embedding using the random walk Laplacian produces vector representations which are completely corrected for node degree.
In the special case of a degree-corrected block model, the embedding concentrates about K distinct points, representing communities.
arXiv Detail & Related papers (2021-05-03T16:36:27Z) - Settling the Sharp Reconstruction Thresholds of Random Graph Matching [19.54246087326288]
We study the problem of recovering the hidden correspondence between two edge-correlated random graphs.
For dense graphs with $p=n-o(1)$, we prove that there exists a sharp threshold.
For sparse ErdHos-R'enyi graphs with $p=n-Theta(1)$, we show that the all-or-nothing phenomenon no longer holds.
arXiv Detail & Related papers (2021-01-29T21:49:50Z) - Block-Approximated Exponential Random Graphs [77.4792558024487]
An important challenge in the field of exponential random graphs (ERGs) is the fitting of non-trivial ERGs on large graphs.
We propose an approximative framework to such non-trivial ERGs that result in dyadic independence (i.e., edge independent) distributions.
Our methods are scalable to sparse graphs consisting of millions of nodes.
arXiv Detail & Related papers (2020-02-14T11:42:16Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.