Spectral Algorithms Optimally Recover (Censored) Planted Dense Subgraphs
- URL: http://arxiv.org/abs/2203.11847v1
- Date: Tue, 22 Mar 2022 16:23:43 GMT
- Title: Spectral Algorithms Optimally Recover (Censored) Planted Dense Subgraphs
- Authors: Souvik Dhara, Julia Gaudio, Elchanan Mossel, and Colin Sandon
- Abstract summary: We study spectral algorithms for the planted dense subgraph problem (PDS) and a censored variant (CPDS) of PDS, where the edge statuses are missing at random.
We show that a simple spectral algorithm based on the top two eigenvectors of the adjacency matrix can recover $Sstar$ up to the information theoretic threshold.
For the CDPS model, we obtain the information theoretic limit for the recovery problem, and further show that a spectral algorithm based on a special matrix called the signed adjacency matrix recovers $Sstar$ up to the information theoretic threshold
- Score: 6.760971938794954
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: We study spectral algorithms for the planted dense subgraph problem (PDS), as
well as for a censored variant (CPDS) of PDS, where the edge statuses are
missing at random. More precisely, in the PDS model, we consider $n$ vertices
and a random subset of vertices $S^{\star}$ of size $\Omega(n)$, such that two
vertices share an edge with probability $p$ if both of them are in $S^{\star}$,
and all other edges are present with probability $q$, independently. The goal
is to recover $S^{\star}$ from one observation of the network. In the CPDS
model, edge statuses are revealed with probability $\frac{t \log n}{n}$. For
the PDS model, we show that a simple spectral algorithm based on the top two
eigenvectors of the adjacency matrix can recover $S^{\star}$ up to the
information theoretic threshold. Prior work by Hajek, Wu and Xu required a less
efficient SDP based algorithm to recover $S^{\star}$ up to the information
theoretic threshold. For the CDPS model, we obtain the information theoretic
limit for the recovery problem, and further show that a spectral algorithm
based on a special matrix called the signed adjacency matrix recovers
$S^{\star}$ up to the information theoretic threshold.
Related papers
- Exact Community Recovery (under Side Information): Optimality of Spectral Algorithms [1.4732811715354452]
We study the problem of exact community recovery in general, two-community block models.
We provide a unified analysis of the effect of side information on the information-theoretic limits of exact recovery.
arXiv Detail & Related papers (2024-06-18T21:48:59Z) - A Combinatorial Approach to Robust PCA [18.740048806623037]
We study the problem of recovering Gaussian data under adversarial corruptions.
We assume that the Gaussian noises lie in an unknown $k$-dimensional subspace $U subseteq mathbbRd$, and $s$ randomly chosen coordinates of each data point fall into the control of an adversary.
Our main result is an efficient algorithm that, when $ks2 = O(d)$, recovers every single data point up to a nearly-optimal error of $tilde O(ks/d)$ in expectation.
arXiv Detail & Related papers (2023-11-28T01:49:51Z) - Community Detection in the Hypergraph SBM: Exact Recovery Given the
Similarity Matrix [1.74048653626208]
We study the performance of algorithms which operate on the $similarity$matrix $W$, where $W_ij$ reports the number of hyperedges containing both $i$ and $j$.
We design a simple and highly efficient spectral algorithm with nearly linear runtime and show that it achieves the min-bisection threshold.
arXiv Detail & Related papers (2022-08-23T15:22:05Z) - Near-optimal fitting of ellipsoids to random points [68.12685213894112]
A basic problem of fitting an ellipsoid to random points has connections to low-rank matrix decompositions, independent component analysis, and principal component analysis.
We resolve this conjecture up to logarithmic factors by constructing a fitting ellipsoid for some $n = Omega(, d2/mathrmpolylog(d),)$.
Our proof demonstrates feasibility of the least squares construction of Saunderson et al. using a convenient decomposition of a certain non-standard random matrix.
arXiv Detail & Related papers (2022-08-19T18:00:34Z) - Best Policy Identification in Linear MDPs [70.57916977441262]
We investigate the problem of best identification in discounted linear Markov+Delta Decision in the fixed confidence setting under a generative model.
The lower bound as the solution of an intricate non- optimization program can be used as the starting point to devise such algorithms.
arXiv Detail & Related papers (2022-08-11T04:12:50Z) - Sketching Algorithms and Lower Bounds for Ridge Regression [65.0720777731368]
We give a sketching-based iterative algorithm that computes $1+varepsilon$ approximate solutions for the ridge regression problem.
We also show that this algorithm can be used to give faster algorithms for kernel ridge regression.
arXiv Detail & Related papers (2022-04-13T22:18:47Z) - Settling the Sample Complexity of Model-Based Offline Reinforcement
Learning [50.5790774201146]
offline reinforcement learning (RL) learns using pre-collected data without further exploration.
Prior algorithms or analyses either suffer from suboptimal sample complexities or incur high burn-in cost to reach sample optimality.
We demonstrate that the model-based (or "plug-in") approach achieves minimax-optimal sample complexity without burn-in cost.
arXiv Detail & Related papers (2022-04-11T17:26:19Z) - Sparse random hypergraphs: Non-backtracking spectra and community
detection [10.503525445174464]
We prove that a spectral method based on the non-backtracking operator for hypergraphs works with high probability down to the generalized Kesten-Stigum detection threshold conjectured by Angelini et al.
This is the first provable and efficient algorithm that achieves the conjectured threshold for HSBMs with $r$ blocks generated according to a general symmetric probability tensor.
arXiv Detail & Related papers (2022-03-14T17:45:03Z) - Random Graph Matching in Geometric Models: the Case of Complete Graphs [21.689343447798677]
This paper studies the problem of matching two complete graphs with edge weights correlated through latent geometries.
We derive an approximate maximum likelihood estimator, which provably achieves, with high probability, perfect recovery of $pi*$.
As a side discovery, we show that the celebrated spectral algorithm of [Ume88] emerges as a further approximation to the maximum likelihood in the geometric model.
arXiv Detail & Related papers (2022-02-22T04:14:45Z) - Clustering Mixture Models in Almost-Linear Time via List-Decodable Mean
Estimation [58.24280149662003]
We study the problem of list-decodable mean estimation, where an adversary can corrupt a majority of the dataset.
We develop new algorithms for list-decodable mean estimation, achieving nearly-optimal statistical guarantees.
arXiv Detail & Related papers (2021-06-16T03:34:14Z) - Projection-free Graph-based Classifier Learning using Gershgorin Disc
Perfect Alignment [59.87663954467815]
In graph-based binary learning, a subset of known labels $hatx_i$ are used to infer unknown labels.
When restricting labels $x_i$ to binary values, the problem is NP-hard.
We propose a fast projection-free method by solving a sequence of linear programs (LP) instead.
arXiv Detail & Related papers (2021-06-03T07:22:48Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.