Planted Dense Subgraphs in Dense Random Graphs Can Be Recovered using
Graph-based Machine Learning
- URL: http://arxiv.org/abs/2201.01825v1
- Date: Wed, 5 Jan 2022 21:11:23 GMT
- Title: Planted Dense Subgraphs in Dense Random Graphs Can Be Recovered using
Graph-based Machine Learning
- Authors: Itay Levinas and Yoram Louzoun
- Abstract summary: We present PYGON, a graph neural network-based algorithm, which is insensitive to the structure of the planted subgraph.
We show that PYGON can recover cliques of sizes $Thetaleft(sqrtnright)$, where $n$ is the size of the background graph.
We also show that the same algorithm can recover multiple other planted subgraphs of size $Thetaleft(sqrtnright)$, in both directed and undirected.
- Score: 1.52292571922932
- License: http://creativecommons.org/licenses/by-sa/4.0/
- Abstract: Multiple methods of finding the vertices belonging to a planted dense
subgraph in a random dense $G(n, p)$ graph have been proposed, with an emphasis
on planted cliques. Such methods can identify the planted subgraph in
polynomial time, but are all limited to several subgraph structures. Here, we
present PYGON, a graph neural network-based algorithm, which is insensitive to
the structure of the planted subgraph. This is the first algorithm that uses
advanced learning tools for recovering dense subgraphs. We show that PYGON can
recover cliques of sizes $\Theta\left(\sqrt{n}\right)$, where $n$ is the size
of the background graph, comparable with the state of the art. We also show
that the same algorithm can recover multiple other planted subgraphs of size
$\Theta\left(\sqrt{n}\right)$, in both directed and undirected graphs. We
suggest a conjecture that no polynomial time PAC-learning algorithm can detect
planted dense subgraphs with size smaller than $O\left(\sqrt{n}\right)$, even
if in principle one could find dense subgraphs of logarithmic size.
Related papers
- Jaccard-constrained dense subgraph discovery [2.4511852052357628]
We search for dense subgraphs that have large pairwise Jaccard similarity coefficients.
We show experimentally that our algorithms are efficient, they can find ground truth in synthetic datasets and provide interpretable results from real-world datasets.
arXiv Detail & Related papers (2023-08-30T10:33:02Z) - Detection of Dense Subhypergraphs by Low-Degree Polynomials [72.4451045270967]
Detection of a planted dense subgraph in a random graph is a fundamental statistical and computational problem.
We consider detecting the presence of a planted $Gr(ngamma, n-alpha)$ subhypergraph in a $Gr(n, n-beta) hypergraph.
Our results are already new in the graph case $r=2$, as we consider the subtle log-density regime where hardness based on average-case reductions is not known.
arXiv Detail & Related papers (2023-04-17T10:38:08Z) - Detection-Recovery Gap for Planted Dense Cycles [72.4451045270967]
We consider a model where a dense cycle with expected bandwidth $n tau$ and edge density $p$ is planted in an ErdHos-R'enyi graph $G(n,q)$.
We characterize the computational thresholds for the associated detection and recovery problems for the class of low-degree algorithms.
arXiv Detail & Related papers (2023-02-13T22:51:07Z) - Correlation Clustering Algorithm for Dynamic Complete Signed Graphs: An
Index-based Approach [9.13755431537592]
In this paper, we reduce the complexity of approximating the correlation clustering problem from $O(mtimesleft( 2+ alpha (G) right)+n)$ to $O(m+n)$ for any given value of $varepsilon$ for a complete signed graph.
Our approach gives the same output as the original algorithm and makes it possible to implement the algorithm in a full dynamic setting.
arXiv Detail & Related papers (2023-01-01T10:57:36Z) - Fast Graph Sampling for Short Video Summarization using Gershgorin Disc
Alignment [52.577757919003844]
We study the problem of efficiently summarizing a short video into several paragraphs, leveraging recent progress in fast graph sampling.
Experimental results show that our algorithm achieves comparable video summarization as state-of-the-art methods, at a substantially reduced complexity.
arXiv Detail & Related papers (2021-10-21T18:43:00Z) - Random Subgraph Detection Using Queries [29.192695995340653]
The planted densest subgraph detection problem refers to the task of testing whether in a given (random) graph there is a subgraph that is unusually dense.
In this paper, we consider a natural variant of the above problem, where one can only observe a relatively small part of the graph using adaptive edge queries.
For this model, we determine the number of queries necessary and sufficient (accompanied with a quasi-polynomial optimal algorithm) for detecting the presence of the planted subgraph.
arXiv Detail & Related papers (2021-10-02T07:41:17Z) - The Generalized Mean Densest Subgraph Problem [30.33731479053404]
We introduce a new family of dense subgraph objectives, parameterized by a single parameter $p$.
Our objective captures both the standard densest subgraph problem and the maximum $k$-core as special cases.
A major contribution of our work is analyzing the performance of different types of peeling algorithms for dense subgraphs both in theory and practice.
arXiv Detail & Related papers (2021-06-02T02:58:35Z) - Efficient Graph Deep Learning in TensorFlow with tf_geometric [53.237754811019464]
We introduce tf_geometric, an efficient and friendly library for graph deep learning.
tf_geometric provides kernel libraries for building Graph Neural Networks (GNNs) as well as implementations of popular GNNs.
The kernel libraries consist of infrastructures for building efficient GNNs, including graph data structures, graph map-reduce framework, graph mini-batch strategy, etc.
arXiv Detail & Related papers (2021-01-27T17:16:36Z) - A step towards neural genome assembly [0.0]
We train the MPNN model with max-aggregator to execute several algorithms for graph simplification.
We show that the algorithms were learned successfully and can be scaled to graphs of sizes up to 20 times larger than the ones used in training.
arXiv Detail & Related papers (2020-11-10T10:12:19Z) - Scalable Deep Generative Modeling for Sparse Graphs [105.60961114312686]
Existing deep neural methods require $Omega(n2)$ complexity by building up the adjacency matrix.
We develop a novel autoregressive model, named BiGG, that utilizes this sparsity to avoid generating the full adjacency matrix.
During training this autoregressive model can be parallelized with $O(log n)$ synchronization stages.
arXiv Detail & Related papers (2020-06-28T04:37:57Z) - Online Dense Subgraph Discovery via Blurred-Graph Feedback [87.9850024070244]
We introduce a novel learning problem for dense subgraph discovery.
We first propose a edge-time algorithm that obtains a nearly-optimal solution with high probability.
We then design a more scalable algorithm with a theoretical guarantee.
arXiv Detail & Related papers (2020-06-24T11:37:33Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.