Listing Maximal k-Plexes in Large Real-World Graphs
- URL: http://arxiv.org/abs/2202.08737v2
- Date: Sat, 19 Feb 2022 03:00:13 GMT
- Title: Listing Maximal k-Plexes in Large Real-World Graphs
- Authors: Zhengren Wang, Yi Zhou, Mingyu Xiao and Bakhadyr Khoussainov
- Abstract summary: Listing dense subgraphs in large graphs plays a key task in varieties of network analysis applications like community detection.
In this paper, we continue the research line of listing all maximal $k$-plexes and maximal $k$-plexes of prescribed size.
Our first contribution is algorithm ListPlex that lists all maximal $k$-plexes in $O*(gammaD)$ time for each constant $k$, where $gamma$ is a value related to $k$ but strictly smaller than 2, and $D$ is the degeneracy of the graph that is far less
- Score: 21.79007529359561
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Listing dense subgraphs in large graphs plays a key task in varieties of
network analysis applications like community detection. Clique, as the densest
model, has been widely investigated. However, in practice, communities rarely
form as cliques for various reasons, e.g., data noise. Therefore, $k$-plex, --
graph with each vertex adjacent to all but at most $k$ vertices, is introduced
as a relaxed version of clique. Often, to better simulate cohesive communities,
an emphasis is placed on connected $k$-plexes with small $k$. In this paper, we
continue the research line of listing all maximal $k$-plexes and maximal
$k$-plexes of prescribed size. Our first contribution is algorithm ListPlex
that lists all maximal $k$-plexes in $O^*(\gamma^D)$ time for each constant
$k$, where $\gamma$ is a value related to $k$ but strictly smaller than 2, and
$D$ is the degeneracy of the graph that is far less than the vertex number $n$
in real-word graphs. Compared to the trivial bound of $2^n$, the improvement is
significant, and our bound is better than all previously known results. In
practice, we further use several techniques to accelerate listing $k$-plexes of
a given size, such as structural-based prune rules, cache-efficient data
structures, and parallel techniques. All these together result in a very
practical algorithm. Empirical results show that our approach outperforms the
state-of-the-art solutions by up to orders of magnitude.
Related papers
- Efficient Top-k s-Biplexes Search over Large Bipartite Graphs [4.507351209412066]
In bipartite graph analysis, enumeration of $s$-biplexes is a fundamental problem.
In real-world data engineering, finding all $s-biplexes is neither necessary nor computationally affordable.
We propose MVBP, that breaks the simple $2n enumeration algorithm.
FastMVBP outperforms the benchmark algorithms by up to three orders of magnitude in several instances.
arXiv Detail & Related papers (2024-09-27T06:23:29Z) - A Faster Branching Algorithm for the Maximum $k$-Defective Clique Problem [18.720357905876604]
A $k$-defective clique of an un branching graph $G$ induces a nearly complete graph with a maximum of $k$ missing edges.
The maximum $k$-defective clique problem, which asks for the largest $k$-defective clique from the given graph, is important in many applications, such as social and biological network analysis.
arXiv Detail & Related papers (2024-07-23T15:40:35Z) - Efficiently Learning One-Hidden-Layer ReLU Networks via Schur
Polynomials [50.90125395570797]
We study the problem of PAC learning a linear combination of $k$ ReLU activations under the standard Gaussian distribution on $mathbbRd$ with respect to the square loss.
Our main result is an efficient algorithm for this learning task with sample and computational complexity $(dk/epsilon)O(k)$, whereepsilon>0$ is the target accuracy.
arXiv Detail & Related papers (2023-07-24T14:37:22Z) - Fast Maximum $k$-Plex Algorithms Parameterized by Small Degeneracy Gaps [30.06993761032835]
The maximum $k$-plex problem is important but computationally challenging in applications such as graph mining and community detection.
We present an exact algorithm parameterized by $g_k(G)$, which has the worst-case running time in the size of the input graph and exponential in $g_k(G)$.
We further extend our discussion to an even smaller parameter $cg_k(G)$, the gap between the community-degeneracy bound and the size of the maximum $k$-plex.
arXiv Detail & Related papers (2023-06-23T01:28:24Z) - TURF: A Two-factor, Universal, Robust, Fast Distribution Learning
Algorithm [64.13217062232874]
One of its most powerful and successful modalities approximates every distribution to an $ell$ distance essentially at most a constant times larger than its closest $t$-piece degree-$d_$.
We provide a method that estimates this number near-optimally, hence helps approach the best possible approximation.
arXiv Detail & Related papers (2022-02-15T03:49:28Z) - Fast Graph Sampling for Short Video Summarization using Gershgorin Disc
Alignment [52.577757919003844]
We study the problem of efficiently summarizing a short video into several paragraphs, leveraging recent progress in fast graph sampling.
Experimental results show that our algorithm achieves comparable video summarization as state-of-the-art methods, at a substantially reduced complexity.
arXiv Detail & Related papers (2021-10-21T18:43:00Z) - Local classical MAX-CUT algorithm outperforms $p=2$ QAOA on high-girth
regular graphs [0.0]
We show that for all degrees $D ge 2$ of girth $> 5$, QAOA$ has a larger expected cut fraction than QAOA$$ on $G$.
We conjecture that for every constant $p$, there exists a local classical MAX-CUT algorithm that performs as well as QAOA$_p$ on all graphs.
arXiv Detail & Related papers (2021-01-14T09:17:20Z) - Small Covers for Near-Zero Sets of Polynomials and Learning Latent
Variable Models [56.98280399449707]
We show that there exists an $epsilon$-cover for $S$ of cardinality $M = (k/epsilon)O_d(k1/d)$.
Building on our structural result, we obtain significantly improved learning algorithms for several fundamental high-dimensional probabilistic models hidden variables.
arXiv Detail & Related papers (2020-12-14T18:14:08Z) - Optimal Low-Degree Hardness of Maximum Independent Set [93.59919600451487]
We study the algorithmic task of finding a large independent set in a sparse ErdHos-R'enyi random graph.
We show that the class of low-degree algorithms can find independent sets of half-optimal size but no larger.
arXiv Detail & Related papers (2020-10-13T17:26:09Z) - Streaming Complexity of SVMs [110.63976030971106]
We study the space complexity of solving the bias-regularized SVM problem in the streaming model.
We show that for both problems, for dimensions of $frac1lambdaepsilon$, one can obtain streaming algorithms with spacely smaller than $frac1lambdaepsilon$.
arXiv Detail & Related papers (2020-07-07T17:10:00Z) - Learning Bayesian Networks Under Sparsity Constraints: A Parameterized
Complexity Analysis [7.99536002595393]
We study the problem of learning the structure of an optimal Bayesian network when additional constraints are posed on the network or on its moralized graph.
We show that learning an optimal network with at most $k$ edges in the moralized graph presumably has no $f(k)cdot |I|O(1)$-time algorithm.
arXiv Detail & Related papers (2020-04-30T12:31:13Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.