Just SLaQ When You Approximate: Accurate Spectral Distances for
Web-Scale Graphs
- URL: http://arxiv.org/abs/2003.01282v1
- Date: Tue, 3 Mar 2020 01:25:07 GMT
- Title: Just SLaQ When You Approximate: Accurate Spectral Distances for
Web-Scale Graphs
- Authors: Anton Tsitsulin, Marina Munkhoeva, Bryan Perozzi
- Abstract summary: We propose SLaQ, an efficient and effective approximation technique for computing spectral distances between graphs with billions of nodes and edges.
We show that SLaQ outperforms existing methods, oftentimes by several orders of magnitude in approximation accuracy.
- Score: 6.72542623686684
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Graph comparison is a fundamental operation in data mining and information
retrieval. Due to the combinatorial nature of graphs, it is hard to balance the
expressiveness of the similarity measure and its scalability. Spectral analysis
provides quintessential tools for studying the multi-scale structure of graphs
and is a well-suited foundation for reasoning about differences between graphs.
However, computing full spectrum of large graphs is computationally
prohibitive; thus, spectral graph comparison methods often rely on rough
approximation techniques with weak error guarantees. In this work, we propose
SLaQ, an efficient and effective approximation technique for computing spectral
distances between graphs with billions of nodes and edges. We derive the
corresponding error bounds and demonstrate that accurate computation is
possible in time linear in the number of graph edges. In a thorough
experimental evaluation, we show that SLaQ outperforms existing methods,
oftentimes by several orders of magnitude in approximation accuracy, and
maintains comparable performance, allowing to compare million-scale graphs in a
matter of minutes on a single machine.
Related papers
- Graph Fourier MMD for Signals on Graphs [67.68356461123219]
We propose a novel distance between distributions and signals on graphs.
GFMMD is defined via an optimal witness function that is both smooth on the graph and maximizes difference in expectation.
We showcase it on graph benchmark datasets as well as on single cell RNA-sequencing data analysis.
arXiv Detail & Related papers (2023-06-05T00:01:17Z) - Localized Contrastive Learning on Graphs [110.54606263711385]
We introduce a simple yet effective contrastive model named Localized Graph Contrastive Learning (Local-GCL)
In spite of its simplicity, Local-GCL achieves quite competitive performance in self-supervised node representation learning tasks on graphs with various scales and properties.
arXiv Detail & Related papers (2022-12-08T23:36:00Z) - A Spectral Analysis of Graph Neural Networks on Dense and Sparse Graphs [13.954735096637298]
We analyze how sparsity affects the graph spectra, and thus the performance of graph neural networks (GNNs) in node classification on dense and sparse graphs.
We show that GNNs can outperform spectral methods on sparse graphs, and illustrate these results with numerical examples on both synthetic and real graphs.
arXiv Detail & Related papers (2022-11-06T22:38:13Z) - Learning Product Graphs from Spectral Templates [3.04585143845864]
Graph Learning (GL) is at the core of inference and analysis of connections in data mining and machine learning (ML)
We propose learning product (high dimensional) graphs from product spectral templates with significantly reduced complexity.
In contrast to the rare current approaches, our approach can learn all types of product graphs without knowing the type of graph products.
arXiv Detail & Related papers (2022-11-05T12:28:11Z) - FGOT: Graph Distances based on Filters and Optimal Transport [62.779521543654134]
Graph comparison deals with identifying similarities and dissimilarities between graphs.
A major obstacle is the unknown alignment of graphs, as well as the lack of accurate and inexpensive comparison metrics.
In this work we introduce the filter graph distance approximation.
arXiv Detail & Related papers (2021-09-09T17:43:07Z) - Multilayer Clustered Graph Learning [66.94201299553336]
We use contrastive loss as a data fidelity term, in order to properly aggregate the observed layers into a representative graph.
Experiments show that our method leads to a clustered clusters w.r.t.
We learn a clustering algorithm for solving clustering problems.
arXiv Detail & Related papers (2020-10-29T09:58:02Z) - Understanding Coarsening for Embedding Large-Scale Graphs [3.6739949215165164]
Proper analysis of graphs with Machine Learning (ML) algorithms has the potential to yield far-reaching insights into many areas of research and industry.
The irregular structure of graph data constitutes an obstacle for running ML tasks on graphs.
We analyze the impact of the coarsening quality on the embedding performance both in terms of speed and accuracy.
arXiv Detail & Related papers (2020-09-10T15:06:33Z) - Graph topology inference benchmarks for machine learning [16.857405938139525]
We introduce several benchmarks specifically designed to reveal the relative merits and limitations of graph inference methods.
We also contrast some of the most prominent techniques in the literature.
arXiv Detail & Related papers (2020-07-16T09:40:32Z) - Graph Pooling with Node Proximity for Hierarchical Representation
Learning [80.62181998314547]
We propose a novel graph pooling strategy that leverages node proximity to improve the hierarchical representation learning of graph data with their multi-hop topology.
Results show that the proposed graph pooling strategy is able to achieve state-of-the-art performance on a collection of public graph classification benchmark datasets.
arXiv Detail & Related papers (2020-06-19T13:09:44Z) - Block-Approximated Exponential Random Graphs [77.4792558024487]
An important challenge in the field of exponential random graphs (ERGs) is the fitting of non-trivial ERGs on large graphs.
We propose an approximative framework to such non-trivial ERGs that result in dyadic independence (i.e., edge independent) distributions.
Our methods are scalable to sparse graphs consisting of millions of nodes.
arXiv Detail & Related papers (2020-02-14T11:42:16Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.