Computing Graph Descriptors on Edge Streams
- URL: http://arxiv.org/abs/2109.01494v5
- Date: Sat, 8 Apr 2023 20:14:48 GMT
- Title: Computing Graph Descriptors on Edge Streams
- Authors: Zohair Raza Hassan, Sarwan Ali, Imdadullah Khan, Mudassir Shabbir,
Waseem Abbas
- Abstract summary: We present streaming algorithms to approximately compute three different graph descriptors.
operating on edge streams allows us to avoid storing the entire graph in memory.
Our scalable algorithms compute descriptors of graphs with millions of edges within minutes.
- Score: 4.129847064263056
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Feature extraction is an essential task in graph analytics. These feature
vectors, called graph descriptors, are used in downstream vector-space-based
graph analysis models. This idea has proved fruitful in the past, with
spectral-based graph descriptors providing state-of-the-art classification
accuracy. However, known algorithms to compute meaningful descriptors do not
scale to large graphs since: (1) they require storing the entire graph in
memory, and (2) the end-user has no control over the algorithm's runtime. In
this paper, we present streaming algorithms to approximately compute three
different graph descriptors capturing the essential structure of graphs.
Operating on edge streams allows us to avoid storing the entire graph in
memory, and controlling the sample size enables us to keep the runtime of our
algorithms within desired bounds. We demonstrate the efficacy of the proposed
descriptors by analyzing the approximation error and classification accuracy.
Our scalable algorithms compute descriptors of graphs with millions of edges
within minutes. Moreover, these descriptors yield predictive accuracy
comparable to the state-of-the-art methods but can be computed using only 25%
as much memory.
Related papers
- Graph Parsing Networks [64.5041886737007]
We propose an efficient graph parsing algorithm to infer the pooling structure, which then drives graph pooling.
The resulting Graph Parsing Network (GPN) adaptively learns personalized pooling structure for each individual graph.
arXiv Detail & Related papers (2024-02-22T09:08:36Z) - Exploiting Edge Features in Graphs with Fused Network Gromov-Wasserstein
Distance [18.522233517515975]
We introduce an extension of Gromov-Wasserstein distance for comparing graphs whose both nodes and edges have features.
We empirically show the effectiveness of the novel distance in learning tasks where graphs occur in either input space or output space.
arXiv Detail & Related papers (2023-09-28T17:05:03Z) - Bures-Wasserstein Means of Graphs [60.42414991820453]
We propose a novel framework for defining a graph mean via embeddings in the space of smooth graph signal distributions.
By finding a mean in this embedding space, we can recover a mean graph that preserves structural information.
We establish the existence and uniqueness of the novel graph mean, and provide an iterative algorithm for computing it.
arXiv Detail & Related papers (2023-05-31T11:04:53Z) - Towards Accurate Subgraph Similarity Computation via Neural Graph
Pruning [22.307526272085024]
In this work, we convert graph pruning to a problem of node relabeling and then relax it to a differentiable problem.
Based on this idea, we further design a novel neural network to approximate a type of subgraph distance: the subgraph edit distance (SED)
In the design of the model, we propose an attention mechanism to leverage the information about the query graph and guide the pruning of the target graph.
arXiv Detail & Related papers (2022-10-19T15:16:28Z) - Malware Analysis with Symbolic Execution and Graph Kernel [2.1377923666134113]
We propose a new efficient open source toolchain for machine learning-based classification.
We focus on the 1-dimensional Weisfeiler-Lehman kernel, which can capture local similarities between graphs.
arXiv Detail & Related papers (2022-04-12T08:52:33Z) - Sketch-Based Anomaly Detection in Streaming Graphs [89.52200264469364]
Given a stream of graph edges from a dynamic graph, how can we assign anomaly scores to edges and subgraphs in an online manner?
Our method is the first streaming approach that incorporates dense subgraph search to detect graph anomalies in constant memory and time.
arXiv Detail & Related papers (2021-06-08T16:10:36Z) - Line Graph Neural Networks for Link Prediction [71.00689542259052]
We consider the graph link prediction task, which is a classic graph analytical problem with many real-world applications.
In this formalism, a link prediction problem is converted to a graph classification task.
We propose to seek a radically different and novel path by making use of the line graphs in graph theory.
In particular, each node in a line graph corresponds to a unique edge in the original graph. Therefore, link prediction problems in the original graph can be equivalently solved as a node classification problem in its corresponding line graph, instead of a graph classification task.
arXiv Detail & Related papers (2020-10-20T05:54:31Z) - Faster Graph Embeddings via Coarsening [25.37181684580123]
Graph embeddings are a ubiquitous tool for machine learning tasks, such as node classification and link prediction, on graph-structured data.
computing the embeddings for large-scale graphs is prohibitively inefficient even if we are interested only in a small subset of relevant vertices.
We present an efficient graph coarsening approach, based on Schur complements, for computing the embedding of the relevant vertices.
arXiv Detail & Related papers (2020-07-06T15:22:25Z) - Online Dense Subgraph Discovery via Blurred-Graph Feedback [87.9850024070244]
We introduce a novel learning problem for dense subgraph discovery.
We first propose a edge-time algorithm that obtains a nearly-optimal solution with high probability.
We then design a more scalable algorithm with a theoretical guarantee.
arXiv Detail & Related papers (2020-06-24T11:37:33Z) - Graph Pooling with Node Proximity for Hierarchical Representation
Learning [80.62181998314547]
We propose a novel graph pooling strategy that leverages node proximity to improve the hierarchical representation learning of graph data with their multi-hop topology.
Results show that the proposed graph pooling strategy is able to achieve state-of-the-art performance on a collection of public graph classification benchmark datasets.
arXiv Detail & Related papers (2020-06-19T13:09:44Z) - Wasserstein Embedding for Graph Learning [33.90471037116372]
Wasserstein Embedding for Graph Learning (WEGL) is a framework for embedding entire graphs in a vector space.
We leverage new insights on defining similarity between graphs as a function of the similarity between their node embedding distributions.
We evaluate our new graph embedding approach on various benchmark graph-property prediction tasks.
arXiv Detail & Related papers (2020-06-16T18:23:00Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.