Degree-Based Random Walk Approach for Graph Embedding
- URL: http://arxiv.org/abs/2110.13627v1
- Date: Thu, 21 Oct 2021 19:16:16 GMT
- Title: Degree-Based Random Walk Approach for Graph Embedding
- Authors: Sarmad N. Mohammed, Semra G\"und\"u\c{c}
- Abstract summary: A computationally less intensive and node connectivity aware uniform sampling method is proposed.
The advantages of the proposed algorithm become more enhanced when the algorithm is applied to large graphs.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Graph embedding, representing local and global neighborhood information by
numerical vectors, is a crucial part of the mathematical modeling of a wide
range of real-world systems. Among the embedding algorithms, random walk-based
algorithms have proven to be very successful. These algorithms collect
information by creating numerous random walks with a redefined number of steps.
Creating random walks is the most demanding part of the embedding process. The
computation demand increases with the size of the network. Moreover, for
real-world networks, considering all nodes on the same footing, the abundance
of low-degree nodes creates an imbalanced data problem. In this work, a
computationally less intensive and node connectivity aware uniform sampling
method is proposed. In the proposed method, the number of random walks is
created proportionally with the degree of the node. The advantages of the
proposed algorithm become more enhanced when the algorithm is applied to large
graphs. A comparative study by using two networks namely CORA and CiteSeer is
presented. Comparing with the fixed number of walks case, the proposed method
requires 50% less computational effort to reach the same accuracy for node
classification and link prediction calculations.
Related papers
- Ensemble Quadratic Assignment Network for Graph Matching [52.20001802006391]
Graph matching is a commonly used technique in computer vision and pattern recognition.
Recent data-driven approaches have improved the graph matching accuracy remarkably.
We propose a graph neural network (GNN) based approach to combine the advantages of data-driven and traditional methods.
arXiv Detail & Related papers (2024-03-11T06:34:05Z) - NodeFormer: A Scalable Graph Structure Learning Transformer for Node
Classification [70.51126383984555]
We introduce a novel all-pair message passing scheme for efficiently propagating node signals between arbitrary nodes.
The efficient computation is enabled by a kernerlized Gumbel-Softmax operator.
Experiments demonstrate the promising efficacy of the method in various tasks including node classification on graphs.
arXiv Detail & Related papers (2023-06-14T09:21:15Z) - Hub-aware Random Walk Graph Embedding Methods for Classification [44.99833362998488]
We propose two novel graph embedding algorithms based on random walks that are specifically designed for the node classification problem.
The proposed methods are experimentally evaluated by analyzing the classification performance of three classification algorithms trained on embeddings of real-world networks.
arXiv Detail & Related papers (2022-09-15T20:41:18Z) - A Metaheuristic Algorithm for Large Maximum Weight Independent Set
Problems [58.348679046591265]
Given a node-weighted graph, find a set of independent (mutually nonadjacent) nodes whose node-weight sum is maximum.
Some of the graphs airsing in this application are large, having hundreds of thousands of nodes and hundreds of millions of edges.
We develop a new local search algorithm, which is a metaheuristic in the greedy randomized adaptive search framework.
arXiv Detail & Related papers (2022-03-28T21:34:16Z) - Neighbor2Seq: Deep Learning on Massive Graphs by Transforming Neighbors
to Sequences [55.329402218608365]
We propose the Neighbor2Seq to transform the hierarchical neighborhood of each node into a sequence.
We evaluate our method on a massive graph with more than 111 million nodes and 1.6 billion edges.
Results show that our proposed method is scalable to massive graphs and achieves superior performance across massive and medium-scale graphs.
arXiv Detail & Related papers (2022-02-07T16:38:36Z) - Efficient and Local Parallel Random Walks [21.29022677162416]
Random walks are a fundamental primitive used in many machine learning algorithms.
We present a new algorithm that overcomes this limitation by building random walk efficiently and locally.
We show that our technique is both memory and round efficient, and in particular yields an efficient parallel local clustering algorithm.
arXiv Detail & Related papers (2021-12-01T17:06:11Z) - Community detection using low-dimensional network embedding algorithms [1.052782170493037]
We rigorously understand the performance of two major algorithms, DeepWalk and node2vec, in recovering communities for canonical network models.
We prove that, given some fixed co-occurrence window, node2vec using random walks with a low non-backtracking probability can succeed for much sparser networks.
arXiv Detail & Related papers (2021-11-04T14:57:43Z) - A Preference Random Walk Algorithm for Link Prediction through Mutual
Influence Nodes in Complex Networks [1.345669927504424]
Local random walk is considered to be one of the most well-known algorithms in the category of quasi-local methods.
In most datasets, this method is not able to perform accurately in scoring remarkably similar nodes.
An efficient method is proposed for improving local random walk by encouraging random walk to move towards the node which has a stronger influence.
arXiv Detail & Related papers (2021-05-20T03:35:38Z) - Finding Geometric Models by Clustering in the Consensus Space [61.65661010039768]
We propose a new algorithm for finding an unknown number of geometric models, e.g., homographies.
We present a number of applications where the use of multiple geometric models improves accuracy.
These include pose estimation from multiple generalized homographies; trajectory estimation of fast-moving objects.
arXiv Detail & Related papers (2021-03-25T14:35:07Z) - Towards Optimally Efficient Tree Search with Deep Learning [76.64632985696237]
This paper investigates the classical integer least-squares problem which estimates signals integer from linear models.
The problem is NP-hard and often arises in diverse applications such as signal processing, bioinformatics, communications and machine learning.
We propose a general hyper-accelerated tree search (HATS) algorithm by employing a deep neural network to estimate the optimal estimation for the underlying simplified memory-bounded A* algorithm.
arXiv Detail & Related papers (2021-01-07T08:00:02Z) - Private Weighted Random Walk Stochastic Gradient Descent [21.23973736418492]
We consider a decentralized learning setting in which data is distributed over nodes in a graph.
We propose two algorithms based on two types of random walks that achieve, in a decentralized way, uniform sampling and importance sampling of the data.
arXiv Detail & Related papers (2020-09-03T16:52:13Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.