Clustering for directed graphs using parametrized random walk diffusion
kernels
- URL: http://arxiv.org/abs/2210.00310v1
- Date: Sat, 1 Oct 2022 16:13:40 GMT
- Title: Clustering for directed graphs using parametrized random walk diffusion
kernels
- Authors: Harry Sevi, Matthieu Jonckheere, Argyris Kalogeratos
- Abstract summary: We introduce a new clustering framework, the Parametrized Random Walk Diffusion Kernel Clustering (P-RWDKC)
Our framework is based on the diffusion geometry and the generalized spectral clustering framework.
Experiments on $K$-NN graphs constructed from real-world datasets and real-world graphs show that our clustering approach performs well in all tested cases.
- Score: 5.145741425164946
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Clustering based on the random walk operator has been proven effective for
undirected graphs, but its generalization to directed graphs (digraphs) is much
more challenging. Although the random walk operator is well-defined for
digraphs, in most cases such graphs are not strongly connected, and hence the
associated random walks are not irreducible, which is a crucial property for
clustering that exists naturally in the undirected setting. To remedy this, the
usual workaround is to either naively symmetrize the adjacency matrix or to
replace the natural random walk operator by the teleporting random walk
operator, but this can lead to the loss of valuable information carried by edge
directionality. In this paper, we introduce a new clustering framework, the
Parametrized Random Walk Diffusion Kernel Clustering (P-RWDKC), which is
suitable for handling both directed and undirected graphs. Our framework is
based on the diffusion geometry and the generalized spectral clustering
framework. Accordingly, we propose an algorithm that automatically reveals the
cluster structure at a given scale, by considering the random walk dynamics
associated with a parametrized kernel operator, and by estimating its critical
diffusion time. Experiments on $K$-NN graphs constructed from real-world
datasets and real-world graphs show that our clustering approach performs well
in all tested cases, and outperforms existing approaches in most of them.
Related papers
- HeNCler: Node Clustering in Heterophilous Graphs through Learned Asymmetric Similarity [55.27586970082595]
HeNCler is a novel approach for Heterophilous Node Clustering.
We show that HeNCler significantly enhances performance in node clustering tasks within heterophilous graph contexts.
arXiv Detail & Related papers (2024-05-27T11:04:05Z) - Interpretable Multi-View Clustering Based on Anchor Graph Tensor Factorization [64.00146569922028]
Multi-view clustering methods based on anchor graph factorization lack adequate cluster interpretability for the decomposed matrix.
We address this limitation by using non-negative tensor factorization to decompose an anchor graph tensor that combines anchor graphs from multiple views.
arXiv Detail & Related papers (2024-04-01T03:23:55Z) - Reinforcement Graph Clustering with Unknown Cluster Number [91.4861135742095]
We propose a new deep graph clustering method termed Reinforcement Graph Clustering.
In our proposed method, cluster number determination and unsupervised representation learning are unified into a uniform framework.
In order to conduct feedback actions, the clustering-oriented reward function is proposed to enhance the cohesion of the same clusters and separate the different clusters.
arXiv Detail & Related papers (2023-08-13T18:12:28Z) - Latent Random Steps as Relaxations of Max-Cut, Min-Cut, and More [30.919536115917726]
We present a probabilistic model based on non-negative matrix factorization which unifies clustering and simplification.
By relaxing the hard clustering to a soft clustering, our algorithm relaxes potentially hard clustering problems to a tractable ones.
arXiv Detail & Related papers (2023-08-12T02:47:57Z) - Optimal Propagation for Graph Neural Networks [51.08426265813481]
We propose a bi-level optimization approach for learning the optimal graph structure.
We also explore a low-rank approximation model for further reducing the time complexity.
arXiv Detail & Related papers (2022-05-06T03:37:00Z) - Generalized Spectral Clustering for Directed and Undirected Graphs [4.286327408435937]
We present a generalized spectral clustering framework that can address both directed and undirected graphs.
Our approach is based on the spectral relaxation of a new functional that we introduce as the generalized Dirichlet energy of a graph function.
We also propose a practical parametrization of the regularizing measure constructed from the iterated powers of the natural random walk on the graph.
arXiv Detail & Related papers (2022-03-07T09:18:42Z) - Skew-Symmetric Adjacency Matrices for Clustering Directed Graphs [5.301300942803395]
Cut-based directed graph (digraph) clustering often focuses on finding dense within-cluster or sparse between-cluster connections.
For flow-based clusterings the edges between clusters tend to be oriented in one direction and have been found in migration data, food webs, and trade data.
arXiv Detail & Related papers (2022-03-02T20:07:04Z) - Private Weighted Random Walk Stochastic Gradient Descent [21.23973736418492]
We consider a decentralized learning setting in which data is distributed over nodes in a graph.
We propose two algorithms based on two types of random walks that achieve, in a decentralized way, uniform sampling and importance sampling of the data.
arXiv Detail & Related papers (2020-09-03T16:52:13Z) - Structured Graph Learning for Clustering and Semi-supervised
Classification [74.35376212789132]
We propose a graph learning framework to preserve both the local and global structure of data.
Our method uses the self-expressiveness of samples to capture the global structure and adaptive neighbor approach to respect the local structure.
Our model is equivalent to a combination of kernel k-means and k-means methods under certain condition.
arXiv Detail & Related papers (2020-08-31T08:41:20Z) - Block-Approximated Exponential Random Graphs [77.4792558024487]
An important challenge in the field of exponential random graphs (ERGs) is the fitting of non-trivial ERGs on large graphs.
We propose an approximative framework to such non-trivial ERGs that result in dyadic independence (i.e., edge independent) distributions.
Our methods are scalable to sparse graphs consisting of millions of nodes.
arXiv Detail & Related papers (2020-02-14T11:42:16Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.