FedSpectral+: Spectral Clustering using Federated Learning
- URL: http://arxiv.org/abs/2302.02137v1
- Date: Sat, 4 Feb 2023 09:28:36 GMT
- Title: FedSpectral+: Spectral Clustering using Federated Learning
- Authors: Janvi Thakkar, Devvrat Joshi
- Abstract summary: We propose a spectral clustering algorithm using federated learning (FL) to overcome issues of data privacy and increased communication costs.
FL is a privacy-preserving framework that aggregates model parameters from each local learner rather than collecting users' raw data.
FedSpectral is a baseline approach that uses local spectral clustering labels to aggregate the global spectral clustering by creating a similarity graph.
FedSpectral+, a state-of-the-art approach, uses the power method to learn the global spectral embedding by incorporating the entire graph data without access to the raw information distributed among the clients.
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Clustering in graphs has been a well-known research problem, particularly
because most Internet and social network data is in the form of graphs.
Organizations widely use spectral clustering algorithms to find clusters in
graph datasets. However, applying spectral clustering to a large dataset is
challenging due to computational overhead. While distributed spectral
clustering algorithms exist, they face the problems of data privacy and
increased communication costs between clients. Thus, in this paper, we
propose a spectral clustering algorithm using federated learning (FL) to
overcome these issues. FL is a privacy-preserving framework that aggregates
model parameters from each local learner rather than collecting users' raw
data, thus providing both scalability and data privacy. We developed two
approaches: FedSpectral and FedSpectral+. FedSpectral is a baseline approach
that uses local spectral clustering labels to aggregate the global spectral
clustering by creating a similarity graph. FedSpectral+, a state-of-the-art
approach, uses the power iteration method to learn the global spectral
embedding by incorporating the entire graph data without access to the raw
information distributed among the clients. We further designed our own
similarity metric to compare the clustering quality of the distributed
approach with that of the original, non-FL clustering. The proposed
FedSpectral+ approach obtained similarities of 98.85% and 99.8% with global
clustering on the ego-Facebook and email-Eu-core datasets, respectively.
Related papers
- Federated Incomplete Multi-View Clustering with Heterogeneous Graph Neural Networks
Federated multi-view clustering offers the potential to develop a global clustering model using data distributed across multiple devices.
Current methods face challenges due to the absence of label information and the paramount importance of data privacy.
We introduce a federated incomplete multi-view clustering framework with heterogeneous graph neural networks (FIM-GNNs).
arXiv Detail & Related papers (2024-06-12T07:16:00Z)
- A Unified Framework for Fair Spectral Clustering With Effective Graph Learning
We consider the problem of spectral clustering under group fairness constraints.
In practice, the graph is usually unknown, and we need to construct the underlying graph from potentially noisy data.
We propose a novel graph construction method with a node-adaptive graph filter to learn graphs from noisy data.
arXiv Detail & Related papers (2023-11-23T01:43:00Z)
- Dynamically Weighted Federated k-Means
Federated clustering enables multiple data sources to collaboratively cluster their data, maintaining decentralization and preserving privacy.
We introduce a novel federated clustering algorithm named Dynamically Weighted Federated k-means (DWF k-means) based on Lloyd's method for k-means clustering.
We conduct experiments on multiple datasets and data distribution settings to evaluate the performance of our algorithm in terms of clustering score, accuracy, and v-measure.
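The aggregation pattern behind a federated Lloyd's k-means can be illustrated with a small sketch: clients run local assignment steps and send only per-cluster sums and counts, which the server combines into a count-weighted centroid update. The dynamic weighting scheme of DWF k-means itself refines this and is not reproduced here; the two-client Gaussian data and the one-point-per-client initialization are assumptions made for the sketch.

```python
import numpy as np

rng = np.random.default_rng(1)

# Hypothetical setup: two clients, each holding a private Gaussian shard.
clients = [rng.normal(loc=c, scale=0.3, size=(50, 2)) for c in (0.0, 5.0)]
k = 2

def local_step(X, centroids):
    """One Lloyd assignment step on a client's private shard.
    Only per-cluster sums and counts are sent to the server."""
    dists = np.linalg.norm(X[:, None, :] - centroids[None, :, :], axis=2)
    labels = dists.argmin(axis=1)
    sums = np.zeros_like(centroids)
    counts = np.zeros(k)
    for j in range(k):
        sums[j] = X[labels == j].sum(axis=0)
        counts[j] = np.count_nonzero(labels == j)
    return sums, counts

def federated_kmeans(rounds=10):
    # Simple init: one point from each client (an assumption of the sketch).
    centroids = np.vstack([clients[0][0], clients[1][0]])
    for _ in range(rounds):
        stats = [local_step(X, centroids) for X in clients]
        total_sums = sum(s for s, _ in stats)
        total_counts = sum(c for _, c in stats)
        # Server update: count-weighted mean of the client statistics.
        nonempty = total_counts > 0
        centroids[nonempty] = total_sums[nonempty] / total_counts[nonempty, None]
    return centroids

centroids = federated_kmeans()
```

Weighting each centroid by the number of points assigned to it across clients makes the update identical to a centralized Lloyd step, without any raw data leaving a client.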
arXiv Detail & Related papers (2023-10-23T12:28:21Z)
- Reinforcement Graph Clustering with Unknown Cluster Number
We propose a new deep graph clustering method termed Reinforcement Graph Clustering.
In our proposed method, cluster number determination and unsupervised representation learning are unified into a uniform framework.
In order to conduct feedback actions, the clustering-oriented reward function is proposed to enhance the cohesion of the same clusters and separate the different clusters.
arXiv Detail & Related papers (2023-08-13T18:12:28Z)
- ClusterNet: A Perception-Based Clustering Model for Scattered Data
Cluster separation in scatterplots is a task that is typically tackled by widely used clustering techniques.
We propose a learning strategy which directly operates on scattered data.
We train ClusterNet, a point-based deep learning model, to reflect human perception of cluster separability.
arXiv Detail & Related papers (2023-04-27T13:41:12Z)
- Hard Regularization to Prevent Deep Online Clustering Collapse without Data Augmentation
Online deep clustering refers to the joint use of a feature extraction network and a clustering model to assign cluster labels to each new data point or batch as it is processed.
While faster and more versatile than offline methods, online clustering can easily reach the collapsed solution where the encoder maps all inputs to the same point and all are put into a single cluster.
We propose a method that does not require data augmentation, and that, differently from existing methods, regularizes the hard assignments.
arXiv Detail & Related papers (2023-03-29T08:23:26Z)
- GLCC: A General Framework for Graph-level Clustering
This paper studies the problem of graph-level clustering, which is a novel yet challenging task.
We propose a general graph-level clustering framework named Graph-Level Contrastive Clustering (GLCC).
Experiments on a range of well-known datasets demonstrate the superiority of our proposed GLCC over competitive baselines.
arXiv Detail & Related papers (2022-10-21T11:08:10Z)
- Self-supervised Contrastive Attributed Graph Clustering
We propose a novel attributed graph clustering network, namely Self-supervised Contrastive Attributed Graph Clustering (SCAGC).
In SCAGC, a self-supervised contrastive loss that leverages inaccurate clustering labels is designed for node representation learning.
For out-of-sample (OOS) nodes, SCAGC can directly calculate their clustering labels.
arXiv Detail & Related papers (2021-10-15T03:25:28Z)
- Spectral-Spatial Global Graph Reasoning for Hyperspectral Image Classification
Convolutional neural networks have been widely applied to hyperspectral image classification.
Recent methods attempt to address the limitations of convolutional kernels by performing graph convolutions on spatial topologies.
arXiv Detail & Related papers (2021-06-26T06:24:51Z)
- Scalable Hierarchical Agglomerative Clustering
Existing scalable hierarchical clustering methods sacrifice quality for speed.
We present a scalable, agglomerative method for hierarchical clustering that does not sacrifice quality and scales to billions of data points.
arXiv Detail & Related papers (2020-10-22T15:58:35Z)
- LSD-C: Linearly Separable Deep Clusters
We present LSD-C, a novel method to identify clusters in an unlabeled dataset.
Our method draws inspiration from recent semi-supervised learning practice and proposes to combine our clustering algorithm with self-supervised pretraining and strong data augmentation.
We show that our approach significantly outperforms competitors on popular public image benchmarks including CIFAR 10/100, STL 10 and MNIST, as well as the document classification dataset Reuters 10K.
arXiv Detail & Related papers (2020-06-17T17:58:10Z)
This list is automatically generated from the titles and abstracts of the papers in this site.