FedSpectral+: Spectral Clustering using Federated Learning
- URL: http://arxiv.org/abs/2302.02137v1
- Date: Sat, 4 Feb 2023 09:28:36 GMT
- Title: FedSpectral+: Spectral Clustering using Federated Learning
- Authors: Janvi Thakkar, Devvrat Joshi
- Abstract summary: We propose a spectral clustering algorithm using federated learning (FL) to overcome issues of data privacy and increased communication costs.
FL is a privacy-protecting algorithm that accumulates model parameters from each local learner rather than collecting users' raw data.
FedSpectral is a baseline approach that uses local spectral clustering labels to aggregate the global spectral clustering by creating a similarity graph.
FedSpectral+, a state-of-the-art approach, uses the power method to learn the global spectral embedding by incorporating the entire graph data without access to the raw information distributed among the clients.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Clustering in graphs has been a well-known research problem, particularly
because most Internet and social network data is in the form of graphs.
Organizations widely use spectral clustering algorithms to find clustering in
graph datasets. However, applying spectral clustering to a large dataset is
challenging due to computational overhead. While the distributed spectral
clustering algorithm exists, they face the problem of data privacy and
increased communication costs between the clients. Thus, in this paper, we
propose a spectral clustering algorithm using federated learning (FL) to
overcome these issues. FL is a privacy-protecting algorithm that accumulates
model parameters from each local learner rather than collecting users' raw
data, thus providing both scalability and data privacy. We developed two
approaches: FedSpectral and FedSpectral+. FedSpectral is a baseline approach
that uses local spectral clustering labels to aggregate the global spectral
clustering by creating a similarity graph. FedSpectral+, a state-of-the-art
approach, uses the power iteration method to learn the global spectral
embedding by incorporating the entire graph data without access to the raw
information distributed among the clients. We further designed our own
similarity metric to check the clustering quality of the distributed approach
to that of the original/non-FL clustering. The proposed approach FedSpectral+
obtained a similarity of 98.85% and 99.8%, comparable to that of global
clustering on the ego-Facebook and email-Eu-core dataset.
Related papers
- Attributed Graph Clustering via Generalized Quaternion Representation Learning [38.98084537010657]
We propose a graph auto-encoder network, which introduces quaternion operations to the encoders to achieve efficient structured feature representation learning.
It turns out that the representations of nodes learned by the proposed Graph Clustering are more discriminative, containing global distribution information, and are more general, suiting downstream clustering under different $k$s.
arXiv Detail & Related papers (2024-11-22T04:46:31Z) - A Unified Framework for Fair Spectral Clustering With Effective Graph
Learning [12.343382413705394]
We consider the problem of spectral clustering under group fairness constraints.
In practice, the graph is usually unknown, and we need to construct the underlying graph from potentially noisy data.
We propose a novel graph construction method with a node-adaptive graph filter to learn graphs from noisy data.
arXiv Detail & Related papers (2023-11-23T01:43:00Z) - Dynamically Weighted Federated k-Means [0.0]
Federated clustering enables multiple data sources to collaboratively cluster their data, maintaining decentralization and preserving privacy.
We introduce a novel federated clustering algorithm named Dynamically Weighted Federated k-means (DWF k-means) based on Lloyd's method for k-means clustering.
We conduct experiments on multiple datasets and data distribution settings to evaluate the performance of our algorithm in terms of clustering score, accuracy, and v-measure.
arXiv Detail & Related papers (2023-10-23T12:28:21Z) - Reinforcement Graph Clustering with Unknown Cluster Number [91.4861135742095]
We propose a new deep graph clustering method termed Reinforcement Graph Clustering.
In our proposed method, cluster number determination and unsupervised representation learning are unified into a uniform framework.
In order to conduct feedback actions, the clustering-oriented reward function is proposed to enhance the cohesion of the same clusters and separate the different clusters.
arXiv Detail & Related papers (2023-08-13T18:12:28Z) - ClusterNet: A Perception-Based Clustering Model for Scattered Data [16.326062082938215]
Cluster separation in scatterplots is a task that is typically tackled by widely used clustering techniques.
We propose a learning strategy which directly operates on scattered data.
We train ClusterNet, a point-based deep learning model, trained to reflect human perception of cluster separability.
arXiv Detail & Related papers (2023-04-27T13:41:12Z) - Hard Regularization to Prevent Deep Online Clustering Collapse without
Data Augmentation [65.268245109828]
Online deep clustering refers to the joint use of a feature extraction network and a clustering model to assign cluster labels to each new data point or batch as it is processed.
While faster and more versatile than offline methods, online clustering can easily reach the collapsed solution where the encoder maps all inputs to the same point and all are put into a single cluster.
We propose a method that does not require data augmentation, and that, differently from existing methods, regularizes the hard assignments.
arXiv Detail & Related papers (2023-03-29T08:23:26Z) - GLCC: A General Framework for Graph-level Clustering [5.069852282550117]
This paper studies the problem of graph-level clustering, which is a novel yet challenging task.
We propose a general graph-level clustering framework named Graph-Level Contrastive Clustering (GLCC)
Experiments on a range of well-known datasets demonstrate the superiority of our proposed GLCC over competitive baselines.
arXiv Detail & Related papers (2022-10-21T11:08:10Z) - Self-supervised Contrastive Attributed Graph Clustering [110.52694943592974]
We propose a novel attributed graph clustering network, namely Self-supervised Contrastive Attributed Graph Clustering (SCAGC)
In SCAGC, by leveraging inaccurate clustering labels, a self-supervised contrastive loss, are designed for node representation learning.
For the OOS nodes, SCAGC can directly calculate their clustering labels.
arXiv Detail & Related papers (2021-10-15T03:25:28Z) - Spectral-Spatial Global Graph Reasoning for Hyperspectral Image
Classification [50.899576891296235]
Convolutional neural networks have been widely applied to hyperspectral image classification.
Recent methods attempt to address this issue by performing graph convolutions on spatial topologies.
arXiv Detail & Related papers (2021-06-26T06:24:51Z) - Scalable Hierarchical Agglomerative Clustering [65.66407726145619]
Existing scalable hierarchical clustering methods sacrifice quality for speed.
We present a scalable, agglomerative method for hierarchical clustering that does not sacrifice quality and scales to billions of data points.
arXiv Detail & Related papers (2020-10-22T15:58:35Z) - LSD-C: Linearly Separable Deep Clusters [145.89790963544314]
We present LSD-C, a novel method to identify clusters in an unlabeled dataset.
Our method draws inspiration from recent semi-supervised learning practice and proposes to combine our clustering algorithm with self-supervised pretraining and strong data augmentation.
We show that our approach significantly outperforms competitors on popular public image benchmarks including CIFAR 10/100, STL 10 and MNIST, as well as the document classification dataset Reuters 10K.
arXiv Detail & Related papers (2020-06-17T17:58:10Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.