Topological Federated Clustering via Gravitational Potential Fields under Local Differential Privacy
- URL: http://arxiv.org/abs/2512.00849v1
- Date: Sun, 30 Nov 2025 11:41:16 GMT
- Title: Topological Federated Clustering via Gravitational Potential Fields under Local Differential Privacy
- Authors: Yunbo Long, Jiaquan Zhang, Xi Chen, Alexandra Brintrup,
- Abstract summary: Existing one-shot methods rely on unstable pairwise centroid distances or neighborhood rankings.<n>We present Gravitational Federated Clustering (GFC), a novel approach to privacy-preserving federated clustering.<n>GFC transforms privatized client centroids into a global gravitational potential field.
- Score: 46.295754114458134
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Clustering non-independent and identically distributed (non-IID) data under local differential privacy (LDP) in federated settings presents a critical challenge: preserving privacy while maintaining accuracy without iterative communication. Existing one-shot methods rely on unstable pairwise centroid distances or neighborhood rankings, degrading severely under strong LDP noise and data heterogeneity. We present Gravitational Federated Clustering (GFC), a novel approach to privacy-preserving federated clustering that overcomes the limitations of distance-based methods under varying LDP. Addressing the critical challenge of clustering non-IID data with diverse privacy guarantees, GFC transforms privatized client centroids into a global gravitational potential field where true cluster centers emerge as topologically persistent singularities. Our framework introduces two key innovations: (1) a client-side compactness-aware perturbation mechanism that encodes local cluster geometry as "mass" values, and (2) a server-side topological aggregation phase that extracts stable centroids through persistent homology analysis of the potential field's superlevel sets. Theoretically, we establish a closed-form bound between the privacy budget $ε$ and centroid estimation error, proving the potential field's Lipschitz smoothing properties exponentially suppress noise in high-density regions. Empirically, GFC outperforms state-of-the-art methods on ten benchmarks, especially under strong LDP constraints ($ε< 1$), while maintaining comparable performance at lower privacy budgets. By reformulating federated clustering as a topological persistence problem in a synthetic physics-inspired space, GFC achieves unprecedented privacy-accuracy trade-offs without iterative communication, providing a new perspective for privacy-preserving distributed learning.
Related papers
- A Secure and Private Distributed Bayesian Federated Learning Design [56.92336577799572]
Distributed Federated Learning (DFL) enables decentralized model training across large-scale systems without a central parameter server.<n>DFL faces three critical challenges: privacy leakage from honest-but-curious neighbors, slow convergence due to the lack of central coordination, and vulnerability to Byzantine adversaries aiming to degrade model accuracy.<n>We propose a novel DFL framework that integrates Byzantine robustness, privacy preservation, and convergence acceleration.
arXiv Detail & Related papers (2026-02-23T16:12:02Z) - VertCoHiRF: Decentralized Vertical Clustering Beyond k-means [7.0930706640450225]
VertCoHiRF is a fully decentralized framework for vertical federated clustering based on structural consensus across heterogeneous views.<n>Agents cluster their local views independently and reconcile their proposals through identifier-level consensus.<n>We analyze communication complexity and robustness, and experiments demonstrate competitive clustering performance in vertical federated settings.
arXiv Detail & Related papers (2026-02-07T00:06:23Z) - Towards Federated Clustering: A Client-wise Private Graph Aggregation Framework [57.04850867402913]
Federated clustering addresses the challenge of extracting patterns from decentralized, unlabeled data.<n>We propose Structural Privacy-Preserving Federated Graph Clustering (SPP-FGC), a novel algorithm that innovatively leverages local structural graphs as the primary medium for privacy-preserving knowledge sharing.<n>Our framework achieves state-of-the-art performance, improving clustering accuracy by up to 10% (NMI) over federated baselines while maintaining provable privacy guarantees.
arXiv Detail & Related papers (2025-11-14T03:05:22Z) - GCAO: Group-driven Clustering via Gravitational Attraction and Optimization [10.85103265888665]
We propose a Group-driven Clustering via Gravitational Attraction and Optimization (GCAO) algorithm.<n>GCAO aggregates low-density boundary points into collaboratively moving groups, replacing the traditional point-based contraction process.<n>Experiments on multiple high-dimensional datasets demonstrate that GCAO outperforms 11 representative clustering methods.
arXiv Detail & Related papers (2025-10-27T12:22:24Z) - Differentially Private Federated Clustering with Random Rebalancing [9.331231828491461]
Federated clustering aims to group similar clients into clusters and produce one model for each cluster.<n>We propose RR-Cluster, that can be viewed as a light-weight add-on to many federated clustering algorithms.<n>We analyze the tradeoffs between decreased privacy noise variance and potentially increased bias from incorrect assignments.
arXiv Detail & Related papers (2025-08-08T09:56:47Z) - OmniFC: Rethinking Federated Clustering via Lossless and Secure Distance Reconstruction [10.326927761316986]
Federated clustering aims to discover global cluster structures across decentralized clients without sharing raw data.<n>There are two critical challenges: (1) privacy leakage during collaboration, and (2) robustness degradation due to aggregation of proxy information.<n>We propose Omni Federated Clustering, a unified and model-agnostic framework.
arXiv Detail & Related papers (2025-05-19T13:04:59Z) - Interaction-Aware Gaussian Weighting for Clustered Federated Learning [58.92159838586751]
Federated Learning (FL) emerged as a decentralized paradigm to train models while preserving privacy.<n>We propose a novel clustered FL method, FedGWC (Federated Gaussian Weighting Clustering), which groups clients based on their data distribution.<n>Our experiments on benchmark datasets show that FedGWC outperforms existing FL algorithms in cluster quality and classification accuracy.
arXiv Detail & Related papers (2025-02-05T16:33:36Z) - Differentially Private Federated Clustering over Non-IID Data [59.611244450530315]
clustering clusters (FedC) problem aims to accurately partition unlabeled data samples distributed over massive clients into finite clients under the orchestration of a server.
We propose a novel FedC algorithm using differential privacy convergence technique, referred to as DP-Fed, in which partial participation and multiple clients are also considered.
Various attributes of the proposed DP-Fed are obtained through theoretical analyses of privacy protection, especially for the case of non-identically and independently distributed (non-i.i.d.) data.
arXiv Detail & Related papers (2023-01-03T05:38:43Z) - Privacy-Preserving Federated Deep Clustering based on GAN [12.256298398007848]
We present a novel approach to Federated Deep Clustering based on Generative Adversarial Networks (GANs)
Each client trains a local generative adversarial network (GAN) locally and uploads the synthetic data to the server.
The server applies a deep clustering network on the synthetic data to establish $k$ cluster centroids, which are then downloaded to the clients for cluster assignment.
arXiv Detail & Related papers (2022-11-30T13:20:11Z) - Graph-Homomorphic Perturbations for Private Decentralized Learning [64.26238893241322]
Local exchange of estimates allows inference of data based on private data.
perturbations chosen independently at every agent, resulting in a significant performance loss.
We propose an alternative scheme, which constructs perturbations according to a particular nullspace condition, allowing them to be invisible.
arXiv Detail & Related papers (2020-10-23T10:35:35Z) - Stable and consistent density-based clustering via multiparameter persistence [49.1574468325115]
We consider the degree-Rips construction from topological data analysis.<n>We analyze its stability to perturbations of the input data using the correspondence-interleaving distance.<n>We integrate these methods into a pipeline for density-based clustering, which we call Persistable.
arXiv Detail & Related papers (2020-05-18T19:45:04Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.