Fugu-MT 論文翻訳(概要): Federated Hierarchical Clustering with Automatic Selection of Optimal Cluster Numbers

論文の概要: Federated Hierarchical Clustering with Automatic Selection of Optimal Cluster Numbers

arxiv url: http://arxiv.org/abs/2603.12684v1
Date: Fri, 13 Mar 2026 05:58:35 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-16 17:38:11.931479
Title: Federated Hierarchical Clustering with Automatic Selection of Optimal Cluster Numbers
Title（参考訳）: 最適クラスタ数の自動選択による階層クラスタリング
Authors: Yue Zhang, Chuanlong Qiu, Xinfa Liao, Yiqun Zhang,
Abstract要約: Federated Clustering(FC)は、分散およびプライバシ保護されたデータから、教師なしの方法でデータ分散パターンを探索する、新しくて有望なソリューションである。我々はFed-$k*$-HCという新しいFCフレームワークを提案し、階層的クラスタリングによって探索されたデータ分布に基づいて最適なクラスタ数$k*$を自動的に決定できる。
参考スコア（独自算出の注目度）: 13.396894845465468
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Federated Clustering (FC) is an emerging and promising solution in exploring data distribution patterns from distributed and privacy-protected data in an unsupervised manner. Existing FC methods implicitly rely on the assumption that clients are with a known number of uniformly sized clusters. However, the true number of clusters is typically unknown, and cluster sizes are naturally imbalanced in real scenarios. Furthermore, the privacy-preserving transmission constraints in federated learning inevitably reduce usable information, making the development of robust and accurate FC extremely challenging. Accordingly, we propose a novel FC framework named Fed-$k^*$-HC, which can automatically determine an optimal number of clusters $k^*$ based on the data distribution explored through hierarchical clustering. To obtain the global data distribution for $k^*$ determination, we let each client generate micro-subclusters. Their prototypes are then uploaded to the server for hierarchical merging. The density-based merging design allows exploring clusters of varying sizes and shapes, and the progressive merging process can self-terminate according to the neighboring relationships among the prototypes to determine $k^*$. Extensive experiments on diverse datasets demonstrate the FC capability of the proposed Fed-$k^*$-HC in accurately exploring a proper number of clusters.
Abstract（参考訳）: Federated Clustering(FC)は、分散およびプライバシ保護されたデータから、教師なしの方法でデータ分散パターンを探索する、新しくて有望なソリューションである。既存のFCメソッドは、クライアントが既知の一様サイズのクラスタの数を持っているという仮定に暗黙的に依存している。しかし、クラスタの真の数は通常不明であり、クラスタのサイズは実際のシナリオでは自然に不均衡である。さらに、フェデレーション学習におけるプライバシー保護送信制約は、必然的に使用可能な情報を低減し、堅牢で正確なFCの開発を極めて困難にする。そこで我々はFed-$k^*$-HCという新しいFCフレームワークを提案する。 k^*$決定のためのグローバルなデータ分布を得るために、各クライアントがマイクロサブクラスタを生成する。プロトタイプは階層的なマージのためにサーバにアップロードされる。密度に基づくマージング設計により、異なる大きさと形状のクラスタを探索することができ、プログレッシブマージングプロセスはプロトタイプ間の隣り合う関係に応じて自己決定し、$k^*$を決定することができる。多様なデータセットに対する大規模な実験は、適切な数のクラスタを正確に探索する際、提案したFed-$k^*$-HCのFC能力を示す。

論文の概要: Federated Hierarchical Clustering with Automatic Selection of Optimal Cluster Numbers

関連論文リスト