Fugu-MT 論文翻訳(概要): A Distribution Testing Approach to Clustering Distributions

論文の概要: A Distribution Testing Approach to Clustering Distributions

arxiv url: http://arxiv.org/abs/2512.08376v1
Date: Tue, 09 Dec 2025 09:01:41 GMT
ステータス: 翻訳完了
システム内更新日: 2025-12-10 22:28:07.887299
Title: A Distribution Testing Approach to Clustering Distributions
Title（参考訳）: クラスタリング分布に対する分布試験手法
Authors: Gunjan Kumar, Yash Pote, Jonathan Scarlett,
Abstract要約: 2つのグループに$k$の分散を隠されたパーティションが与えられると、そのパーティションを回復することがゴールである。 2つの基本事例に対して,サンプルの複雑さの上限と下限を定めている。特に、すべてのレジームに対して$(n,k,r,varepsilon)$(最大$O(log k)$ factor)に関して厳密性を達成する。
参考スコア（独自算出の注目度）: 35.016184519329194
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: We study the following distribution clustering problem: Given a hidden partition of $k$ distributions into two groups, such that the distributions within each group are the same, and the two distributions associated with the two clusters are $\varepsilon$-far in total variation, the goal is to recover the partition. We establish upper and lower bounds on the sample complexity for two fundamental cases: (1) when one of the cluster's distributions is known, and (2) when both are unknown. Our upper and lower bounds characterize the sample complexity's dependence on the domain size $n$, number of distributions $k$, size $r$ of one of the clusters, and distance $\varepsilon$. In particular, we achieve tightness with respect to $(n,k,r,\varepsilon)$ (up to an $O(\log k)$ factor) for all regimes.
Abstract（参考訳）: 各グループ内の分布が同じであり、2つのクラスタに関連する2つの分布が合計変動で$\varepsilon$-farであるように、$k$の分布を2つのグループに隠された分割を与えられた場合、その分割を回復することが目的である。我々は,(1)クラスタの分布の1つが分かっている場合,(2)両者が未知である場合,の2つの基本事例に対して,サンプル複雑性の上限と下限を定めている。上と下の境界は、サンプルの複雑さがドメインサイズ$n$、分布数$k$、クラスタの1つのサイズ$r$、距離$\varepsilon$に依存することを特徴付けています。特に、すべてのレジームに対して$(n,k,r,\varepsilon)$(最大$O(\log k)$ factor)に関して厳密性を達成する。

論文の概要: A Distribution Testing Approach to Clustering Distributions

関連論文リスト