Clustering a Mixture of Gaussians with Unknown Covariance
- URL: http://arxiv.org/abs/2110.01602v1
- Date: Mon, 4 Oct 2021 17:59:20 GMT
- Title: Clustering a Mixture of Gaussians with Unknown Covariance
- Authors: Damek Davis, Mateo Diaz, Kaizheng Wang
- Abstract summary: We derive a Max-Cut integer program based on maximum likelihood estimation.
We develop an efficient spectral algorithm that attains the optimal rate but requires a quadratic sample size.
We generalize the Max-Cut program to a $k$-means program that handles multi-component mixtures with possibly unequal weights.
- Score: 4.821312633849745
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: We investigate a clustering problem with data from a mixture of Gaussians
that share a common but unknown, and potentially ill-conditioned, covariance
matrix. We start by considering Gaussian mixtures with two equally-sized
components and derive a Max-Cut integer program based on maximum likelihood
estimation. We prove its solutions achieve the optimal misclassification rate
when the number of samples grows linearly in the dimension, up to a logarithmic
factor. However, solving the Max-Cut problem appears to be computationally
intractable. To overcome this, we develop an efficient spectral algorithm that
attains the optimal rate but requires a quadratic sample size. Although this
sample complexity is worse than that of the Max-Cut problem, we conjecture that
no polynomial-time method can perform better. Furthermore, we gather numerical
and theoretical evidence that supports the existence of a
statistical-computational gap. Finally, we generalize the Max-Cut program to a
$k$-means program that handles multi-component mixtures with possibly unequal
weights. It enjoys similar optimality guarantees for mixtures of distributions
that satisfy a transportation-cost inequality, encompassing Gaussian and
strongly log-concave distributions.
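To make the Max-Cut derivation concrete, the following is a minimal reconstruction sketch, assuming the symmetric two-component model $x_i = z_i \mu + \Sigma^{1/2} g_i$ with labels $z_i \in \{\pm 1\}$ and $g_i \sim N(0, I_d)$; the paper's exact program may differ in scaling or constraints.

```latex
% Reconstruction sketch: how maximum likelihood yields a Max-Cut-type program.
% Stack the samples as rows of X \in \mathbb{R}^{n \times d} and fix labels z.
% Profiling out (\mu, \Sigma) by maximum likelihood gives
\[
  \hat{\mu}(z) = \tfrac{1}{n} X^{\top} z, \qquad
  \hat{\Sigma}(z) = S - \hat{\mu}(z)\,\hat{\mu}(z)^{\top}, \qquad
  S = \tfrac{1}{n} X^{\top} X .
\]
% The profile log-likelihood is $-\tfrac{n}{2}\log\det\hat{\Sigma}(z)$ plus a
% constant, and the rank-one determinant identity
\[
  \det\!\bigl(S - \hat{\mu}\hat{\mu}^{\top}\bigr)
  = \det(S)\,\bigl(1 - \hat{\mu}^{\top} S^{-1} \hat{\mu}\bigr)
\]
% shows that maximizing the likelihood over labels reduces to
\[
  \max_{z \in \{\pm 1\}^{n}} \; z^{\top} X \,(X^{\top} X)^{-1} X^{\top} z .
\]
```

Any exact solver must search a combinatorial set of sign vectors, which is consistent with the abstract's remark that the Max-Cut problem appears computationally intractable; the spectral algorithm trades this hardness for a larger (quadratic) sample size.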
Related papers
- Maximum a Posteriori Inference for Factor Graphs via Benders' Decomposition [0.38233569758620056]
We present a method for maximum a posteriori inference in general Bayesian factor models.
We derive MAP estimation algorithms for the Bayesian Gaussian mixture model and latent Dirichlet allocation.
arXiv Detail & Related papers (2024-10-24T19:57:56Z)
- Learning general Gaussian mixtures with efficient score matching [16.06356123715737]
We study the problem of learning mixtures of $k$ Gaussians in $d$ dimensions.
We make no separation assumptions on the underlying mixture components.
We give an algorithm that draws $d^{\mathrm{poly}(k/\varepsilon)}$ samples from the target mixture, runs in sample-polynomial time, and constructs a sampler.
arXiv Detail & Related papers (2024-04-29T17:30:36Z)
- Fit Like You Sample: Sample-Efficient Generalized Score Matching from Fast Mixing Diffusions [29.488555741982015]
We show a close connection between the mixing time of a broad class of Markov processes with generator $\mathcal{L}$ and an appropriately chosen generalized score matching loss.
We adapt techniques to speed up Markov chains to construct better score-matching losses.
In particular, "preconditioning" the diffusion can be translated to an appropriate "preconditioning" of the score loss.
arXiv Detail & Related papers (2023-06-15T17:58:42Z)
- Approximating a RUM from Distributions on k-Slates [88.32814292632675]
We give a polynomial-time algorithm that finds the RUM that best approximates the given distribution on average.
Our theoretical result can also be made practical: we obtain an algorithm that is effective and scales to real-world datasets.
arXiv Detail & Related papers (2023-05-22T17:43:34Z)
- A Robust and Flexible EM Algorithm for Mixtures of Elliptical Distributions with Missing Data [71.9573352891936]
This paper tackles the problem of missing data imputation for noisy and non-Gaussian data.
A new EM algorithm is investigated for mixtures of elliptical distributions with the property of handling potential missing data.
Experimental results on synthetic data demonstrate that the proposed algorithm is robust to outliers and can be used with non-Gaussian data.
arXiv Detail & Related papers (2022-01-28T10:01:37Z)
- Learning Gaussian Mixtures with Generalised Linear Models: Precise Asymptotics in High-dimensions [79.35722941720734]
Generalised linear models for multi-class classification problems are one of the fundamental building blocks of modern machine learning tasks.
We prove exact asymptotics characterising the estimator in high dimensions via empirical risk minimisation.
We discuss how our theory can be applied beyond the scope of synthetic data.
arXiv Detail & Related papers (2021-06-07T16:53:56Z)
- Robustly Learning Mixtures of $k$ Arbitrary Gaussians [47.40835932474677]
We give a polynomial-time algorithm for the problem of robustly estimating a mixture of $k$ arbitrary Gaussians in $\mathbb{R}^d$, for any fixed $k$, in the presence of a constant fraction of arbitrary corruptions.
Our main tools are an efficient partial clustering algorithm that relies on the sum-of-squares method, and a novel tensor decomposition algorithm that allows errors in both Frobenius norm and low-rank terms.
arXiv Detail & Related papers (2020-12-03T17:54:03Z)
- Robustly Learning any Clusterable Mixture of Gaussians [55.41573600814391]
We study the efficient learnability of high-dimensional Gaussian mixtures in the adversarial-robust setting.
We provide an algorithm that learns the components of an $\epsilon$-corrupted $k$-mixture within information-theoretically near-optimal error of $\tilde{O}(\epsilon)$.
Our main technical contribution is a new robust identifiability proof of clusters from a Gaussian mixture, which can be captured by the constant-degree Sum of Squares proof system.
arXiv Detail & Related papers (2020-05-13T16:44:12Z)
- Optimal Randomized First-Order Methods for Least-Squares Problems [56.05635751529922]
This class of algorithms encompasses several randomized methods among the fastest solvers for least-squares problems.
We focus on two classical embeddings, namely, Gaussian projections and subsampled Hadamard transforms.
Our resulting algorithm yields the best complexity known for solving least-squares problems with no condition number dependence.
arXiv Detail & Related papers (2020-02-21T17:45:32Z)
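As a concrete companion to the last entry's mention of Gaussian projections, here is a minimal NumPy sketch of the generic sketch-and-precondition approach to least squares; the function name, oversampling factor, and direct final solve are illustrative assumptions, not the paper's algorithm.

```python
import numpy as np

def sketched_least_squares(A, b, sketch_size=None, seed=0):
    """Solve min_x ||A x - b||_2 with a Gaussian-sketch preconditioner.

    Illustrative scheme: draw a Gaussian embedding S, take R from a QR of
    S @ A, solve the well-conditioned problem min_y ||(A R^{-1}) y - b||,
    then map back x = R^{-1} y.
    """
    m, d = A.shape
    k = sketch_size or 4 * d                       # modest oversampling over d
    rng = np.random.default_rng(seed)
    S = rng.standard_normal((k, m)) / np.sqrt(k)   # Gaussian projection
    _, R = np.linalg.qr(S @ A)                     # R^T R approximates A^T A
    A_pre = np.linalg.solve(R.T, A.T).T            # A @ inv(R), no explicit inverse
    # A_pre has O(1) condition number with high probability; in practice an
    # iterative solver (e.g. LSQR/CG) would run here with an iteration count
    # independent of cond(A). A direct solve keeps this sketch short.
    y, *_ = np.linalg.lstsq(A_pre, b, rcond=None)
    return np.linalg.solve(R, y)                   # undo the change of variables

# Toy usage: a tall, badly conditioned system.
rng = np.random.default_rng(1)
A = rng.standard_normal((2000, 50)) @ np.diag(np.logspace(0, 6, 50))
x_true = rng.standard_normal(50)
b = A @ x_true
print(np.linalg.norm(sketched_least_squares(A, b) - x_true))
```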
This list is automatically generated from the titles and abstracts of the papers on this site.
This site does not guarantee the quality of its content (including all information) and is not responsible for any consequences.