Probabilistic Classification by Density Estimation Using Gaussian
Mixture Model and Masked Autoregressive Flow
- URL: http://arxiv.org/abs/2310.10843v1
- Date: Mon, 16 Oct 2023 21:37:22 GMT
- Title: Probabilistic Classification by Density Estimation Using Gaussian
Mixture Model and Masked Autoregressive Flow
- Authors: Benyamin Ghojogh, Milad Amir Toutounchian
- Abstract summary: Density estimation, which estimates the distribution of data, is an important category of probabilistic machine learning.
In this paper, we use density estimators for classification, although they are typically used for estimating the distribution of data.
We model the likelihood of each class by density estimation, specifically using GMM and MAF.
- Score: 1.3706331473063882
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Density estimation, which estimates the distribution of data, is an important
category of probabilistic machine learning. A family of density estimators is
mixture models, such as Gaussian Mixture Model (GMM) by expectation
maximization. Another family of density estimators is the generative models
which generate data from input latent variables. One of the generative models
is the Masked Autoregressive Flow (MAF) which makes use of normalizing flows
and autoregressive networks. In this paper, we use these density estimators for
classification, although they are typically used for estimating the distribution
of data. We model the likelihood of each class of data by density estimation,
specifically using GMM and MAF. The proposed classifiers outperform simpler
classifiers such as linear discriminant analysis, which models the likelihood
using only a single Gaussian distribution per class. This work opens the door to
proposing other probabilistic classifiers based on joint density estimation.
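As a minimal sketch of this class-conditional density approach, the snippet below fits one scikit-learn GaussianMixture (trained via expectation maximization) per class and classifies with Bayes' rule over the class-conditional log-likelihoods plus log-priors. The helper names, component count, and toy data are illustrative assumptions, not the authors' code; a Masked Autoregressive Flow could be dropped into the same slot, since only a log-density method is required.

```python
import numpy as np
from sklearn.mixture import GaussianMixture

def fit_class_densities(X, y, n_components=3, seed=0):
    """Fit one GMM (via EM) per class; return the models and empirical log-priors."""
    models, log_priors = {}, {}
    for c in np.unique(y):
        Xc = X[y == c]
        models[c] = GaussianMixture(n_components=n_components, random_state=seed).fit(Xc)
        log_priors[c] = np.log(len(Xc) / len(X))  # log p(c) from class frequencies
    return models, log_priors

def predict(X, models, log_priors):
    """Bayes' rule: argmax_c [ log p(x | c) + log p(c) ]."""
    classes = sorted(models)
    # score_samples returns the per-sample log-likelihood under each class model
    scores = np.column_stack([models[c].score_samples(X) + log_priors[c] for c in classes])
    return np.asarray(classes)[scores.argmax(axis=1)]

# Toy usage on two synthetic 2-D classes.
rng = np.random.default_rng(0)
X = np.vstack([rng.normal(0.0, 1.0, (200, 2)), rng.normal(3.0, 1.0, (200, 2))])
y = np.array([0] * 200 + [1] * 200)
models, log_priors = fit_class_densities(X, y)
print(predict(X[:5], models, log_priors))
```

With n_components=1 the per-class model collapses to a single Gaussian, recovering the discriminant-analysis baseline the abstract compares against; increasing the components, or swapping in a MAF, is what gives the proposed classifiers their extra flexibility.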
Related papers
- Empirical Density Estimation based on Spline Quasi-Interpolation with
applications to Copulas clustering modeling [0.0]
Density estimation is a fundamental technique employed in various fields to model and to understand the underlying distribution of data.
In this paper, we propose a monovariate approximation of the density using spline quasi-interpolation.
The presented algorithm is validated on artificial and real datasets.
arXiv Detail & Related papers (2024-02-18T11:49:38Z)
- Finite Mixtures of Multivariate Poisson-Log Normal Factor Analyzers for
Clustering Count Data [0.8499685241219366]
A class of eight parsimonious mixture models based on the mixture of factor analyzers model is introduced.
The proposed models are explored in the context of clustering discrete data arising from RNA sequencing studies.
arXiv Detail & Related papers (2023-11-13T21:23:15Z)
- Context-specific kernel-based hidden Markov model for time series
analysis [9.007829035130886]
We introduce a new hidden Markov model based on kernel density estimation.
It is capable of capturing kernel dependencies using context-specific Bayesian networks.
The benefits in likelihood and classification accuracy from the proposed model are quantified and analyzed.
arXiv Detail & Related papers (2023-01-24T09:10:38Z)
- Learning from aggregated data with a maximum entropy model [73.63512438583375]
We show how a new model, similar to logistic regression, may be learned from only aggregated data by approximating the unobserved feature distribution with a maximum entropy hypothesis.
We present empirical evidence on several public datasets that the model learned this way can achieve performance comparable to that of a logistic model trained with the full unaggregated data.
arXiv Detail & Related papers (2022-10-05T09:17:27Z)
- A Robust and Flexible EM Algorithm for Mixtures of Elliptical
Distributions with Missing Data [71.9573352891936]
This paper tackles the problem of missing data imputation for noisy and non-Gaussian data.
A new EM algorithm is investigated for mixtures of elliptical distributions with the property of handling potential missing data.
Experimental results on synthetic data demonstrate that the proposed algorithm is robust to outliers and can be used with non-Gaussian data.
arXiv Detail & Related papers (2022-01-28T10:01:37Z)
- Continual Learning with Fully Probabilistic Models [70.3497683558609]
We present an approach for continual learning based on fully probabilistic (or generative) models of machine learning.
We propose a pseudo-rehearsal approach using a Gaussian Mixture Model (GMM) instance for both generator and classifier functionalities.
We show that the resulting approach, GMR, achieves state-of-the-art performance on common class-incremental learning problems at very competitive time and memory complexity.
arXiv Detail & Related papers (2021-04-19T12:26:26Z)
- Scalable Marginal Likelihood Estimation for Model Selection in Deep
Learning [78.83598532168256]
Marginal-likelihood-based model selection is rarely used in deep learning due to estimation difficulties.
Our work shows that marginal likelihoods can improve generalization and be useful when validation data is unavailable.
arXiv Detail & Related papers (2021-04-11T09:50:24Z)
- Variational Mixture of Normalizing Flows [0.0]
Deep generative models, such as generative adversarial networks, variational autoencoders, and their variants, have seen wide adoption for the task of modelling complex data distributions.
Normalizing flows overcome the lack of tractable exact likelihoods in such models by leveraging the change-of-variables formula for probability density functions.
The present work builds on this by using normalizing flows as components in a mixture model and devising an end-to-end training procedure for such a model.
arXiv Detail & Related papers (2020-09-01T17:20:08Z)
- Handling missing data in model-based clustering [0.0]
We propose two methods to fit Gaussian mixtures in the presence of missing data.
Both methods use a variant of the Monte Carlo Expectation-Maximisation algorithm for data augmentation.
We show that the proposed methods outperform the multiple imputation approach, both in terms of cluster identification and density estimation.
arXiv Detail & Related papers (2020-06-04T15:36:31Z)
- TraDE: Transformers for Density Estimation [101.20137732920718]
TraDE is a self-attention-based architecture for auto-regressive density estimation.
We present a suite of tasks such as regression using generated samples, out-of-distribution detection, and robustness to noise in the training data.
arXiv Detail & Related papers (2020-04-06T07:32:51Z)
- On the Discrepancy between Density Estimation and Sequence Generation [92.70116082182076]
Log-likelihood is highly correlated with BLEU when we consider models within the same family.
We observe no correlation between rankings of models across different families.
arXiv Detail & Related papers (2020-02-17T20:13:35Z)