Efficient remedies for outlier detection with variational autoencoders
- URL: http://arxiv.org/abs/2108.08760v1
- Date: Thu, 19 Aug 2021 16:00:58 GMT
- Title: Efficient remedies for outlier detection with variational autoencoders
- Authors: Kushal Chauhan, Pradeep Shenoy, Manish Gupta and Devarajan Sridharan
- Abstract summary: Likelihoods computed by deep generative models are a candidate metric for outlier detection with unlabeled data.
We show that a theoretically-grounded correction readily ameliorates a key bias with VAE likelihood estimates.
We show that the variance of the likelihoods computed over an ensemble of VAEs also enables robust outlier detection.
- Score: 8.80692072928023
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Deep networks often make confident, yet incorrect, predictions when tested
with outlier data that is far removed from their training distributions.
Likelihoods computed by deep generative models are a candidate metric for
outlier detection with unlabeled data. Yet, previous studies have shown that
such likelihoods are unreliable and can be easily biased by simple
transformations to input data. Here, we examine outlier detection with
variational autoencoders (VAEs), among the simplest class of deep generative
models. First, we show that a theoretically-grounded correction readily
ameliorates a key bias with VAE likelihood estimates. The bias correction is
model-free, sample-specific, and accurately computed with the Bernoulli and
continuous Bernoulli visible distributions. Second, we show that a well-known
preprocessing technique, contrast normalization, extends the effectiveness of
bias correction to natural image datasets. Third, we show that the variance of
the likelihoods computed over an ensemble of VAEs also enables robust outlier
detection. We perform a comprehensive evaluation of our remedies with nine
(grayscale and natural) image datasets, and demonstrate significant advantages,
in terms of both speed and accuracy, over four other state-of-the-art methods.
Our lightweight remedies are biologically inspired and may serve to achieve
efficient outlier detection with many types of deep generative models.
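The abstract describes the three remedies only at a high level. As a rough, hedged illustration (not the authors' implementation), the Python sketch below shows one plausible reading of each: a model-free, sample-specific bias correction that subtracts the maximum attainable Bernoulli log-likelihood log p(x | x) for each input, a simple per-image contrast normalization, and the variance of per-sample log-likelihoods across an ensemble of VAEs used as an outlier score. The function names, the exact form of the correction, and the normalization recipe are assumptions made for this example.

```python
import numpy as np

def bernoulli_ll_upper_bound(x, eps=1e-7):
    """Maximum attainable Bernoulli log-likelihood for each image, log p(x | x):
    the negative binary entropy of every pixel, summed over pixels. Model-free
    and sample-specific (assumed form of the correction term)."""
    x = np.clip(x, eps, 1.0 - eps)
    axes = tuple(range(1, x.ndim))
    return np.sum(x * np.log(x) + (1.0 - x) * np.log(1.0 - x), axis=axes)

def bias_corrected_score(log_likelihood, x):
    """Subtract the per-sample upper bound so low-entropy images (e.g. mostly
    black digits) no longer receive inflated likelihoods."""
    return log_likelihood - bernoulli_ll_upper_bound(x)

def contrast_normalize(x, eps=1e-7):
    """Per-image contrast normalization (one common recipe, assumed here):
    standardize each image, then rescale it back into [0, 1]."""
    axes = tuple(range(1, x.ndim))
    x = (x - x.mean(axis=axes, keepdims=True)) / (x.std(axis=axes, keepdims=True) + eps)
    lo = x.min(axis=axes, keepdims=True)
    hi = x.max(axis=axes, keepdims=True)
    return (x - lo) / (hi - lo + eps)

def ensemble_variance_score(ensemble_log_likelihoods):
    """Outlier score from an ensemble of K VAEs: the variance of the K
    per-sample log-likelihood estimates (larger values flag outliers)."""
    return np.var(ensemble_log_likelihoods, axis=0)

# Toy usage with random stand-ins for real images and VAE log-likelihoods.
rng = np.random.default_rng(0)
images = rng.uniform(size=(8, 28, 28))          # batch of images in [0, 1]
lls = rng.normal(-150.0, 10.0, size=(5, 8))     # log-likelihoods from K=5 VAEs
corrected = bias_corrected_score(lls.mean(axis=0), contrast_normalize(images))
variance = ensemble_variance_score(lls)
```

In this sketch the bias-corrected likelihood and the ensemble variance are computed as independent scores; how the paper combines or thresholds them is not specified in the abstract.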
Related papers
- Positive Difference Distribution for Image Outlier Detection using Normalizing Flows and Contrastive Data [2.9005223064604078]
Likelihoods learned by a generative model, e.g., a normalizing flow via standard log-likelihood training, perform poorly as an outlier score.
We propose to use an unlabelled auxiliary dataset and a probabilistic outlier score for outlier detection.
We show that this is equivalent to learning the normalized positive difference between the in-distribution and the contrastive feature density.
arXiv Detail & Related papers (2022-08-30T07:00:46Z)
- Shaken, and Stirred: Long-Range Dependencies Enable Robust Outlier Detection with PixelCNN++ [6.736754991468853]
We show that biases in PixelCNN++ likelihoods arise primarily from predictions based on local dependencies.
We propose two families of transformations -- "stirring" and "shaking" -- which ameliorate low-level biases and isolate the contribution of long-range dependencies.
We test our approaches extensively with five grayscale and six natural image datasets and show that they achieve or exceed state-of-the-art outlier detection.
arXiv Detail & Related papers (2022-08-29T13:17:22Z)
- Leveraging Unlabeled Data to Predict Out-of-Distribution Performance [63.740181251997306]
Real-world machine learning deployments are characterized by mismatches between the source (training) and target (test) distributions.
In this work, we investigate methods for predicting the target domain accuracy using only labeled source data and unlabeled target data.
We propose Average Thresholded Confidence (ATC), a practical method that learns a threshold on the model's confidence and predicts target accuracy as the fraction of unlabeled examples whose confidence exceeds that threshold (a rough sketch follows the list below).
arXiv Detail & Related papers (2022-01-11T23:01:12Z)
- Robust Out-of-Distribution Detection on Deep Probabilistic Generative Models [0.06372261626436676]
Out-of-distribution (OOD) detection is an important task in machine learning systems.
Deep probabilistic generative models facilitate OOD detection by estimating the likelihood of a data sample.
We propose a new detection metric that operates without outlier exposure.
arXiv Detail & Related papers (2021-06-15T06:36:10Z)
- Evaluating State-of-the-Art Classification Models Against Bayes Optimality [106.50867011164584]
We show that we can compute the exact Bayes error of generative models learned using normalizing flows.
We use our approach to conduct a thorough investigation of state-of-the-art classification models.
arXiv Detail & Related papers (2021-06-07T06:21:20Z)
- Scalable Marginal Likelihood Estimation for Model Selection in Deep Learning [78.83598532168256]
Marginal-likelihood based model-selection is rarely used in deep learning due to estimation difficulties.
Our work shows that marginal likelihoods can improve generalization and be useful when validation data is unavailable.
arXiv Detail & Related papers (2021-04-11T09:50:24Z)
- Improving Uncertainty Calibration via Prior Augmented Data [56.88185136509654]
Neural networks have proven successful at learning from complex data distributions by acting as universal function approximators.
However, they are often overconfident in their predictions, which leads to inaccurate and miscalibrated probabilistic outputs.
We propose a solution by seeking out regions of feature space where the model is unjustifiably overconfident, and conditionally raising the entropy of those predictions towards that of the prior distribution of the labels.
arXiv Detail & Related papers (2021-02-22T07:02:37Z)
- Unlabelled Data Improves Bayesian Uncertainty Calibration under Covariate Shift [100.52588638477862]
We develop an approximate Bayesian inference scheme based on posterior regularisation.
We demonstrate the utility of our method in the context of transferring prognostic models of prostate cancer across globally diverse populations.
arXiv Detail & Related papers (2020-06-26T13:50:19Z)
- Evaluating Prediction-Time Batch Normalization for Robustness under Covariate Shift [81.74795324629712]
We evaluate prediction-time batch normalization, which significantly improves model accuracy and calibration under covariate shift (a rough sketch follows the list below).
We show that prediction-time batch normalization provides complementary benefits to existing state-of-the-art approaches for improving robustness.
The method has mixed results when used alongside pre-training, and does not seem to perform as well under more natural types of dataset shift.
arXiv Detail & Related papers (2020-06-19T05:08:43Z)
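The Average Thresholded Confidence (ATC) entry above describes the method only briefly. The Python sketch below illustrates the thresholded-confidence idea as summarized there: fit a threshold on labeled source data so that the fraction of examples above it matches source accuracy, then predict target accuracy as the fraction of unlabeled target examples above the same threshold. Using maximum softmax probability as the confidence score and a quantile fit for the threshold are assumptions made for illustration, not necessarily the paper's exact recipe.

```python
import numpy as np

def atc_fit_threshold(source_probs, source_labels):
    """Fit the ATC threshold on held-out labeled *source* data: choose t so the
    fraction of source examples with confidence above t equals source accuracy."""
    confidence = source_probs.max(axis=1)
    accuracy = (source_probs.argmax(axis=1) == source_labels).mean()
    return np.quantile(confidence, 1.0 - accuracy)

def atc_predict_accuracy(target_probs, threshold):
    """Predicted target accuracy: the fraction of unlabeled target examples
    whose confidence exceeds the learned threshold."""
    return float((target_probs.max(axis=1) > threshold).mean())

# Toy usage with random stand-ins for real softmax outputs.
rng = np.random.default_rng(0)
src_probs = rng.dirichlet(np.ones(10), size=500)
src_labels = rng.integers(0, 10, size=500)
tgt_probs = rng.dirichlet(np.ones(10), size=500)
t = atc_fit_threshold(src_probs, src_labels)
print("estimated target accuracy:", atc_predict_accuracy(tgt_probs, t))
```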
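The prediction-time batch normalization entry names the method without spelling out its mechanism; as commonly described, it recomputes batch-normalization statistics from the current test batch rather than reusing the running averages accumulated during training. The PyTorch sketch below is a minimal, assumed illustration of that idea; the helper name and the toy model are invented for the example.

```python
import torch
import torch.nn as nn

def enable_prediction_time_bn(model: nn.Module) -> None:
    """Keep the model in eval mode but switch only the BatchNorm layers back to
    training mode, so they normalize with the statistics of the current test
    batch. (train() also lets running stats update, which is harmless here
    since we only run inference.)"""
    model.eval()
    for module in model.modules():
        if isinstance(module, (nn.BatchNorm1d, nn.BatchNorm2d, nn.BatchNorm3d)):
            module.train()

# Toy usage: a tiny convolutional classifier applied to one test batch.
model = nn.Sequential(
    nn.Conv2d(3, 8, kernel_size=3), nn.BatchNorm2d(8), nn.ReLU(),
    nn.AdaptiveAvgPool2d(1), nn.Flatten(), nn.Linear(8, 10),
)
enable_prediction_time_bn(model)
with torch.no_grad():
    logits = model(torch.randn(32, 3, 32, 32))  # normalized with this batch's stats
```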