Anomaly Detection for High-Dimensional Data Using Large Deviations
Principle
- URL: http://arxiv.org/abs/2109.13698v1
- Date: Tue, 28 Sep 2021 13:13:14 GMT
- Title: Anomaly Detection for High-Dimensional Data Using Large Deviations
Principle
- Authors: Sreelekha Guggilam and Varun Chandola and Abani Patra
- Abstract summary: We propose an anomaly detection algorithm that can scale to high-dimensional data using concepts from the theory of large deviations.
The proposed Large Deviations Anomaly Detection (LAD) algorithm is shown to outperform state of art anomaly detection methods on a variety of large and high-dimensional benchmark data sets.
- Score: 0.8526086056172273
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: Most current anomaly detection methods suffer from the curse of
dimensionality when dealing with high-dimensional data. We propose an anomaly
detection algorithm that can scale to high-dimensional data using concepts from
the theory of large deviations. The proposed Large Deviations Anomaly Detection
(LAD) algorithm is shown to outperform state of art anomaly detection methods
on a variety of large and high-dimensional benchmark data sets. Exploiting the
ability of the algorithm to scale to high-dimensional data, we propose an
online anomaly detection method to identify anomalies in a collection of
multivariate time series. We demonstrate the applicability of the online
algorithm in identifying counties in the United States with anomalous trends in
terms of COVID-19 related cases and deaths. Several of the identified anomalous
counties correlate with counties with documented poor response to the COVID
pandemic.
Related papers
- Unsupervised Anomaly Detection Using Diffusion Trend Analysis [48.19821513256158]
We propose a method to detect anomalies by analysis of reconstruction trend depending on the degree of degradation.
The proposed method is validated on an open dataset for industrial anomaly detection.
arXiv Detail & Related papers (2024-07-12T01:50:07Z) - Anomaly Detection Based on Isolation Mechanisms: A Survey [13.449446806837422]
Isolation-based unsupervised anomaly detection is a novel and effective approach for identifying anomalies in data.
We review the state-of-the-art isolation-based anomaly detection methods, including their data partitioning strategies, anomaly score functions, and algorithmic details.
arXiv Detail & Related papers (2024-03-16T04:29:21Z) - AnomalyDiffusion: Few-Shot Anomaly Image Generation with Diffusion Model [59.08735812631131]
Anomaly inspection plays an important role in industrial manufacture.
Existing anomaly inspection methods are limited in their performance due to insufficient anomaly data.
We propose AnomalyDiffusion, a novel diffusion-based few-shot anomaly generation model.
arXiv Detail & Related papers (2023-12-10T05:13:40Z) - MSFlow: Multi-Scale Flow-based Framework for Unsupervised Anomaly
Detection [124.52227588930543]
Unsupervised anomaly detection (UAD) attracts a lot of research interest and drives widespread applications.
An inconspicuous yet powerful statistics model, the normalizing flows, is appropriate for anomaly detection and localization in an unsupervised fashion.
We propose a novel Multi-Scale Flow-based framework dubbed MSFlow composed of asymmetrical parallel flows followed by a fusion flow.
Our MSFlow achieves a new state-of-the-art with a detection AUORC score of up to 99.7%, localization AUCROC score of 98.8%, and PRO score of 97.1%.
arXiv Detail & Related papers (2023-08-29T13:38:35Z) - AGAD: Adversarial Generative Anomaly Detection [12.68966318231776]
Anomaly detection suffered from the lack of anomalies due to the diversity of abnormalities and the difficulties of obtaining large-scale anomaly data.
We propose Adversarial Generative Anomaly Detection (AGAD), a self-contrast-based anomaly detection paradigm.
Our method generates pseudo-anomaly data for both supervised and semi-supervised anomaly detection scenarios.
arXiv Detail & Related papers (2023-04-09T10:40:02Z) - Explainable Deep Few-shot Anomaly Detection with Deviation Networks [123.46611927225963]
We introduce a novel weakly-supervised anomaly detection framework to train detection models.
The proposed approach learns discriminative normality by leveraging the labeled anomalies and a prior probability.
Our model is substantially more sample-efficient and robust, and performs significantly better than state-of-the-art competing methods in both closed-set and open-set settings.
arXiv Detail & Related papers (2021-08-01T14:33:17Z) - DASVDD: Deep Autoencoding Support Vector Data Descriptor for Anomaly
Detection [9.19194451963411]
Semi-supervised anomaly detection aims to detect anomalies from normal samples using a model that is trained on normal data.
We propose a method, DASVDD, that jointly learns the parameters of an autoencoder while minimizing the volume of an enclosing hyper-sphere on its latent representation.
arXiv Detail & Related papers (2021-06-09T21:57:41Z) - Sub-clusters of Normal Data for Anomaly Detection [0.15229257192293197]
Anomaly detection in data analysis is an interesting but still challenging research topic in real world applications.
Existing anomaly detection methods show limited performances with high dimensional data such as ImageNet.
In this paper, we study anomaly detection with high dimensional and complex normal data.
arXiv Detail & Related papers (2020-11-17T03:53:31Z) - Algorithmic Frameworks for the Detection of High Density Anomalies [0.0]
High-density anomalies are deviant cases positioned in the most normal regions of the data space.
This study introduces several non-parametric algorithmic frameworks for unsupervised detection.
arXiv Detail & Related papers (2020-10-09T17:48:02Z) - TadGAN: Time Series Anomaly Detection Using Generative Adversarial
Networks [73.01104041298031]
TadGAN is an unsupervised anomaly detection approach built on Generative Adversarial Networks (GANs)
To capture the temporal correlations of time series, we use LSTM Recurrent Neural Networks as base models for Generators and Critics.
To demonstrate the performance and generalizability of our approach, we test several anomaly scoring techniques and report the best-suited one.
arXiv Detail & Related papers (2020-09-16T15:52:04Z) - Toward Deep Supervised Anomaly Detection: Reinforcement Learning from
Partially Labeled Anomaly Data [150.9270911031327]
We consider the problem of anomaly detection with a small set of partially labeled anomaly examples and a large-scale unlabeled dataset.
Existing related methods either exclusively fit the limited anomaly examples that typically do not span the entire set of anomalies, or proceed with unsupervised learning from the unlabeled data.
We propose here instead a deep reinforcement learning-based approach that enables an end-to-end optimization of the detection of both labeled and unlabeled anomalies.
arXiv Detail & Related papers (2020-09-15T03:05:39Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.