A Meta-level Analysis of Online Anomaly Detectors
- URL: http://arxiv.org/abs/2209.05899v1
- Date: Tue, 13 Sep 2022 11:28:15 GMT
- Title: A Meta-level Analysis of Online Anomaly Detectors
- Authors: Antonios Ntroumpogiannis, Michail Giannoulis, Nikolaos Myrtakis,
Vassilis Christophides, Eric Simon, Ioannis Tsamardinos
- Abstract summary: Real-time detection of anomalies in streaming data is receiving increasing attention.
Yet, little attention has been given to compare the effectiveness and efficiency of anomaly detectors for streaming data.
We present a qualitative, synthetic overview of major online detectors from different algorithmic families.
- Score: 4.852567314334134
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Real-time detection of anomalies in streaming data is receiving increasing
attention as it allows us to raise alerts, predict faults, and detect
intrusions or threats across industries. Yet, little attention has been given
to compare the effectiveness and efficiency of anomaly detectors for streaming
data (i.e., of online algorithms). In this paper, we present a qualitative,
synthetic overview of major online detectors from different algorithmic
families (i.e., distance, density, tree or projection-based) and highlight
their main ideas for constructing, updating and testing detection models. Then,
we provide a thorough analysis of the results of a quantitative experimental
evaluation of online detection algorithms along with their offline
counterparts. The behavior of the detectors is correlated with the
characteristics of different datasets (i.e., meta-features), thereby providing
a meta-level analysis of their performance. Our study addresses several missing
insights from the literature such as (a) how reliable are detectors against a
random classifier and what dataset characteristics make them perform randomly;
(b) to what extent online detectors approximate the performance of offline
counterparts; (c) which sketch strategy and update primitives of detectors are
best to detect anomalies visible only within a feature subspace of a dataset;
(d) what are the tradeoffs between the effectiveness and the efficiency of
detectors belonging to different algorithmic families; (e) which specific
characteristics of datasets yield an online algorithm to outperform all others.
Related papers
- Research on Dynamic Data Flow Anomaly Detection based on Machine Learning [11.526496773281938]
In this study, the unsupervised learning method is employed to identify anomalies in dynamic data flows.
By clustering similar data, the model is able to detect data behaviour that deviates significantly from normal traffic without the need for labelled data.
Notably, it demonstrates robust and adaptable performance, particularly in the context of unbalanced data.
arXiv Detail & Related papers (2024-09-23T08:19:15Z) - A Self-Supervised Task for Fault Detection in Satellite Multivariate Time Series [45.31237646796715]
This work proposes a novel approach leveraging Physics-Informed Real NVP neural networks, renowned for their ability to model complex and high-dimensional distributions.
The experiments involve various configurations, including pre-training with self-supervision, multi-task learning, and standalone self-supervised training.
Results indicate significant performance improvements across all settings.
arXiv Detail & Related papers (2024-07-03T07:19:41Z) - Benchmarking Cross-Domain Audio-Visual Deception Detection [45.342156006617394]
We present the first cross-domain audio-visual deception detection benchmark.
We compare single-to-single and multi-to-single domain generalization performance.
We propose an algorithm to enhance the generalization performance.
arXiv Detail & Related papers (2024-05-11T12:06:31Z) - A Robust and Explainable Data-Driven Anomaly Detection Approach For
Power Electronics [56.86150790999639]
We present two anomaly detection and classification approaches, namely the Matrix Profile algorithm and anomaly transformer.
The Matrix Profile algorithm is shown to be well suited as a generalizable approach for detecting real-time anomalies in streaming time-series data.
A series of custom filters is created and added to the detector to tune its sensitivity, recall, and detection accuracy.
arXiv Detail & Related papers (2022-09-23T06:09:35Z) - Functional Anomaly Detection: a Benchmark Study [4.444788548423704]
Anomaly detection can now rely on measurements sampled at a very high frequency.
It is the purpose of this paper to investigate the performance of recent techniques for anomaly detection in the functional setup on real datasets.
arXiv Detail & Related papers (2022-01-13T18:20:32Z) - DAE : Discriminatory Auto-Encoder for multivariate time-series anomaly
detection in air transportation [68.8204255655161]
We propose a novel anomaly detection model called Discriminatory Auto-Encoder (DAE)
It uses the baseline of a regular LSTM-based auto-encoder but with several decoders, each getting data of a specific flight phase.
Results show that the DAE achieves better results in both accuracy and speed of detection.
arXiv Detail & Related papers (2021-09-08T14:07:55Z) - An algorithm-based multiple detection influence measure for high
dimensional regression using expectile [0.4999814847776096]
We propose an algorithm-based, multi-step, multiple detection procedure to identify influential observations.
Our three-step algorithm to identify and capture undesirable variability in the data, $asymMIP,$ is based on two complementary statistics.
The application of our method to the Autism Brain Imaging Data Exchange dataset resulted in a more balanced and accurate prediction of brain maturity.
arXiv Detail & Related papers (2021-05-26T01:16:24Z) - Anomaly Detection Based on Selection and Weighting in Latent Space [73.01328671569759]
We propose a novel selection-and-weighting-based anomaly detection framework called SWAD.
Experiments on both benchmark and real-world datasets have shown the effectiveness and superiority of SWAD.
arXiv Detail & Related papers (2021-03-08T10:56:38Z) - Unsupervised Anomaly Detectors to Detect Intrusions in the Current
Threat Landscape [0.11470070927586014]
We show that Isolation Forests, One-Class Support Vector Machines and Self-Organizing Maps are more effective than their counterparts for intrusion detection.
We detail how attacks with unstable, distributed or non-repeatable behavior as Fuzzing, Worms and Botnets are more difficult to detect.
arXiv Detail & Related papers (2020-12-21T14:06:58Z) - Bayesian Optimization with Machine Learning Algorithms Towards Anomaly
Detection [66.05992706105224]
In this paper, an effective anomaly detection framework is proposed utilizing Bayesian Optimization technique.
The performance of the considered algorithms is evaluated using the ISCX 2012 dataset.
Experimental results show the effectiveness of the proposed framework in term of accuracy rate, precision, low-false alarm rate, and recall.
arXiv Detail & Related papers (2020-08-05T19:29:35Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.