Autoencoder-based Anomaly Detection in Streaming Data with Incremental
Learning and Concept Drift Adaptation
- URL: http://arxiv.org/abs/2305.08977v2
- Date: Tue, 5 Sep 2023 18:31:18 GMT
- Title: Autoencoder-based Anomaly Detection in Streaming Data with Incremental
Learning and Concept Drift Adaptation
- Authors: Jin Li, Kleanthis Malialis, Marios M. Polycarpou
- Abstract summary: The paper proposes an autoencoder-based incremental learning method with drift detection (strAEm++DD)
Our proposed method strAEm++DD leverages on the advantages of both incremental learning and drift detection.
We conduct an experimental study using real-world and synthetic datasets with severe or extreme class imbalance, and provide an empirical analysis of strAEm++DD.
- Score: 10.41066461952124
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In our digital universe nowadays, enormous amount of data are produced in a
streaming manner in a variety of application areas. These data are often
unlabelled. In this case, identifying infrequent events, such as anomalies,
poses a great challenge. This problem becomes even more difficult in
non-stationary environments, which can cause deterioration of the predictive
performance of a model. To address the above challenges, the paper proposes an
autoencoder-based incremental learning method with drift detection
(strAEm++DD). Our proposed method strAEm++DD leverages on the advantages of
both incremental learning and drift detection. We conduct an experimental study
using real-world and synthetic datasets with severe or extreme class imbalance,
and provide an empirical analysis of strAEm++DD. We further conduct a
comparative study, showing that the proposed method significantly outperforms
existing baseline and advanced methods.
Related papers
- SUDS: A Strategy for Unsupervised Drift Sampling [0.5437605013181142]
Supervised machine learning encounters concept drift, where the data distribution changes over time, degrading performance.
We present the Strategy for Drift Sampling (SUDS), a novel method that selects homogeneous samples for retraining using existing drift detection algorithms.
Our results demonstrate the efficacy of SUDS in optimizing labeled data use in dynamic environments.
arXiv Detail & Related papers (2024-11-05T10:55:29Z) - Self-Supervised Time-Series Anomaly Detection Using Learnable Data Augmentation [37.72735288760648]
We propose a learnable data augmentation-based time-series anomaly detection (LATAD) technique that is trained in a self-supervised manner.
LATAD extracts discriminative features from time-series data through contrastive learning.
As per the results, LATAD exhibited comparable or improved performance to the state-of-the-art anomaly detection assessments.
arXiv Detail & Related papers (2024-06-18T04:25:56Z) - Simple Ingredients for Offline Reinforcement Learning [86.1988266277766]
offline reinforcement learning algorithms have proven effective on datasets highly connected to the target downstream task.
We show that existing methods struggle with diverse data: their performance considerably deteriorates as data collected for related but different tasks is simply added to the offline buffer.
We show that scale, more than algorithmic considerations, is the key factor influencing performance.
arXiv Detail & Related papers (2024-03-19T18:57:53Z) - Stochastic Amortization: A Unified Approach to Accelerate Feature and Data Attribution [62.71425232332837]
We show that training amortized models with noisy labels is inexpensive and surprisingly effective.
This approach significantly accelerates several feature attribution and data valuation methods, often yielding an order of magnitude speedup over existing approaches.
arXiv Detail & Related papers (2024-01-29T03:42:37Z) - A Discrepancy Aware Framework for Robust Anomaly Detection [51.710249807397695]
We present a Discrepancy Aware Framework (DAF), which demonstrates robust performance consistently with simple and cheap strategies.
Our method leverages an appearance-agnostic cue to guide the decoder in identifying defects, thereby alleviating its reliance on synthetic appearance.
Under the simple synthesis strategies, it outperforms existing methods by a large margin. Furthermore, it also achieves the state-of-the-art localization performance.
arXiv Detail & Related papers (2023-10-11T15:21:40Z) - Enhancing Multiple Reliability Measures via Nuisance-extended
Information Bottleneck [77.37409441129995]
In practical scenarios where training data is limited, many predictive signals in the data can be rather from some biases in data acquisition.
We consider an adversarial threat model under a mutual information constraint to cover a wider class of perturbations in training.
We propose an autoencoder-based training to implement the objective, as well as practical encoder designs to facilitate the proposed hybrid discriminative-generative training.
arXiv Detail & Related papers (2023-03-24T16:03:21Z) - Adaptive Model Pooling for Online Deep Anomaly Detection from a Complex
Evolving Data Stream [15.599296461516984]
This paper presents a framework for online deep anomaly detection, ARCUS, which can be instantiated with any autoencoder-based deep anomaly detection methods.
It handles the complex and evolving data streams using an adaptive model pooling approach with two novel techniques.
In comprehensive experiments with ten data sets which are both high-dimensional and concept-drifted, ARCUS improved the anomaly detection accuracy of the streaming variants of state-of-the-art autoencoder-based methods by up to 22% and 37%, respectively.
arXiv Detail & Related papers (2022-06-09T23:11:43Z) - Closing the sim-to-real gap in guided wave damage detection with
adversarial training of variational auto-encoders [0.0]
We focus on the primary task of damage detection, where signal processing techniques are commonly employed.
We train an ensemble of variational autoencoders only on simulation data with a wave physics-guided adversarial component.
We compare our scheme with existing deep learning detection schemes and observe superior performance on experimental data.
arXiv Detail & Related papers (2022-01-26T17:36:11Z) - DAE : Discriminatory Auto-Encoder for multivariate time-series anomaly
detection in air transportation [68.8204255655161]
We propose a novel anomaly detection model called Discriminatory Auto-Encoder (DAE)
It uses the baseline of a regular LSTM-based auto-encoder but with several decoders, each getting data of a specific flight phase.
Results show that the DAE achieves better results in both accuracy and speed of detection.
arXiv Detail & Related papers (2021-09-08T14:07:55Z) - Unsupervised Deep Anomaly Detection for Multi-Sensor Time-Series Signals [10.866594993485226]
We propose a novel deep learning-based anomaly detection algorithm called Deep Convolutional Autoencoding Memory network (CAE-M)
We first build a Deep Convolutional Autoencoder to characterize spatial dependence of multi-sensor data with a Maximum Mean Discrepancy (MMD)
Then, we construct a Memory Network consisting of linear (Autoregressive Model) and non-linear predictions (Bigressive LSTM with Attention) to capture temporal dependence from time-series data.
arXiv Detail & Related papers (2021-07-27T06:48:20Z) - DEALIO: Data-Efficient Adversarial Learning for Imitation from
Observation [57.358212277226315]
In imitation learning from observation IfO, a learning agent seeks to imitate a demonstrating agent using only observations of the demonstrated behavior without access to the control signals generated by the demonstrator.
Recent methods based on adversarial imitation learning have led to state-of-the-art performance on IfO problems, but they typically suffer from high sample complexity due to a reliance on data-inefficient, model-free reinforcement learning algorithms.
This issue makes them impractical to deploy in real-world settings, where gathering samples can incur high costs in terms of time, energy, and risk.
We propose a more data-efficient IfO algorithm
arXiv Detail & Related papers (2021-03-31T23:46:32Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.