Towards efficient deep autoencoders for multivariate time series anomaly
detection
- URL: http://arxiv.org/abs/2403.02429v1
- Date: Mon, 4 Mar 2024 19:22:09 GMT
- Title: Towards efficient deep autoencoders for multivariate time series anomaly
detection
- Authors: Marcin Pietro\'n, Dominik \.Zurek, Kamil Faber, Roberto Corizzo
- Abstract summary: We propose a novel compression method for deep autoencoders that involves three key factors.
First, pruning reduces the number of weights, while preventing catastrophic drops in accuracy by means of a fast search process.
Second, linear and non-linear quantization reduces model complexity by reducing the number of bits for every single weight.
- Score: 0.8681331155356999
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Multivariate time series anomaly detection is a crucial problem in many
industrial and research applications. Timely detection of anomalies allows, for
instance, to prevent defects in manufacturing processes and failures in
cyberphysical systems. Deep learning methods are preferred among others for
their accuracy and robustness for the analysis of complex multivariate data.
However, a key aspect is being able to extract predictions in a timely manner,
to accommodate real-time requirements in different applications. In the case of
deep learning models, model reduction is extremely important to achieve optimal
results in real-time systems with limited time and memory constraints. In this
paper, we address this issue by proposing a novel compression method for deep
autoencoders that involves three key factors. First, pruning reduces the number
of weights, while preventing catastrophic drops in accuracy by means of a fast
search process that identifies high sparsity levels. Second, linear and
non-linear quantization reduces model complexity by reducing the number of bits
for every single weight. The combined contribution of these three aspects allow
the model size to be reduced, by removing a subset of the weights (pruning),
and decreasing their bit-width (quantization). As a result, the compressed
model is faster and easier to adopt in highly constrained hardware
environments. Experiments performed on popular multivariate anomaly detection
benchmarks, show that our method is capable of achieving significant model
compression ratio (between 80% and 95%) without a significant reduction in the
anomaly detection performance.
Related papers
- Graph Spatiotemporal Process for Multivariate Time Series Anomaly
Detection with Missing Values [67.76168547245237]
We introduce a novel framework called GST-Pro, which utilizes a graphtemporal process and anomaly scorer to detect anomalies.
Our experimental results show that the GST-Pro method can effectively detect anomalies in time series data and outperforms state-of-the-art methods.
arXiv Detail & Related papers (2024-01-11T10:10:16Z) - Sparse Binary Transformers for Multivariate Time Series Modeling [1.3965477771846404]
We show that lightweight Compressed Neural Networks can achieve accuracy comparable to dense floating-point Transformers.
Our model achieves favorable results across three time series learning tasks: classification, anomaly detection, and single-step forecasting.
We measure the computational savings of our approach over a range of metrics including parameter count, bit size, and floating point operation (FLOPs) count.
arXiv Detail & Related papers (2023-08-09T00:23:04Z) - Multi-scale Fusion Fault Diagnosis Method Based on Two-Dimensionaliztion
Sequence in Complex Scenarios [0.0]
Rolling bearings are critical components in rotating machinery, and their faults can cause severe damage.
Early detection of abnormalities is crucial to prevent catastrophic accidents.
Traditional and intelligent methods have been used to analyze time series data, but in real-life scenarios, sensor data is often noisy and cannot be accurately characterized in the time domain.
This paper proposes an improved convolutional neural network method with a multi-scale feature fusion model and deep learning compression techniques for deployment in industrial scenarios.
arXiv Detail & Related papers (2023-04-11T13:05:50Z) - Mixed Precision Post Training Quantization of Neural Networks with
Sensitivity Guided Search [7.392278887917975]
Mixed-precision quantization allows different tensors to be quantized to varying levels of numerical precision.
We evaluate our method for computer vision and natural language processing and demonstrate latency reductions of up to 27.59% and 34.31%.
arXiv Detail & Related papers (2023-02-02T19:30:00Z) - Fundamental Limits of Two-layer Autoencoders, and Achieving Them with
Gradient Methods [91.54785981649228]
This paper focuses on non-linear two-layer autoencoders trained in the challenging proportional regime.
Our results characterize the minimizers of the population risk, and show that such minimizers are achieved by gradient methods.
For the special case of a sign activation function, our analysis establishes the fundamental limits for the lossy compression of Gaussian sources via (shallow) autoencoders.
arXiv Detail & Related papers (2022-12-27T12:37:34Z) - Unified Multivariate Gaussian Mixture for Efficient Neural Image
Compression [151.3826781154146]
latent variables with priors and hyperpriors is an essential problem in variational image compression.
We find inter-correlations and intra-correlations exist when observing latent variables in a vectorized perspective.
Our model has better rate-distortion performance and an impressive $3.18times$ compression speed up.
arXiv Detail & Related papers (2022-03-21T11:44:17Z) - Time-Series Anomaly Detection with Implicit Neural Representation [0.38073142980733]
Implicit Neural Representation-based Anomaly Detection (INRAD) is proposed.
We train a simple multi-layer perceptron that takes time as input and outputs corresponding values at that time.
Then we utilize the representation error as an anomaly score for detecting anomalies.
arXiv Detail & Related papers (2022-01-28T06:17:24Z) - TELESTO: A Graph Neural Network Model for Anomaly Classification in
Cloud Services [77.454688257702]
Machine learning (ML) and artificial intelligence (AI) are applied on IT system operation and maintenance.
One direction aims at the recognition of re-occurring anomaly types to enable remediation automation.
We propose a method that is invariant to dimensionality changes of given data.
arXiv Detail & Related papers (2021-02-25T14:24:49Z) - Variable Skipping for Autoregressive Range Density Estimation [84.60428050170687]
We show a technique, variable skipping, for accelerating range density estimation over deep autoregressive models.
We show that variable skipping provides 10-100$times$ efficiency improvements when targeting challenging high-quantile error metrics.
arXiv Detail & Related papers (2020-07-10T19:01:40Z) - Automatic heterogeneous quantization of deep neural networks for
low-latency inference on the edge for particle detectors [5.609098985493794]
We introduce a method for designing optimally heterogeneously quantized versions of deep neural network models for minimum-energy, high-accuracy, nanosecond inference and fully automated deployment on chip.
This is crucial for the event selection procedure in proton-proton collisions at the CERN Large Hadron Collider, where resources are strictly limited and a latency of $mathcal O(1)mu$s is required.
arXiv Detail & Related papers (2020-06-15T15:07:49Z) - SUOD: Accelerating Large-Scale Unsupervised Heterogeneous Outlier
Detection [63.253850875265115]
Outlier detection (OD) is a key machine learning (ML) task for identifying abnormal objects from general samples.
We propose a modular acceleration system, called SUOD, to address it.
arXiv Detail & Related papers (2020-03-11T00:22:50Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.