TELESTO: A Graph Neural Network Model for Anomaly Classification in
Cloud Services
- URL: http://arxiv.org/abs/2102.12877v1
- Date: Thu, 25 Feb 2021 14:24:49 GMT
- Title: TELESTO: A Graph Neural Network Model for Anomaly Classification in
Cloud Services
- Authors: Dominik Scheinert, Alexander Acker
- Abstract summary: Machine learning (ML) and artificial intelligence (AI) are applied on IT system operation and maintenance.
One direction aims at the recognition of re-occurring anomaly types to enable remediation automation.
We propose a method that is invariant to dimensionality changes of given data.
- Score: 77.454688257702
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Deployment, operation and maintenance of large IT systems becomes
increasingly complex and puts human experts under extreme stress when problems
occur. Therefore, utilization of machine learning (ML) and artificial
intelligence (AI) is applied on IT system operation and maintenance -
summarized in the term AIOps. One specific direction aims at the recognition of
re-occurring anomaly types to enable remediation automation. However, due to IT
system specific properties, especially their frequent changes (e.g. software
updates, reconfiguration or hardware modernization), recognition of reoccurring
anomaly types is challenging. Current methods mainly assume a static
dimensionality of provided data. We propose a method that is invariant to
dimensionality changes of given data. Resource metric data such as CPU
utilization, allocated memory and others are modelled as multivariate time
series. The extraction of temporal and spatial features together with the
subsequent anomaly classification is realized by utilizing TELESTO, our novel
graph convolutional neural network (GCNN) architecture. The experimental
evaluation is conducted in a real-world cloud testbed deployment that is
hosting two applications. Classification results of injected anomalies on a
cassandra database node show that TELESTO outperforms the alternative GCNNs and
achieves an overall classification accuracy of 85.1%. Classification results
for the other nodes show accuracy values between 85% and 60%.
Related papers
- Hypergraph-based multi-scale spatio-temporal graph convolution network for Time-Series anomaly detection [8.878898677348086]
Multi-dimensional time series anomaly detection technology plays an important role in many fields including aerospace, water treatment, cloud service providers, etc.
It is becoming increasingly challenging to perform effective and accurate anomaly detection in high-dimensional and complex data sets.
We propose a hypergraph basedtemporal graph convolutional network model STGCN_Hyper, which explicitly captures high-order, multi-hop correlations between multiple variables.
Our model can flexibly learn the multi-scale time series features in the data and the dependencies between features, and outperforms most existing baseline models in terms of precision, recall, F1-score on anomaly detection
arXiv Detail & Related papers (2024-10-29T17:19:18Z) - Neural networks for insurance pricing with frequency and severity data: a benchmark study from data preprocessing to technical tariff [2.4578723416255754]
We present a benchmark study on four insurance data sets with frequency and severity targets in the presence of multiple types of input features.
We compare in detail the performance of a generalized linear model on binned input data, a gradient-boosted tree model, a feed-forward neural network (FFNN), and the combined actuarial neural network (CANN)
arXiv Detail & Related papers (2023-10-19T12:00:33Z) - Continuous time recurrent neural networks: overview and application to
forecasting blood glucose in the intensive care unit [56.801856519460465]
Continuous time autoregressive recurrent neural networks (CTRNNs) are a deep learning model that account for irregular observations.
We demonstrate the application of these models to probabilistic forecasting of blood glucose in a critical care setting.
arXiv Detail & Related papers (2023-04-14T09:39:06Z) - Systematic Evaluation of Deep Learning Models for Log-based Failure Prediction [3.3810628880631226]
This paper systematically investigates the combination of log data embedding strategies and Deep Learning (DL) types for failure prediction.
To that end, we propose a modular architecture to accommodate various configurations of embedding strategies and DL-based encoders.
Using the F1 score metric, our results show that the best overall performing configuration is a CNN-based encoder with Logkey2vec.
arXiv Detail & Related papers (2023-03-13T16:04:14Z) - Self-Supervised Masked Convolutional Transformer Block for Anomaly
Detection [122.4894940892536]
We present a novel self-supervised masked convolutional transformer block (SSMCTB) that comprises the reconstruction-based functionality at a core architectural level.
In this work, we extend our previous self-supervised predictive convolutional attentive block (SSPCAB) with a 3D masked convolutional layer, a transformer for channel-wise attention, as well as a novel self-supervised objective based on Huber loss.
arXiv Detail & Related papers (2022-09-25T04:56:10Z) - Leveraging Log Instructions in Log-based Anomaly Detection [0.5949779668853554]
We propose a method for reliable and practical anomaly detection from system logs.
It overcomes the common disadvantage of related works by building an anomaly detection model with log instructions from the source code of 1000+ GitHub projects.
The proposed method, named ADLILog, combines the log instructions and the data from the system of interest (target system) to learn a deep neural network model.
arXiv Detail & Related papers (2022-07-07T10:22:10Z) - Learning Dependencies in Distributed Cloud Applications to Identify and
Localize Anomalies [58.88325379746632]
We present Arvalus and its variant D-Arvalus, a neural graph transformation method that models system components as nodes and their dependencies as edges to improve the identification and localization of anomalies.
Given a series of metric, our method predicts the most likely system state - either normal or an anomaly class - and performs localization when an anomaly is detected.
The evaluation shows the generally good prediction performance of Arvalus and reveals the advantage of D-Arvalus which incorporates information about system component dependencies.
arXiv Detail & Related papers (2021-03-09T06:34:05Z) - TadGAN: Time Series Anomaly Detection Using Generative Adversarial
Networks [73.01104041298031]
TadGAN is an unsupervised anomaly detection approach built on Generative Adversarial Networks (GANs)
To capture the temporal correlations of time series, we use LSTM Recurrent Neural Networks as base models for Generators and Critics.
To demonstrate the performance and generalizability of our approach, we test several anomaly scoring techniques and report the best-suited one.
arXiv Detail & Related papers (2020-09-16T15:52:04Z) - Contextual-Bandit Anomaly Detection for IoT Data in Distributed
Hierarchical Edge Computing [65.78881372074983]
IoT devices can hardly afford complex deep neural networks (DNN) models, and offloading anomaly detection tasks to the cloud incurs long delay.
We propose and build a demo for an adaptive anomaly detection approach for distributed hierarchical edge computing (HEC) systems.
We show that our proposed approach significantly reduces detection delay without sacrificing accuracy, as compared to offloading detection tasks to the cloud.
arXiv Detail & Related papers (2020-04-15T06:13:33Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.