Learning Graph Structures with Transformer for Multivariate Time Series
Anomaly Detection in IoT
- URL: http://arxiv.org/abs/2104.03466v1
- Date: Thu, 8 Apr 2021 01:45:28 GMT
- Title: Learning Graph Structures with Transformer for Multivariate Time Series
Anomaly Detection in IoT
- Authors: Zekai Chen, Dingshuo Chen, Zixuan Yuan, Xiuzhen Cheng, Xiao Zhang
- Abstract summary: This work proposed a novel framework, namely GTA, for multivariate time series anomaly detection by automatically learning a graph structure followed by the graph convolution.
We also devised a novel graph convolution named Influence propagation convolution to model the anomaly information flow between graph nodes.
The experiments on four public anomaly detection benchmarks further demonstrate our approach's superiority over other state-of-the-arts.
- Score: 11.480824844205864
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Many real-world IoT systems comprising various internet-connected sensory
devices generate substantial amounts of multivariate time series data.
Meanwhile, those critical IoT infrastructures, such as smart power grids and
water distribution networks, are often targets of cyber-attacks, making anomaly
detection of high research value. However, considering the complex topological
and nonlinear dependencies that are initially unknown among sensors, modeling
such relatedness is inevitable for any efficient and accurate anomaly detection
system. Additionally, due to multivariate time series' temporal dependency and
stochasticity, their anomaly detection remains a big challenge. This work
proposed a novel framework, namely GTA, for multivariate time series anomaly
detection by automatically learning a graph structure followed by the graph
convolution and modeling the temporal dependency through a Transformer-based
architecture. The core idea of learning graph structure is called the
connection learning policy based on the Gumbel-softmax sampling strategy to
learn bi-directed associations among sensors directly. We also devised a novel
graph convolution named Influence Propagation convolution to model the anomaly
information flow between graph nodes. Moreover, we proposed a multi-branch
attention mechanism to substitute for original multi-head self-attention to
overcome the quadratic complexity challenge. The extensive experiments on four
public anomaly detection benchmarks further demonstrate our approach's
superiority over other state-of-the-arts.
Related papers
- Hypergraph-based multi-scale spatio-temporal graph convolution network for Time-Series anomaly detection [8.878898677348086]
Multi-dimensional time series anomaly detection technology plays an important role in many fields including aerospace, water treatment, cloud service providers, etc.
It is becoming increasingly challenging to perform effective and accurate anomaly detection in high-dimensional and complex data sets.
We propose a hypergraph basedtemporal graph convolutional network model STGCN_Hyper, which explicitly captures high-order, multi-hop correlations between multiple variables.
Our model can flexibly learn the multi-scale time series features in the data and the dependencies between features, and outperforms most existing baseline models in terms of precision, recall, F1-score on anomaly detection
arXiv Detail & Related papers (2024-10-29T17:19:18Z) - Multivariate Time-Series Anomaly Detection based on Enhancing Graph Attention Networks with Topological Analysis [31.43159668073136]
Unsupervised anomaly detection in time series is essential in industrial applications, as it significantly reduces the need for manual intervention.
Traditional methods use Graph Neural Networks (GNNs) or Transformers to analyze spatial while RNNs to model temporal dependencies.
This paper introduces a novel temporal model built on an enhanced Graph Attention Network (GAT) for multivariate time series anomaly detection called TopoGDN.
arXiv Detail & Related papers (2024-08-23T14:06:30Z) - Graph Spatiotemporal Process for Multivariate Time Series Anomaly
Detection with Missing Values [67.76168547245237]
We introduce a novel framework called GST-Pro, which utilizes a graphtemporal process and anomaly scorer to detect anomalies.
Our experimental results show that the GST-Pro method can effectively detect anomalies in time series data and outperforms state-of-the-art methods.
arXiv Detail & Related papers (2024-01-11T10:10:16Z) - Correlation-aware Spatial-Temporal Graph Learning for Multivariate
Time-series Anomaly Detection [67.60791405198063]
We propose a correlation-aware spatial-temporal graph learning (termed CST-GL) for time series anomaly detection.
CST-GL explicitly captures the pairwise correlations via a multivariate time series correlation learning module.
A novel anomaly scoring component is further integrated into CST-GL to estimate the degree of an anomaly in a purely unsupervised manner.
arXiv Detail & Related papers (2023-07-17T11:04:27Z) - Coupled Attention Networks for Multivariate Time Series Anomaly
Detection [10.620044922371177]
We propose a coupled attention-based neural network framework (CAN) for anomaly detection in multivariate time series data.
To capture inter-sensor relationships and temporal dependencies, a convolutional neural network based on the global-local graph is integrated with a temporal self-attention module.
arXiv Detail & Related papers (2023-06-12T13:42:56Z) - Causality-Based Multivariate Time Series Anomaly Detection [63.799474860969156]
We formulate the anomaly detection problem from a causal perspective and view anomalies as instances that do not follow the regular causal mechanism to generate the multivariate data.
We then propose a causality-based anomaly detection approach, which first learns the causal structure from data and then infers whether an instance is an anomaly relative to the local causal mechanism.
We evaluate our approach with both simulated and public datasets as well as a case study on real-world AIOps applications.
arXiv Detail & Related papers (2022-06-30T06:00:13Z) - Unsupervised Deep Anomaly Detection for Multi-Sensor Time-Series Signals [10.866594993485226]
We propose a novel deep learning-based anomaly detection algorithm called Deep Convolutional Autoencoding Memory network (CAE-M)
We first build a Deep Convolutional Autoencoder to characterize spatial dependence of multi-sensor data with a Maximum Mean Discrepancy (MMD)
Then, we construct a Memory Network consisting of linear (Autoregressive Model) and non-linear predictions (Bigressive LSTM with Attention) to capture temporal dependence from time-series data.
arXiv Detail & Related papers (2021-07-27T06:48:20Z) - HIFI: Anomaly Detection for Multivariate Time Series with High-order
Feature Interactions [7.016615391171876]
HIFI builds multivariate feature interaction graph automatically and uses the graph convolutional neural network to achieve high-order feature interactions.
Experiments on three publicly available datasets demonstrate the superiority of our framework compared with state-of-the-art approaches.
arXiv Detail & Related papers (2021-06-11T04:57:03Z) - Consistency of mechanistic causal discovery in continuous-time using
Neural ODEs [85.7910042199734]
We consider causal discovery in continuous-time for the study of dynamical systems.
We propose a causal discovery algorithm based on penalized Neural ODEs.
arXiv Detail & Related papers (2021-05-06T08:48:02Z) - TELESTO: A Graph Neural Network Model for Anomaly Classification in
Cloud Services [77.454688257702]
Machine learning (ML) and artificial intelligence (AI) are applied on IT system operation and maintenance.
One direction aims at the recognition of re-occurring anomaly types to enable remediation automation.
We propose a method that is invariant to dimensionality changes of given data.
arXiv Detail & Related papers (2021-02-25T14:24:49Z) - TadGAN: Time Series Anomaly Detection Using Generative Adversarial
Networks [73.01104041298031]
TadGAN is an unsupervised anomaly detection approach built on Generative Adversarial Networks (GANs)
To capture the temporal correlations of time series, we use LSTM Recurrent Neural Networks as base models for Generators and Critics.
To demonstrate the performance and generalizability of our approach, we test several anomaly scoring techniques and report the best-suited one.
arXiv Detail & Related papers (2020-09-16T15:52:04Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.