PATE: Proximity-Aware Time series anomaly Evaluation
- URL: http://arxiv.org/abs/2405.12096v1
- Date: Mon, 20 May 2024 15:06:36 GMT
- Title: PATE: Proximity-Aware Time series anomaly Evaluation
- Authors: Ramin Ghorbani, Marcel J. T. Reinders, David M. J. Tax
- Abstract summary: Traditional performance metrics assume iid data and fail to capture the complex temporal dynamics and specific characteristics of time series anomalies.
We introduce Proximity-Aware Time series anomaly Evaluation (PATE), a novel evaluation metric that incorporates the temporal relationship between prediction and anomaly intervals.
Our experiments with synthetic and real-world datasets show the superiority of PATE in providing more sensible and accurate evaluations.
- Score: 3.0377067713090633
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Evaluating anomaly detection algorithms in time series data is critical as inaccuracies can lead to flawed decision-making in various domains where real-time analytics and data-driven strategies are essential. Traditional performance metrics assume iid data and fail to capture the complex temporal dynamics and specific characteristics of time series anomalies, such as early and delayed detections. We introduce Proximity-Aware Time series anomaly Evaluation (PATE), a novel evaluation metric that incorporates the temporal relationship between prediction and anomaly intervals. PATE uses proximity-based weighting considering buffer zones around anomaly intervals, enabling a more detailed and informed assessment of a detection. Using these weights, PATE computes a weighted version of the area under the Precision and Recall curve. Our experiments with synthetic and real-world datasets show the superiority of PATE in providing more sensible and accurate evaluations than other evaluation metrics. We also tested several state-of-the-art anomaly detectors across various benchmark datasets using the PATE evaluation scheme. The results show that a common metric like Point-Adjusted F1 Score fails to characterize the detection performances well, and that PATE is able to provide a more fair model comparison. By introducing PATE, we redefine the understanding of model efficacy that steers future studies toward developing more effective and accurate detection models.
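The proximity-based weighting with buffer zones described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' PATE implementation: the linear decay, the symmetric `buffer` parameter, the function names, and the simplified weighted precision and recall are all assumptions chosen for illustration. The sketch only shows the general idea that a detection close to an anomaly interval earns partial credit, and that the resulting scores can be integrated into an area under a precision-recall curve.

```python
import numpy as np


def proximity_weights(labels, buffer):
    """Weight 1.0 inside anomaly intervals, decaying linearly to 0 over
    `buffer` steps on each side (an illustrative choice, not PATE's)."""
    labels = np.asarray(labels, dtype=bool)
    idx = np.arange(len(labels))
    anomalous = idx[labels]
    if anomalous.size == 0:
        return np.zeros(len(labels))
    # Distance from every time step to the nearest anomalous step.
    dist = np.abs(idx[:, None] - anomalous[None, :]).min(axis=1)
    return np.clip(1.0 - dist / (buffer + 1), 0.0, 1.0)


def weighted_precision_recall(labels, preds, buffer):
    """Simplified proximity-weighted precision and recall (assumed form)."""
    labels = np.asarray(labels, dtype=bool)
    preds = np.asarray(preds, dtype=bool)
    w = proximity_weights(labels, buffer)
    tp = w[preds].sum()            # credit for detections in or near anomalies
    fp = (1.0 - w[preds]).sum()    # penalty grows with distance from anomalies
    fn = np.sum(labels & ~preds)   # anomalous steps that were missed
    precision = tp / (tp + fp) if preds.any() else 0.0
    recall = tp / (tp + fn) if (tp + fn) > 0 else 0.0
    return precision, recall


def weighted_auc_pr(labels, scores, buffer):
    """Step-wise area under the weighted precision-recall curve.
    Lowering the threshold only adds detections, so recall never decreases."""
    scores = np.asarray(scores, dtype=float)
    area, prev_recall = 0.0, 0.0
    for t in np.unique(scores)[::-1]:  # thresholds from high to low
        precision, recall = weighted_precision_recall(labels, scores >= t, buffer)
        area += (recall - prev_recall) * precision
        prev_recall = recall
    return area


# Hypothetical toy series: one anomaly interval at steps 3-4.
labels = [0, 0, 0, 1, 1, 0, 0, 0]
early = [0, 0, 1, 1, 0, 0, 0, 0]   # detection starts one step early
far = [1, 0, 0, 0, 0, 0, 0, 0]     # detection far from the anomaly
print(weighted_precision_recall(labels, early, buffer=2))
print(weighted_precision_recall(labels, far, buffer=2))
```

Under these assumptions, the early detection keeps most of its precision because it falls inside the buffer zone, while the distant detection receives no credit. A point-wise metric would treat both extra detections as identical false positives.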
Related papers
- Towards Unbiased Evaluation of Time-series Anomaly Detector [6.521243384420707]
Time series anomaly detection (TSAD) is an evolving area of research motivated by its critical applications.
In this work, we propose an alternative adjustment protocol called "Balanced point adjustment" (BA).
arXiv Detail & Related papers (2024-09-19T19:02:45Z)
- Graph Spatiotemporal Process for Multivariate Time Series Anomaly Detection with Missing Values [67.76168547245237]
We introduce a novel framework called GST-Pro, which utilizes a graph spatiotemporal process and an anomaly scorer to detect anomalies.
Our experimental results show that the GST-Pro method can effectively detect anomalies in time series data and outperforms state-of-the-art methods.
arXiv Detail & Related papers (2024-01-11T10:10:16Z)
- Score Matching-based Pseudolikelihood Estimation of Neural Marked Spatio-Temporal Point Process with Uncertainty Quantification [59.81904428056924]
We introduce SMASH: a Score MAtching-based pSeudolikeliHood estimator for learning marked STPPs with uncertainty quantification.
Specifically, our framework adopts a normalization-free objective by estimating the pseudolikelihood of marked STPPs through score-matching.
The superior performance of our proposed framework is demonstrated through extensive experiments in both event prediction and uncertainty quantification.
arXiv Detail & Related papers (2023-10-25T02:37:51Z)
- Adaptive Thresholding Heuristic for KPI Anomaly Detection [1.57731592348751]
A plethora of outlier detectors have been explored in the time series domain; however, in a business sense, not all outliers are anomalies of interest.
This article proposes an Adaptive Thresholding Heuristic (ATH) to dynamically adjust the detection threshold based on the local properties of the data distribution and adapt to changes in time series patterns.
Experimental results show that ATH is efficient, making it scalable for near-real-time anomaly detection, and that it is flexible across forecasters and outlier detectors.
arXiv Detail & Related papers (2023-08-21T06:45:28Z)
- PULL: Reactive Log Anomaly Detection Based On Iterative PU Learning [58.85063149619348]
We propose PULL, an iterative log analysis method for reactive anomaly detection based on estimated failure time windows.
Our evaluation shows that PULL consistently outperforms ten benchmark baselines across three different datasets.
arXiv Detail & Related papers (2023-01-25T16:34:43Z)
- Unsupervised Anomaly Detection in Time-series: An Extensive Evaluation and Analysis of State-of-the-art Methods [10.618572317896515]
Unsupervised anomaly detection in time-series has been extensively investigated in the literature.
This paper proposes an in-depth evaluation study of recent unsupervised anomaly detection techniques in time-series.
arXiv Detail & Related papers (2022-12-06T15:05:54Z)
- An Evaluation of Anomaly Detection and Diagnosis in Multivariate Time Series [7.675917669905486]
This paper presents a systematic and comprehensive evaluation of unsupervised and semi-supervised deep-learning-based methods for anomaly detection and diagnosis.
We vary the model and post-processing of model errors, through a grid of 10 models and 4 scoring functions, comparing these variants to state-of-the-art methods.
We find that the existing evaluation metrics either do not take events into account, or cannot distinguish between a good detector and trivial detectors.
arXiv Detail & Related papers (2021-09-23T15:14:24Z)
- Doing Great at Estimating CATE? On the Neglected Assumptions in Benchmark Comparisons of Treatment Effect Estimators [91.3755431537592]
We show that even in arguably the simplest setting, estimation under ignorability assumptions can be misleading.
We consider two popular machine learning benchmark datasets for evaluation of heterogeneous treatment effect estimators.
We highlight that the inherent characteristics of the benchmark datasets favor some algorithms over others.
arXiv Detail & Related papers (2021-07-28T13:21:27Z)
- Multi-Source Causal Inference Using Control Variates [81.57072928775509]
We propose a general algorithm to estimate causal effects from multiple data sources.
We show theoretically that this reduces the variance of the ATE estimate.
We apply this framework to inference from observational data under an outcome selection bias.
arXiv Detail & Related papers (2021-03-30T21:20:51Z)
- How Far Should We Look Back to Achieve Effective Real-Time Time-Series Anomaly Detection? [1.0437764544103274]
Anomaly detection is the process of identifying unexpected events or abnormalities in data.
RePAD (Real-time Proactive Anomaly Detection algorithm) is a generic approach with all above-mentioned features.
It is unclear how different amounts of historical data points affect the performance of RePAD.
arXiv Detail & Related papers (2021-02-12T14:51:05Z)
- TadGAN: Time Series Anomaly Detection Using Generative Adversarial Networks [73.01104041298031]
TadGAN is an unsupervised anomaly detection approach built on Generative Adversarial Networks (GANs).
To capture the temporal correlations of time series, we use LSTM Recurrent Neural Networks as base models for Generators and Critics.
To demonstrate the performance and generalizability of our approach, we test several anomaly scoring techniques and report the best-suited one.
arXiv Detail & Related papers (2020-09-16T15:52:04Z)