Stochastic Voronoi Ensembles for Anomaly Detection
- URL: http://arxiv.org/abs/2601.03664v1
- Date: Wed, 07 Jan 2026 07:37:38 GMT
- Title: Stochastic Voronoi Ensembles for Anomaly Detection
- Authors: Yang Cao,
- Abstract summary: We propose SVEAD (Stochastic Voronoi Ensembles Anomaly Detector), which constructs ensemble random Voronoi diagrams and scores points by normalized cell-relative distances weighted by local scale.<n> Experiments on 45 datasets demonstrate that SVEAD outperforms 12 state-of-the-art approaches.
- Score: 6.508284558109273
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: Anomaly detection aims to identify data instances that deviate significantly from majority of data, which has been widely used in fraud detection, network security, and industrial quality control. Existing methods struggle with datasets exhibiting varying local densities: distance-based methods miss local anomalies, while density-based approaches require careful parameter selection and incur quadratic time complexity. We observe that local anomalies, though indistinguishable under global analysis, become conspicuous when the data space is decomposed into restricted regions and each region is examined independently. Leveraging this geometric insight, we propose SVEAD (Stochastic Voronoi Ensembles Anomaly Detector), which constructs ensemble random Voronoi diagrams and scores points by normalized cell-relative distances weighted by local scale. The proposed method achieves linear time complexity and constant space complexity. Experiments on 45 datasets demonstrate that SVEAD outperforms 12 state-of-the-art approaches.
Related papers
- Finding Time Series Anomalies using Granular-ball Vector Data Description [29.555765407206408]
Granular-ball One-Class Network (GBOC) is a novel approach based on a data-adaptive representation called Granular-ball Vector Data Description (GVDD)<n>GVDD partitions the latent space into compact, high-density regions represented by granular-balls.<n>During inference, anomaly scores are computed based on the distance to the nearest granular-ball.
arXiv Detail & Related papers (2025-11-15T10:24:53Z) - Isolation-based Spherical Ensemble Representations for Anomaly Detection [60.989157958972356]
Anomaly detection is a critical task in data mining and management with applications spanning fraud detection, network security, and log monitoring.<n>Existing unsupervised anomaly detection methods face fundamental challenges including conflicting distributional assumptions, computational inefficiency, and difficulty handling different anomaly types.<n>We propose ISER (Isolation-based Spherical Ensemble Representations) that extends existing isolation-based methods by using hypersphere radii as proxies for local density characteristics while maintaining linear time and constant space complexity.
arXiv Detail & Related papers (2025-10-15T09:00:05Z) - Robust Spatiotemporally Contiguous Anomaly Detection Using Tensor Decomposition [14.212807007278185]
We introduce an unsupervised tensor-based anomaly detection method that simultaneously considers the sparse and lowtemporally smooth nature of anomalies.<n>The proposed framework is evaluated on both synthetic and real data.
arXiv Detail & Related papers (2025-10-01T03:25:44Z) - Detecting Localized Density Anomalies in Multivariate Data via Coin-Flip Statistics [38.24458888666912]
We introduce EagleEye, an anomaly detection method to compare two datasets.<n>Anomalies are detected by modelling, for each point, the ordered sequence of its neighbours' membership label.<n>We demonstrate its effectiveness through experiments on both synthetic and real-world datasets.
arXiv Detail & Related papers (2025-03-31T10:20:04Z) - A Dataset for Semantic Segmentation in the Presence of Unknowns [49.795683850385956]
Existing datasets allow evaluation of only knowns or unknowns - but not both.<n>We propose a novel anomaly segmentation dataset, ISSU, that features a diverse set of anomaly inputs from cluttered real-world environments.<n>The dataset is twice larger than existing anomaly segmentation datasets.
arXiv Detail & Related papers (2025-03-28T10:31:01Z) - PeFAD: A Parameter-Efficient Federated Framework for Time Series Anomaly Detection [51.20479454379662]
We propose a.
Federated Anomaly Detection framework named PeFAD with the increasing privacy concerns.
We conduct extensive evaluations on four real datasets, where PeFAD outperforms existing state-of-the-art baselines by up to 28.74%.
arXiv Detail & Related papers (2024-06-04T13:51:08Z) - Learning to Bound Counterfactual Inference in Structural Causal Models
from Observational and Randomised Data [64.96984404868411]
We derive a likelihood characterisation for the overall data that leads us to extend a previous EM-based algorithm.
The new algorithm learns to approximate the (unidentifiability) region of model parameters from such mixed data sources.
It delivers interval approximations to counterfactual results, which collapse to points in the identifiable case.
arXiv Detail & Related papers (2022-12-06T12:42:11Z) - Causality-Based Multivariate Time Series Anomaly Detection [63.799474860969156]
We formulate the anomaly detection problem from a causal perspective and view anomalies as instances that do not follow the regular causal mechanism to generate the multivariate data.
We then propose a causality-based anomaly detection approach, which first learns the causal structure from data and then infers whether an instance is an anomaly relative to the local causal mechanism.
We evaluate our approach with both simulated and public datasets as well as a case study on real-world AIOps applications.
arXiv Detail & Related papers (2022-06-30T06:00:13Z) - Generative Anomaly Detection for Time Series Datasets [1.7954335118363964]
Traffic congestion anomaly detection is of paramount importance in intelligent traffic systems.
We propose a data-driven generative approach that can perform tractable density estimation for detecting traffic anomalies.
Our approach significantly outperforms several state-of-the-art congestion anomaly detection and diagnosis methods in terms of Recall and F1-Score.
arXiv Detail & Related papers (2022-06-28T17:08:47Z) - TadGAN: Time Series Anomaly Detection Using Generative Adversarial
Networks [73.01104041298031]
TadGAN is an unsupervised anomaly detection approach built on Generative Adversarial Networks (GANs)
To capture the temporal correlations of time series, we use LSTM Recurrent Neural Networks as base models for Generators and Critics.
To demonstrate the performance and generalizability of our approach, we test several anomaly scoring techniques and report the best-suited one.
arXiv Detail & Related papers (2020-09-16T15:52:04Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.