Neural network-based CUSUM for online change-point detection
- URL: http://arxiv.org/abs/2210.17312v6
- Date: Sat, 9 Mar 2024 18:47:56 GMT
- Title: Neural network-based CUSUM for online change-point detection
- Authors: Tingnan Gong, Junghwan Lee, Xiuyuan Cheng, Yao Xie
- Abstract summary: We introduce a neural network CUSUM (NN-CUSUM) for online change-point detection.
We present a general theoretical condition when the trained neural networks can perform change-point detection.
The strong performance of NN-CUSUM is demonstrated in detecting change-point in high-dimensional data.
- Score: 17.098858682219866
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: Change-point detection, detecting an abrupt change in the data distribution
from sequential data, is a fundamental problem in statistics and machine
learning. CUSUM is a popular statistical method for online change-point
detection due to its efficiency from recursive computation and constant memory
requirement, and it enjoys statistical optimality. CUSUM requires knowing the
precise pre- and post-change distribution. However, post-change distribution is
usually unknown a priori since it represents anomaly and novelty. Classic CUSUM
can perform poorly when there is a model mismatch with actual data. While
likelihood ratio-based methods encounter challenges facing high dimensional
data, neural networks have become an emerging tool for change-point detection
with computational efficiency and scalability. In this paper, we introduce a
neural network CUSUM (NN-CUSUM) for online change-point detection. We also
present a general theoretical condition when the trained neural networks can
perform change-point detection and what losses can achieve our goal. We further
extend our analysis by combining it with the Neural Tangent Kernel theory to
establish learning guarantees for the standard performance metrics, including
the average run length (ARL) and expected detection delay (EDD). The strong
performance of NN-CUSUM is demonstrated in detecting change-point in
high-dimensional data using both synthetic and real-world data.
Related papers
- NIDS Neural Networks Using Sliding Time Window Data Processing with Trainable Activations and its Generalization Capability [0.0]
This paper presents neural networks for network intrusion detection systems (NIDS) that operate on flow data preprocessed with a time window.
It requires only eleven features which do not rely on deep packet inspection and can be found in most NIDS datasets and easily obtained from conventional flow collectors.
The reported training accuracy exceeds 99% for the proposed method with as little as twenty neural network input features.
arXiv Detail & Related papers (2024-10-24T11:36:19Z) - Anomaly Detection in Power Grids via Context-Agnostic Learning [4.865842426618145]
Given time-series measurement values coming from a fixed set of sensors on the grid, can we identify anomalies in the network topology or measurement data?
Recent data-driven ML techniques have shown promise by using a combination of current and historical data for anomaly detection.
We propose a novel context-aware anomaly detection algorithm, GridCAL, that considers the effect of regular topology and load/generation changes.
arXiv Detail & Related papers (2024-04-11T16:37:01Z) - An Evaluation of Real-time Adaptive Sampling Change Point Detection Algorithm using KCUSUM [4.610597418629838]
We introduce the Kernel-based Cumulative Sum (KCUSUM) algorithm, a non-parametric extension of the traditional Cumulative Sum (CUSUM) method.
KCUSUM splits itself by comparing incoming samples directly with reference samples and computes a statistic grounded in the Maximum Mean Discrepancy (MMD) non-parametric framework.
We discuss real-world use cases from scientific simulations such as NWChem CODAR and protein folding data, demonstrating KCUSUM's practical effectiveness in online change point detection.
arXiv Detail & Related papers (2024-02-15T19:45:24Z) - Assessing Neural Network Representations During Training Using
Noise-Resilient Diffusion Spectral Entropy [55.014926694758195]
Entropy and mutual information in neural networks provide rich information on the learning process.
We leverage data geometry to access the underlying manifold and reliably compute these information-theoretic measures.
We show that they form noise-resistant measures of intrinsic dimensionality and relationship strength in high-dimensional simulated data.
arXiv Detail & Related papers (2023-12-04T01:32:42Z) - Automatic Change-Point Detection in Time Series via Deep Learning [8.43086628139493]
We show how to automatically generate new offline detection methods based on training a neural network.
We present theory that quantifies the error rate for such an approach, and how it depends on the amount of training data.
Our method also shows strong results in detecting and localising changes in activity based on accelerometer data.
arXiv Detail & Related papers (2022-11-07T20:59:14Z) - Convolutional generative adversarial imputation networks for
spatio-temporal missing data in storm surge simulations [86.5302150777089]
Generative Adversarial Imputation Nets (GANs) and GAN-based techniques have attracted attention as unsupervised machine learning methods.
We name our proposed method as Con Conval Generative Adversarial Imputation Nets (Conv-GAIN)
arXiv Detail & Related papers (2021-11-03T03:50:48Z) - Mitigating Performance Saturation in Neural Marked Point Processes:
Architectures and Loss Functions [50.674773358075015]
We propose a simple graph-based network structure called GCHP, which utilizes only graph convolutional layers.
We show that GCHP can significantly reduce training time and the likelihood ratio loss with interarrival time probability assumptions can greatly improve the model performance.
arXiv Detail & Related papers (2021-07-07T16:59:14Z) - TELESTO: A Graph Neural Network Model for Anomaly Classification in
Cloud Services [77.454688257702]
Machine learning (ML) and artificial intelligence (AI) are applied on IT system operation and maintenance.
One direction aims at the recognition of re-occurring anomaly types to enable remediation automation.
We propose a method that is invariant to dimensionality changes of given data.
arXiv Detail & Related papers (2021-02-25T14:24:49Z) - Improving Uncertainty Calibration via Prior Augmented Data [56.88185136509654]
Neural networks have proven successful at learning from complex data distributions by acting as universal function approximators.
They are often overconfident in their predictions, which leads to inaccurate and miscalibrated probabilistic predictions.
We propose a solution by seeking out regions of feature space where the model is unjustifiably overconfident, and conditionally raising the entropy of those predictions towards that of the prior distribution of the labels.
arXiv Detail & Related papers (2021-02-22T07:02:37Z) - Sequential change-point detection for mutually exciting point processes
over networks [11.672651073865538]
We present a new CUSUM procedure for sequentially detecting change-point in the self and mutual exciting processes, a.k.a. Hawkes networks.
We show that the proposed CUSUM method achieves better performance than existing methods.
arXiv Detail & Related papers (2021-02-10T20:20:06Z) - Unlabelled Data Improves Bayesian Uncertainty Calibration under
Covariate Shift [100.52588638477862]
We develop an approximate Bayesian inference scheme based on posterior regularisation.
We demonstrate the utility of our method in the context of transferring prognostic models of prostate cancer across globally diverse populations.
arXiv Detail & Related papers (2020-06-26T13:50:19Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.