CURIE: A Cellular Automaton for Concept Drift Detection
- URL: http://arxiv.org/abs/2009.09677v1
- Date: Mon, 21 Sep 2020 08:28:21 GMT
- Title: CURIE: A Cellular Automaton for Concept Drift Detection
- Authors: Jesus L. Lobo, Javier Del Ser, Eneko Osaba, Albert Bifet, Francisco
Herrera
- Abstract summary: Data stream mining extracts information from large quantities of data flowing fast and continuously.
They are affected by changes in the data distribution, giving rise to a phenomenon referred to as concept drift.
In this work we propose CU RIE, a drift detector relying on cellular automata.
- Score: 25.314158724575915
- License: http://creativecommons.org/licenses/by-sa/4.0/
- Abstract: Data stream mining extracts information from large quantities of data flowing
fast and continuously (data streams). They are usually affected by changes in
the data distribution, giving rise to a phenomenon referred to as concept
drift. Thus, learning models must detect and adapt to such changes, so as to
exhibit a good predictive performance after a drift has occurred. In this
regard, the development of effective drift detection algorithms becomes a key
factor in data stream mining. In this work we propose CU RIE, a drift detector
relying on cellular automata. Specifically, in CU RIE the distribution of the
data stream is represented in the grid of a cellular automata, whose
neighborhood rule can then be utilized to detect possible distribution changes
over the stream. Computer simulations are presented and discussed to show that
CU RIE, when hybridized with other base learners, renders a competitive
behavior in terms of detection metrics and classification accuracy. CU RIE is
compared with well-established drift detectors over synthetic datasets with
varying drift characteristics.
Related papers
- SUDS: A Strategy for Unsupervised Drift Sampling [0.5437605013181142]
Supervised machine learning encounters concept drift, where the data distribution changes over time, degrading performance.
We present the Strategy for Drift Sampling (SUDS), a novel method that selects homogeneous samples for retraining using existing drift detection algorithms.
Our results demonstrate the efficacy of SUDS in optimizing labeled data use in dynamic environments.
arXiv Detail & Related papers (2024-11-05T10:55:29Z) - Drift Detection: Introducing Gaussian Split Detector [1.9430846345184412]
We introduce Gaussian Split Detector (GSD) a novel drift detector that works in batch mode.
GSD is designed to work when the data follow a normal distribution and makes use of Gaussian mixture models to monitor changes in the decision boundary.
We show that our detector outperforms the state of the art in detecting real drift and in ignoring virtual drift which is key to avoid false alarms.
arXiv Detail & Related papers (2024-05-14T14:15:31Z) - Thwarting Cybersecurity Attacks with Explainable Concept Drift [10.517955982799553]
Cyber-security attacks pose a significant threat to the operation of autonomous systems.
This paper proposes a Feature Drift Explanation (FDE) module to identify the drifting features.
FDE successfully identifies 85.77 % of drifting features and showcases its utility in the DL adaptation method.
arXiv Detail & Related papers (2024-03-18T20:20:00Z) - Unsupervised Domain Adaptation for Self-Driving from Past Traversal
Features [69.47588461101925]
We propose a method to adapt 3D object detectors to new driving environments.
Our approach enhances LiDAR-based detection models using spatial quantized historical features.
Experiments on real-world datasets demonstrate significant improvements.
arXiv Detail & Related papers (2023-09-21T15:00:31Z) - A Robust and Explainable Data-Driven Anomaly Detection Approach For
Power Electronics [56.86150790999639]
We present two anomaly detection and classification approaches, namely the Matrix Profile algorithm and anomaly transformer.
The Matrix Profile algorithm is shown to be well suited as a generalizable approach for detecting real-time anomalies in streaming time-series data.
A series of custom filters is created and added to the detector to tune its sensitivity, recall, and detection accuracy.
arXiv Detail & Related papers (2022-09-23T06:09:35Z) - Convolutional generative adversarial imputation networks for
spatio-temporal missing data in storm surge simulations [86.5302150777089]
Generative Adversarial Imputation Nets (GANs) and GAN-based techniques have attracted attention as unsupervised machine learning methods.
We name our proposed method as Con Conval Generative Adversarial Imputation Nets (Conv-GAIN)
arXiv Detail & Related papers (2021-11-03T03:50:48Z) - DAE : Discriminatory Auto-Encoder for multivariate time-series anomaly
detection in air transportation [68.8204255655161]
We propose a novel anomaly detection model called Discriminatory Auto-Encoder (DAE)
It uses the baseline of a regular LSTM-based auto-encoder but with several decoders, each getting data of a specific flight phase.
Results show that the DAE achieves better results in both accuracy and speed of detection.
arXiv Detail & Related papers (2021-09-08T14:07:55Z) - Detecting Concept Drift With Neural Network Model Uncertainty [0.0]
Uncertainty Drift Detection (UDD) is able to detect drifts without access to true labels.
In contrast to input data-based drift detection, our approach considers the effects of the current input data on the properties of the prediction model.
We show that UDD outperforms other state-of-the-art strategies on two synthetic as well as ten real-world data sets for both regression and classification tasks.
arXiv Detail & Related papers (2021-07-05T08:56:36Z) - Predicting traffic signals on transportation networks using
spatio-temporal correlations on graphs [56.48498624951417]
This paper proposes a traffic propagation model that merges multiple heat diffusion kernels into a data-driven prediction model to forecast traffic signals.
We optimize the model parameters using Bayesian inference to minimize the prediction errors and, consequently, determine the mixing ratio of the two approaches.
The proposed model demonstrates prediction accuracy comparable to that of the state-of-the-art deep neural networks with lower computational effort.
arXiv Detail & Related papers (2021-04-27T18:17:42Z) - Deep Transformer Networks for Time Series Classification: The NPP Safety
Case [59.20947681019466]
An advanced temporal neural network referred to as the Transformer is used within a supervised learning fashion to model the time-dependent NPP simulation data.
The Transformer can learn the characteristics of the sequential data and yield promising performance with approximately 99% classification accuracy on the testing dataset.
arXiv Detail & Related papers (2021-04-09T14:26:25Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.