Learning under Concept Drift: A Review
- URL: http://arxiv.org/abs/2004.05785v1
- Date: Mon, 13 Apr 2020 06:29:56 GMT
- Title: Learning under Concept Drift: A Review
- Authors: Jie Lu, Anjin Liu, Fan Dong, Feng Gu, Joao Gama, Guangquan Zhang
- Abstract summary: Concept drift describes unforeseeable changes in the underlying distribution of streaming data over time.
Data analysis has revealed that machine learning in a concept drift environment will result in poor learning results if the drift is not addressed.
This paper reviews over 130 high quality publications in concept drift related research areas.
It analyzes up-to-date developments in methodologies and techniques, and establishes a framework of learning under concept drift.
- Score: 33.941057801788865
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Concept drift describes unforeseeable changes in the underlying distribution
of streaming data over time. Concept drift research involves the development of
methodologies and techniques for drift detection, understanding and adaptation.
Data analysis has revealed that machine learning in a concept drift environment
will result in poor learning results if the drift is not addressed. To help
researchers identify which research topics are significant and how to apply
related techniques in data analysis tasks, it is necessary that a high quality,
instructive review of current research developments and trends in the concept
drift field is conducted. In addition, due to the rapid development of concept
drift in recent years, the methodologies of learning under concept drift have
become noticeably systematic, unveiling a framework which has not been
mentioned in literature. This paper reviews over 130 high quality publications
in concept drift related research areas, analyzes up-to-date developments in
methodologies and techniques, and establishes a framework of learning under
concept drift including three main components: concept drift detection, concept
drift understanding, and concept drift adaptation. This paper lists and
discusses 10 popular synthetic datasets and 14 publicly available benchmark
datasets used for evaluating the performance of learning algorithms aiming at
handling concept drift. Also, concept drift related research directions are
covered and discussed. By providing state-of-the-art knowledge, this survey
will directly support researchers in their understanding of research
developments in the field of learning under concept drift.
Related papers
- Continuous Evolution Pool: Taming Recurring Concept Drift in Online Time Series Forecasting [58.448663215248565]
Continuous Evolution Pool (CEP) is a pooling mechanism that stores different instances of forecasters for different concepts.<n>CEP effectively retains the knowledge of different concepts.<n>In the scenario of online forecasting with recurring concepts, CEP significantly enhances the prediction results.
arXiv Detail & Related papers (2025-05-28T03:27:49Z) - Identifying Predictions That Influence the Future: Detecting Performative Concept Drift in Data Streams [0.0]
We introduce a novel type of drift detection task, aimed at identifying potential performative concept drift in data streams.
We propose a first such drift detection approach, called CheckerBoard Performative Drift Detection (CB-PDD)
Results are positive with CB-PDD showing high efficacy, low false detection rates, resilience to intrinsic drift, and an ability to effectively detect performative drift in semi-synthetic datasets.
arXiv Detail & Related papers (2024-12-13T20:45:18Z) - Methods for Generating Drift in Text Streams [49.3179290313959]
Concept drift is a frequent phenomenon in real-world datasets and corresponds to changes in data distribution over time.
This paper provides four textual drift generation methods to ease the production of datasets with labeled drifts.
Results show that all methods have their performance degraded right after the drifts, and the incremental SVM is the fastest to run and recover the previous performance levels.
arXiv Detail & Related papers (2024-03-18T23:48:33Z) - Concept Drift Adaptation in Text Stream Mining Settings: A Systematic Review [46.543216927386005]
This study presents a systematic literature review regarding concept drift adaptation in text stream scenarios.
We selected 48 papers published between 2018 and August 2024 to unravel aspects such as text drift categories, detection types, model update mechanisms, stream mining tasks addressed, and text representation methods and their update mechanisms.
arXiv Detail & Related papers (2023-12-05T17:15:16Z) - A comprehensive analysis of concept drift locality in data streams [3.5897534810405403]
Concept drift must be detected for effective model adaptation to evolving data properties.
We present a novel categorization of concept drift based on its locality and scale.
We conduct a comparative assessment of 9 state-of-the-art drift detectors across diverse difficulties.
arXiv Detail & Related papers (2023-11-10T20:57:43Z) - Model Based Explanations of Concept Drift [8.686667049158476]
Concept drift refers to the phenomenon that the distribution generating the observed data changes over time.
If drift is present, machine learning models can become inaccurate and need adjustment.
We present a novel technology characterizing concept drift in terms of the characteristic change of spatial features.
arXiv Detail & Related papers (2023-03-16T14:03:56Z) - Deep Learning Schema-based Event Extraction: Literature Review and
Current Trends [60.29289298349322]
Event extraction technology based on deep learning has become a research hotspot.
This paper fills the gap by reviewing the state-of-the-art approaches, focusing on deep learning-based models.
arXiv Detail & Related papers (2021-07-05T16:32:45Z) - Concept drift detection and adaptation for federated and continual
learning [55.41644538483948]
Smart devices can collect vast amounts of data from their environment.
This data is suitable for training machine learning models, which can significantly improve their behavior.
In this work, we present a new method, called Concept-Drift-Aware Federated Averaging.
arXiv Detail & Related papers (2021-05-27T17:01:58Z) - Automatic Learning to Detect Concept Drift [40.69280758487987]
We propose Meta-ADD, a novel framework that learns to classify concept drift by tracking the changed pattern of error rates.
Specifically, in the training phase, we extract meta-features based on the error rates of various concept drift, after which a meta-detector is developed via prototypical neural network.
In the detection phase, the learned meta-detector is fine-tuned to adapt to the corresponding data stream via stream-based active learning.
arXiv Detail & Related papers (2021-05-04T11:10:39Z) - STUDD: A Student-Teacher Method for Unsupervised Concept Drift Detection [10.326887191803275]
We propose a novel approach to concept drift detection based on a student-teacher learning paradigm.
At run-time, our approach is to use the teacher for predicting new instances and monitoring the mimicking loss of the student for concept drift detection.
In a set of experiments using 19 data streams, we show that the proposed approach can detect concept drift and present a competitive behaviour.
arXiv Detail & Related papers (2021-03-01T10:51:09Z) - Video Summarization Using Deep Neural Networks: A Survey [72.98424352264904]
Video summarization technologies aim to create a concise and complete synopsis by selecting the most informative parts of the video content.
This work focuses on the recent advances in the area and provides a comprehensive survey of the existing deep-learning-based methods for generic video summarization.
arXiv Detail & Related papers (2021-01-15T11:41:29Z) - Counterfactual Explanations of Concept Drift [11.53362411363005]
concept drift refers to the phenomenon that the distribution, which is underlying the observed data, changes over time.
We present a novel technology, which characterizes concept drift in terms of the characteristic change of spatial features represented by typical examples.
arXiv Detail & Related papers (2020-06-23T08:27:57Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.