From Concept Drift to Model Degradation: An Overview on
Performance-Aware Drift Detectors
- URL: http://arxiv.org/abs/2203.11070v1
- Date: Mon, 21 Mar 2022 15:48:13 GMT
- Title: From Concept Drift to Model Degradation: An Overview on
Performance-Aware Drift Detectors
- Authors: Firas Bayram, Bestoun S. Ahmed, Andreas Kassler
- Abstract summary: Changes in the system on which a predictive machine learning model has been trained may lead to performance degradation during the system's life cycle.
Different terms have been used in the literature to refer to the same type of concept drift and the same term for various types.
This lack of unified terminology is set out to create confusion on distinguishing between different concept drift variants.
- Score: 1.757501664210825
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The dynamicity of real-world systems poses a significant challenge to
deployed predictive machine learning (ML) models. Changes in the system on
which the ML model has been trained may lead to performance degradation during
the system's life cycle. Recent advances that study non-stationary environments
have mainly focused on identifying and addressing such changes caused by a
phenomenon called concept drift. Different terms have been used in the
literature to refer to the same type of concept drift and the same term for
various types. This lack of unified terminology is set out to create confusion
on distinguishing between different concept drift variants. In this paper, we
start by grouping concept drift types by their mathematical definitions and
survey the different terms used in the literature to build a consolidated
taxonomy of the field. We also review and classify performance-based concept
drift detection methods proposed in the last decade. These methods utilize the
predictive model's performance degradation to signal substantial changes in the
systems. The classification is outlined in a hierarchical diagram to provide an
orderly navigation between the methods. We present a comprehensive analysis of
the main attributes and strategies for tracking and evaluating the model's
performance in the predictive system. The paper concludes by discussing open
research challenges and possible research directions.
Related papers
- Out-of-Distribution Detection on Graphs: A Survey [58.47395497985277]
Graph out-of-distribution (GOOD) detection focuses on identifying graph data that deviates from the distribution seen during training.
We categorize existing methods into four types: enhancement-based, reconstruction-based, information propagation-based, and classification-based approaches.
We discuss practical applications and theoretical foundations, highlighting the unique challenges posed by graph data.
arXiv Detail & Related papers (2025-02-12T04:07:12Z) - Analyzing Fine-tuning Representation Shift for Multimodal LLMs Steering alignment [53.90425382758605]
We show how fine-tuning alters the internal structure of a model to specialize in new multimodal tasks.
Our work sheds light on how multimodal representations evolve through fine-tuning and offers a new perspective for interpreting model adaptation in multimodal tasks.
arXiv Detail & Related papers (2025-01-06T13:37:13Z) - Sparse autoencoders reveal selective remapping of visual concepts during adaptation [54.82630842681845]
Adapting foundation models for specific purposes has become a standard approach to build machine learning systems.
We develop a new Sparse Autoencoder (SAE) for the CLIP vision transformer, named PatchSAE, to extract interpretable concepts.
arXiv Detail & Related papers (2024-12-06T18:59:51Z) - A Neighbor-Searching Discrepancy-based Drift Detection Scheme for Learning Evolving Data [40.00357483768265]
This work presents a novel real concept drift detection method based on Neighbor-Searching Discrepancy.
The proposed method is able to detect real concept drift with high accuracy while ignoring virtual drift.
It can also indicate the direction of the classification boundary change by identifying the invasion or retreat of a certain class.
arXiv Detail & Related papers (2024-05-23T04:03:36Z) - Methods for Generating Drift in Text Streams [49.3179290313959]
Concept drift is a frequent phenomenon in real-world datasets and corresponds to changes in data distribution over time.
This paper provides four textual drift generation methods to ease the production of datasets with labeled drifts.
Results show that all methods have their performance degraded right after the drifts, and the incremental SVM is the fastest to run and recover the previous performance levels.
arXiv Detail & Related papers (2024-03-18T23:48:33Z) - A comprehensive analysis of concept drift locality in data streams [3.5897534810405403]
Concept drift must be detected for effective model adaptation to evolving data properties.
We present a novel categorization of concept drift based on its locality and scale.
We conduct a comparative assessment of 9 state-of-the-art drift detectors across diverse difficulties.
arXiv Detail & Related papers (2023-11-10T20:57:43Z) - Unsupervised Unlearning of Concept Drift with Autoencoders [5.41354952642957]
Concept drift refers to a change in the data distribution affecting the data stream of future samples.
This paper proposes an unsupervised and model-agnostic concept drift adaptation method at the global level.
arXiv Detail & Related papers (2022-11-23T14:52:49Z) - Towards Robust and Adaptive Motion Forecasting: A Causal Representation
Perspective [72.55093886515824]
We introduce a causal formalism of motion forecasting, which casts the problem as a dynamic process with three groups of latent variables.
We devise a modular architecture that factorizes the representations of invariant mechanisms and style confounders to approximate a causal graph.
Experiment results on synthetic and real datasets show that our three proposed components significantly improve the robustness and reusability of the learned motion representations.
arXiv Detail & Related papers (2021-11-29T18:59:09Z) - Beyond Trivial Counterfactual Explanations with Diverse Valuable
Explanations [64.85696493596821]
In computer vision applications, generative counterfactual methods indicate how to perturb a model's input to change its prediction.
We propose a counterfactual method that learns a perturbation in a disentangled latent space that is constrained using a diversity-enforcing loss.
Our model improves the success rate of producing high-quality valuable explanations when compared to previous state-of-the-art methods.
arXiv Detail & Related papers (2021-03-18T12:57:34Z) - Drift Estimation with Graphical Models [0.0]
We make use of graphical models to elicit the visible structure of the data and we infer from there changes in the hidden context.
The paper evaluate the method with real world data on the Australian Electric market.
arXiv Detail & Related papers (2021-02-02T12:24:34Z) - Switching Scheme: A Novel Approach for Handling Incremental Concept
Drift in Real-World Data Sets [0.0]
Concept drifts can severely affect the prediction performance of a machine learning system.
In this work, we analyze the effects of concept drifts in the context of a real-world data set.
We introduce the switching scheme which combines the two principles of retraining and updating of a machine learning model.
arXiv Detail & Related papers (2020-11-05T10:16:54Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.