A Systematic Literature Review on Multi-label Data Stream Classification
- URL: http://arxiv.org/abs/2508.17455v1
- Date: Sun, 24 Aug 2025 17:17:15 GMT
- Title: A Systematic Literature Review on Multi-label Data Stream Classification
- Authors: H. Freire-Oliveira, E. R. F. Paiva, J. Gama, L. Khan, R. Cerri,
- Abstract summary: This systematic literature review presents an in-depth analysis of multi-label data stream classification proposals.<n>We characterize the latest methods in the literature, providing a comprehensive overview, building a thorough hierarchy, and discussing how the approach approaches each problem.<n>We identify the main gaps and offer recommendations for future research directions in the field.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Classification in the context of multi-label data streams represents a challenge that has attracted significant attention due to its high real-world applicability. However, this task faces problems inherent to dynamic environments, such as the continuous arrival of data at high speed and volume, changes in the data distribution (concept drift), the emergence of new labels (concept evolution), and the latency in the arrival of ground truth labels. This systematic literature review presents an in-depth analysis of multi-label data stream classification proposals. We characterize the latest methods in the literature, providing a comprehensive overview, building a thorough hierarchy, and discussing how the proposals approach each problem. Furthermore, we discuss the adopted evaluation strategies and analyze the methods' asymptotic complexity and resource consumption. Finally, we identify the main gaps and offer recommendations for future research directions in the field.
Related papers
- Adapting Vision-Language Models Without Labels: A Comprehensive Survey [74.17944178027015]
Vision-Language Models (VLMs) have demonstrated remarkable generalization capabilities across a wide range of tasks.<n>Recent research has increasingly focused on unsupervised adaptation methods that do not rely on labeled data.<n>We propose a taxonomy based on the availability and nature of unlabeled visual data, categorizing existing approaches into four key paradigms.
arXiv Detail & Related papers (2025-08-07T16:27:37Z) - A Comprehensive Review on Hashtag Recommendation: From Traditional to Deep Learning and Beyond [0.37865171120254354]
Hashtags, as a fundamental categorization mechanism, play a pivotal role in enhancing content visibility and user engagement.<n>The development of accurate and robust hashtag recommendation systems remains a complex and evolving research challenge.<n>This review article conducts a systematic analysis of hashtag recommendation systems, examining recent advancements across several dimensions.
arXiv Detail & Related papers (2025-03-24T13:40:36Z) - Online Model-based Anomaly Detection in Multivariate Time Series: Taxonomy, Survey, Research Challenges and Future Directions [0.017476232824732776]
Time-series anomaly detection plays an important role in engineering processes.
This survey introduces a novel taxonomy where a distinction between online and offline, and training and inference is made.
It presents the most popular data sets and evaluation metrics used in the literature, as well as a detailed analysis.
arXiv Detail & Related papers (2024-08-07T13:01:10Z) - Embedding in Recommender Systems: A Survey [54.55152033023537]
This survey presents a comprehensive analysis of advances in recommender system embedding techniques.<n>In matrix-based scenarios, collaborative filtering generates embeddings that effectively model user-item preferences.<n>We introduce emerging approaches, including AutoML, hashing techniques, and quantization methods, to enhance performance.
arXiv Detail & Related papers (2023-10-28T06:31:06Z) - Tackling Diverse Minorities in Imbalanced Classification [80.78227787608714]
Imbalanced datasets are commonly observed in various real-world applications, presenting significant challenges in training classifiers.
We propose generating synthetic samples iteratively by mixing data samples from both minority and majority classes.
We demonstrate the effectiveness of our proposed framework through extensive experiments conducted on seven publicly available benchmark datasets.
arXiv Detail & Related papers (2023-08-28T18:48:34Z) - Annotating Ambiguous Images: General Annotation Strategy for High-Quality Data with Real-World Biomedical Validation [2.5817162127446154]
We introduce a novel strategy for generating high-quality labels in challenging datasets.
We validate our methodology through a rigorous real-world test case in the biomedical field.
arXiv Detail & Related papers (2023-06-21T11:35:37Z) - Variational Distillation for Multi-View Learning [104.17551354374821]
We design several variational information bottlenecks to exploit two key characteristics for multi-view representation learning.
Under rigorously theoretical guarantee, our approach enables IB to grasp the intrinsic correlation between observations and semantic labels.
arXiv Detail & Related papers (2022-06-20T03:09:46Z) - Deep Learning Schema-based Event Extraction: Literature Review and
Current Trends [60.29289298349322]
Event extraction technology based on deep learning has become a research hotspot.
This paper fills the gap by reviewing the state-of-the-art approaches, focusing on deep learning-based models.
arXiv Detail & Related papers (2021-07-05T16:32:45Z) - Through the Data Management Lens: Experimental Analysis and Evaluation
of Fair Classification [75.49600684537117]
Data management research is showing an increasing presence and interest in topics related to data and algorithmic fairness.
We contribute a broad analysis of 13 fair classification approaches and additional variants, over their correctness, fairness, efficiency, scalability, and stability.
Our analysis highlights novel insights on the impact of different metrics and high-level approach characteristics on different aspects of performance.
arXiv Detail & Related papers (2021-01-18T22:55:40Z) - Unsupervised Domain Adaptation in Semantic Segmentation: a Review [22.366638308792734]
The aim of this paper is to give an overview of the recent advancements in the Unsupervised Domain Adaptation (UDA) of deep networks for semantic segmentation.
arXiv Detail & Related papers (2020-05-21T20:10:38Z) - Challenges in Benchmarking Stream Learning Algorithms with Real-world
Data [2.861782696432711]
Streaming data are increasingly present in real-world applications such as sensor measurements, satellite data feed, stock market, and financial data.
The data stream mining community still faces some primary challenges and difficulties related to the comparison and evaluation of new proposals.
We propose a new public data repository for benchmarking stream algorithms with real-world data.
arXiv Detail & Related papers (2020-04-30T21:31:34Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.