Adaptive Deep Forest for Online Learning from Drifting Data Streams
        - URL: http://arxiv.org/abs/2010.07340v1
- Date: Wed, 14 Oct 2020 18:24:17 GMT
- Title: Adaptive Deep Forest for Online Learning from Drifting Data Streams
- Authors: {\L}ukasz Korycki, Bartosz Krawczyk
- Abstract summary: Learning from data streams is among the most vital fields of contemporary data mining.
We propose Adaptive Deep Forest (ADF) - a natural combination of the successful tree-based streaming classifiers with deep forest.
The conducted experiments show that the deep forest approach can be effectively transformed into an online algorithm.
- Score: 15.49323098362628
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract:   Learning from data streams is among the most vital fields of contemporary
data mining. The online analysis of information coming from those potentially
unbounded data sources allows for designing reactive up-to-date models capable
of adjusting themselves to continuous flows of data. While a plethora of
shallow methods have been proposed for simpler low-dimensional streaming
problems, almost none of them addressed the issue of learning from complex
contextual data, such as images or texts. The former is represented mainly by
adaptive decision trees that have been proven to be very efficient in streaming
scenarios. The latter has been predominantly addressed by offline deep
learning. In this work, we attempt to bridge the gap between these two worlds
and propose Adaptive Deep Forest (ADF) - a natural combination of the
successful tree-based streaming classifiers with deep forest, which represents
an interesting alternative idea for learning from contextual data. The
conducted experiments show that the deep forest approach can be effectively
transformed into an online algorithm, forming a model that outperforms all
state-of-the-art shallow adaptive classifiers, especially for high-dimensional
complex streams.
 
      
        Related papers
        - SimpleDeepSearcher: Deep Information Seeking via Web-Powered Reasoning   Trajectory Synthesis [89.99161034065614]
 Retrieval-augmented generation (RAG) systems have advanced large language models (LLMs) in complex deep search scenarios.<n>Existing approaches face critical limitations that lack high-quality training trajectories and suffer from distributional mismatches.<n>This paper introduces SimpleDeepSearcher, a framework that bridges the gap through strategic data engineering rather than complex training paradigms.
 arXiv  Detail & Related papers  (2025-05-22T16:05:02Z)
- In-context learning of evolving data streams with tabular foundational   models [42.13420474990124]
 This work bridges advancements from both areas, highlighting how transformers' implicit meta-learning abilities, pre-training on drifting natural data, and reliance on context optimization directly address the core challenges of adaptive learning in dynamic environments.
 Exploring real-time model adaptation, this research demonstrates that TabPFN, coupled with a simple sliding memory strategy, consistently outperforms ensembles of Hoeffding trees across all non-stationary benchmarks.
 arXiv  Detail & Related papers  (2025-02-24T04:52:35Z)
- ALP: Action-Aware Embodied Learning for Perception [60.64801970249279]
 We introduce Action-Aware Embodied Learning for Perception (ALP)
ALP incorporates action information into representation learning through a combination of optimizing a reinforcement learning policy and an inverse dynamics prediction objective.
We show that ALP outperforms existing baselines in several downstream perception tasks.
 arXiv  Detail & Related papers  (2023-06-16T21:51:04Z)
- Deep networks for system identification: a Survey [56.34005280792013]
 System identification learns mathematical descriptions of dynamic systems from input-output data.
Main aim of the identified model is to predict new data from previous observations.
We discuss architectures commonly adopted in the literature, like feedforward, convolutional, and recurrent networks.
 arXiv  Detail & Related papers  (2023-01-30T12:38:31Z)
- On the challenges to learn from Natural Data Streams [6.602973237811197]
 In real-world contexts, sometimes data are available in form of Natural Data Streams.
This data organization represents an interesting and challenging scenario for both traditional Machine and Deep Learning algorithms.
In this paper, we investigate the classification performance of a variety of algorithms that receive as training input Natural Data Streams.
 arXiv  Detail & Related papers  (2023-01-09T16:32:02Z)
- Segmentation-guided Domain Adaptation for Efficient Depth Completion [3.441021278275805]
 We propose an efficient depth completion model based on a vgg05-like CNN architecture and a semi-supervised domain adaptation approach.
In order to boost spatial coherence, we guide the learning process using segmentations as additional source of information.
Our approach improves on previous efficient and low parameter state of the art approaches while having a noticeably lower computational footprint.
 arXiv  Detail & Related papers  (2022-10-14T13:01:25Z)
- GIPSO: Geometrically Informed Propagation for Online Adaptation in 3D
  LiDAR Segmentation [60.07812405063708]
 3D point cloud semantic segmentation is fundamental for autonomous driving.
Most approaches in the literature neglect an important aspect, i.e., how to deal with domain shift when handling dynamic scenes.
This paper advances the state of the art in this research field.
 arXiv  Detail & Related papers  (2022-07-20T09:06:07Z)
- Bilevel Online Deep Learning in Non-stationary Environment [4.565872584112864]
 Bilevel Online Deep Learning (BODL) framework combines bilevel optimization strategy and online ensemble classifier.
When the concept drift is detected, our BODL algorithm can adaptively update the model parameters via bilevel optimization and then circumvent the large drift and encourage positive transfer.
 arXiv  Detail & Related papers  (2022-01-25T11:05:51Z)
- How Well Do Sparse Imagenet Models Transfer? [75.98123173154605]
 Transfer learning is a classic paradigm by which models pretrained on large "upstream" datasets are adapted to yield good results on "downstream" datasets.
In this work, we perform an in-depth investigation of this phenomenon in the context of convolutional neural networks (CNNs) trained on the ImageNet dataset.
We show that sparse models can match or even outperform the transfer performance of dense models, even at high sparsities.
 arXiv  Detail & Related papers  (2021-11-26T11:58:51Z)
- Gone Fishing: Neural Active Learning with Fisher Embeddings [55.08537975896764]
 There is an increasing need for active learning algorithms that are compatible with deep neural networks.
This article introduces BAIT, a practical representation of tractable, and high-performing active learning algorithm for neural networks.
 arXiv  Detail & Related papers  (2021-06-17T17:26:31Z)
- Online Feature Screening for Data Streams with Concept Drift [8.807587076209566]
 This research study focuses on classification datasets.
Our experiments show proposed methods can generate the same feature importance as their offline version with faster speed and less storage consumption.
The results show that online screening methods with integrated model adaptation have a higher true feature detection rate than without model adaptation on data streams with the concept drift property.
 arXiv  Detail & Related papers  (2021-04-07T03:16:15Z)
- Relation-Guided Representation Learning [53.60351496449232]
 We propose a new representation learning method that explicitly models and leverages sample relations.
Our framework well preserves the relations between samples.
By seeking to embed samples into subspace, we show that our method can address the large-scale and out-of-sample problem.
 arXiv  Detail & Related papers  (2020-07-11T10:57:45Z)
- Streaming Active Deep Forest for Evolving Data Stream Classification [9.273077240506016]
 Streaming Deep Forest (SDF) is a high-performance deep ensemble method specially adapted to stream classification.
We also present the Augmented Variable Uncertainty (AVU) active learning strategy to reduce the labeling cost in the streaming context.
 arXiv  Detail & Related papers  (2020-02-26T22:00:39Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
       
     
           This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.