Enriched Functional Tree-Based Classifiers: A Novel Approach Leveraging
Derivatives and Geometric Features
- URL: http://arxiv.org/abs/2409.17804v1
- Date: Thu, 26 Sep 2024 12:57:47 GMT
- Title: Enriched Functional Tree-Based Classifiers: A Novel Approach Leveraging
Derivatives and Geometric Features
- Authors: Fabrizio Maturo, Annamaria Porreca
- Abstract summary: This study introduces an advanced methodology for supervised classification by integrating Functional Data Analysis (FDA) with tree-based ensemble techniques for classifying high-dimensional time series.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: The positioning of this research falls within the scalar-on-function
classification literature, a field of significant interest across various
domains, particularly in statistics, mathematics, and computer science. This
study introduces an advanced methodology for supervised classification by
integrating Functional Data Analysis (FDA) with tree-based ensemble techniques
for classifying high-dimensional time series. The proposed framework, Enriched
Functional Tree-Based Classifiers (EFTCs), leverages derivative and geometric
features, benefiting from the diversity inherent in ensemble methods to further
enhance predictive performance and reduce variance. While our approach has been
tested on the enrichment of Functional Classification Trees (FCTs), Functional
K-NN (FKNN), Functional Random Forest (FRF), Functional XGBoost (FXGB), and
Functional LightGBM (FLGBM), it could be extended to other tree-based and
non-tree-based classifiers, with appropriate considerations emerging from this
investigation. Through extensive experimental evaluations on seven real-world
datasets and six simulated scenarios, this proposal demonstrates fascinating
improvements over traditional approaches, providing new insights into the
application of FDA in complex, high-dimensional learning problems.
Related papers
- Enriching Tabular Data with Contextual LLM Embeddings: A Comprehensive Ablation Study for Ensemble Classifiers [20.46918103875102]
This study presents a systematic approach to enrich datasets with features derived from large language model embeddings.
We assess the impact of RoBERTa and GPT-2 embeddings on ensemble classifiers, including Random Forest, XGBoost, and CatBoost.
Results indicate that integrating embeddings with traditional numerical and categorical features often enhances predictive performance.
arXiv Detail & Related papers (2024-11-03T17:45:00Z) - Ranking Perspective for Tree-based Methods with Applications to Symbolic Feature Selection [3.2964064859807496]
Tree-based methods are powerful nonparametric techniques in statistics and machine learning.
Recent applications have revealed their surprising ability to distinguish transformations that remain obscure under current theoretical understanding.
This work provides a finite-sample analysis of tree-based methods from a ranking perspective.
arXiv Detail & Related papers (2024-10-03T16:03:39Z) - Randomized Spline Trees for Functional Data Classification: Theory and Application to Environmental Time Series [0.0]
This paper introduces Randomized Spline Trees (RST), a novel algorithm that bridges the two approaches by incorporating randomized functional representations into the Random Forest framework.
RST generates diverse functional representations of input data using randomized B-spline parameters, creating an ensemble of decision trees trained on these varied representations.
Results show that RST variants outperform standard Random Forests and Gradient Boosting on most datasets, improving classification accuracy by up to 14%.
arXiv Detail & Related papers (2024-09-12T09:38:16Z) - Augmented Functional Random Forests: Classifier Construction and Unbiased Functional Principal Components Importance through Ad-Hoc Conditional Permutations [0.0]
This paper introduces a novel supervised classification strategy that integrates functional data analysis with tree-based methods.
We propose augmented versions of functional classification trees and functional random forests, incorporating a new tool for assessing the importance of functional principal components.
arXiv Detail & Related papers (2024-08-23T15:58:41Z) - Generating Hierarchical Structures for Improved Time Series
Classification Using Stochastic Splitting Functions [0.0]
This study introduces a novel hierarchical divisive clustering approach with splitting functions (SSFs) to enhance classification performance in multi-class datasets through hierarchical classification (HC)
The method has the unique capability of generating hierarchy without requiring explicit information, making it suitable for datasets lacking prior knowledge of hierarchy.
arXiv Detail & Related papers (2023-09-21T10:34:50Z) - Hierarchical clustering with dot products recovers hidden tree structure [53.68551192799585]
In this paper we offer a new perspective on the well established agglomerative clustering algorithm, focusing on recovery of hierarchical structure.
We recommend a simple variant of the standard algorithm, in which clusters are merged by maximum average dot product and not, for example, by minimum distance or within-cluster variance.
We demonstrate that the tree output by this algorithm provides a bona fide estimate of generative hierarchical structure in data, under a generic probabilistic graphical model.
arXiv Detail & Related papers (2023-05-24T11:05:12Z) - Offline Reinforcement Learning with Differentiable Function
Approximation is Provably Efficient [65.08966446962845]
offline reinforcement learning, which aims at optimizing decision-making strategies with historical data, has been extensively applied in real-life applications.
We take a step by considering offline reinforcement learning with differentiable function class approximation (DFA)
Most importantly, we show offline differentiable function approximation is provably efficient by analyzing the pessimistic fitted Q-learning algorithm.
arXiv Detail & Related papers (2022-10-03T07:59:42Z) - Improved Algorithms for Neural Active Learning [74.89097665112621]
We improve the theoretical and empirical performance of neural-network(NN)-based active learning algorithms for the non-parametric streaming setting.
We introduce two regret metrics by minimizing the population loss that are more suitable in active learning than the one used in state-of-the-art (SOTA) related work.
arXiv Detail & Related papers (2022-10-02T05:03:38Z) - Adversarial Feature Augmentation and Normalization for Visual
Recognition [109.6834687220478]
Recent advances in computer vision take advantage of adversarial data augmentation to ameliorate the generalization ability of classification models.
Here, we present an effective and efficient alternative that advocates adversarial augmentation on intermediate feature embeddings.
We validate the proposed approach across diverse visual recognition tasks with representative backbone networks.
arXiv Detail & Related papers (2021-03-22T20:36:34Z) - Structure-Enhanced Meta-Learning For Few-Shot Graph Classification [53.54066611743269]
This work explores the potential of metric-based meta-learning for solving few-shot graph classification.
An implementation upon GIN, named SMFGIN, is tested on two datasets, Chembl and TRIANGLES.
arXiv Detail & Related papers (2021-03-05T09:03:03Z) - MurTree: Optimal Classification Trees via Dynamic Programming and Search [61.817059565926336]
We present a novel algorithm for learning optimal classification trees based on dynamic programming and search.
Our approach uses only a fraction of the time required by the state-of-the-art and can handle datasets with tens of thousands of instances.
arXiv Detail & Related papers (2020-07-24T17:06:55Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.