Package for Fast ABC-Boost
- URL: http://arxiv.org/abs/2207.08770v1
- Date: Mon, 18 Jul 2022 17:22:32 GMT
- Title: Package for Fast ABC-Boost
- Authors: Ping Li and Weijie Zhao
- Abstract summary: This report presents the open-source package which implements the series of our boosting works in the past years.
The histogram-based (feature-binning) approach makes the tree implementation convenient and efficient.
The explicit gain formula in Li (20010) for tree splitting based on second-order derivatives of the loss function typically improves.
- Score: 21.607059258448594
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: This report presents the open-source package which implements the series of
our boosting works in the past years. In particular, the package includes
mainly three lines of techniques, among which the following two are already the
standard implementations in popular boosted tree platforms:
(i) The histogram-based (feature-binning) approach makes the tree
implementation convenient and efficient. In Li et al (2007), a simple
fixed-length adaptive binning algorithm was developed. In this report, we
demonstrate that such a simple algorithm is still surprisingly effective
compared to more sophisticated variants in popular tree platforms.
(ii) The explicit gain formula in Li (20010) for tree splitting based on
second-order derivatives of the loss function typically improves, often
considerably, over the first-order methods. Although the gain formula in Li
(2010) was derived for logistic regression loss, it is a generic formula for
loss functions with second-derivatives. For example, the open-source package
also includes $L_p$ regression for $p\geq 1$.
The main contribution of this package is the ABC-Boost (adaptive base class
boosting) for multi-class classification. The initial work in Li (2008) derived
a new set of derivatives of the classical multi-class logistic regression by
specifying a "base class". The accuracy can be substantially improved if the
base class is chosen properly. The major technical challenge is to design a
search strategy to select the base class. The prior published works implemented
an exhaustive search procedure to find the base class which is computationally
too expensive. Recently, a new report (Li and Zhao, 20022) presents a unified
framework of "Fast ABC-Boost" which allows users to efficiently choose the
proper search space for the base class.
The package provides interfaces for linux, windows, mac, matlab, R, python.
Related papers
- Achieving More with Less: A Tensor-Optimization-Powered Ensemble Method [53.170053108447455]
Ensemble learning is a method that leverages weak learners to produce a strong learner.
We design a smooth and convex objective function that leverages the concept of margin, making the strong learner more discriminative.
We then compare our algorithm with random forests of ten times the size and other classical methods across numerous datasets.
arXiv Detail & Related papers (2024-08-06T03:42:38Z) - How to Boost Any Loss Function [63.573324901948716]
We show that any loss function can be optimized with boosting.
We also show that boosting can achieve a feat not yet known to be possible in the classical $0th$ order setting.
arXiv Detail & Related papers (2024-07-02T14:08:23Z) - Robust Capped lp-Norm Support Vector Ordinal Regression [85.84718111830752]
Ordinal regression is a specialized supervised problem where the labels show an inherent order.
Support Vector Ordinal Regression, as an outstanding ordinal regression model, is widely used in many ordinal regression tasks.
We introduce a new model, Capped $ell_p$-Norm Support Vector Ordinal Regression(CSVOR), that is robust to outliers.
arXiv Detail & Related papers (2024-04-25T13:56:05Z) - Multiclass Boosting: Simple and Intuitive Weak Learning Criteria [72.71096438538254]
We give a simple and efficient boosting algorithm, that does not require realizability assumptions.
We present a new result on boosting for list learners, as well as provide a novel proof for the characterization of multiclass PAC learning.
arXiv Detail & Related papers (2023-07-02T19:26:58Z) - pGMM Kernel Regression and Comparisons with Boosted Trees [21.607059258448594]
In this work, we demonstrate the advantage of the pGMM kernel in the context of (ridge) regression.
Perhaps surprisingly, even without a tuning parameter (i.e., $p=1$ for the power parameter of the pGMM kernel), the pGMM kernel already performs well.
Perhaps also surprisingly, the best performance (in terms of $L$ regression loss) is often attained at $p>2$, in some cases at $pgggg 2$.
arXiv Detail & Related papers (2022-07-18T15:06:30Z) - Human-in-the-loop: Provably Efficient Preference-based Reinforcement
Learning with General Function Approximation [107.54516740713969]
We study human-in-the-loop reinforcement learning (RL) with trajectory preferences.
Instead of receiving a numeric reward at each step, the agent only receives preferences over trajectory pairs from a human overseer.
We propose the first optimistic model-based algorithm for PbRL with general function approximation.
arXiv Detail & Related papers (2022-05-23T09:03:24Z) - Fast ABC-Boost: A Unified Framework for Selecting the Base Class in
Multi-Class Classification [21.607059258448594]
We develop a unified framework for effectively selecting the base class by introducing a series of ideas to improve the computational efficiency of ABC-Boost.
Our framework has parameters $(s,g,w)$.
arXiv Detail & Related papers (2022-05-22T20:42:26Z) - Fast Few-Shot Classification by Few-Iteration Meta-Learning [173.32497326674775]
We introduce a fast optimization-based meta-learning method for few-shot classification.
Our strategy enables important aspects of the base learner objective to be learned during meta-training.
We perform a comprehensive experimental analysis, demonstrating the speed and effectiveness of our approach.
arXiv Detail & Related papers (2020-10-01T15:59:31Z) - Fully-Corrective Gradient Boosting with Squared Hinge: Fast Learning
Rates and Early Stopping [29.485528641599018]
We propose an efficient boosting method with theoretical generalization guarantees for binary classification.
We derive a fast learning rate of the order $cal O((m/log m)-1/4)$ for the proposed boosting method.
Both derived learning rates are the best ones among the existing generalization results of boosting-type methods for classification.
arXiv Detail & Related papers (2020-04-01T00:39:24Z) - BoostTree and BoostForest for Ensemble Learning [27.911350375268576]
BoostForest is an ensemble learning approach using BoostTree as base learners and can be used for both classification and regression.
It generally outperformed four classical ensemble learning approaches (Random Forest, Extra-Trees, XGBoost and LightGBM) on 35 classification and regression datasets.
arXiv Detail & Related papers (2020-03-21T19:52:13Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.