abess: A Fast Best Subset Selection Library in Python and R
- URL: http://arxiv.org/abs/2110.09697v1
- Date: Tue, 19 Oct 2021 02:34:55 GMT
- Title: abess: A Fast Best Subset Selection Library in Python and R
- Authors: Jin Zhu, Liyuan Hu, Junhao Huang, Kangkang Jiang, Yanhang Zhang,
Shiyun Lin, Junxian Zhu, Xueqin Wang
- Abstract summary: We introduce a new library named abess that implements a unified framework of best-subset selection.
The abess certifiably gets the optimal solution within times under the linear model.
The core of the library is programmed in C++, and it can be installed from the Python library Index.
- Score: 1.6208003359512848
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: We introduce a new library named abess that implements a unified framework of
best-subset selection for solving diverse machine learning problems, e.g.,
linear regression, classification, and principal component analysis.
Particularly, the abess certifiably gets the optimal solution within polynomial
times under the linear model. Our efficient implementation allows abess to
attain the solution of best-subset selection problems as fast as or even 100x
faster than existing competing variable (model) selection toolboxes.
Furthermore, it supports common variants like best group subset selection and
$\ell_2$ regularized best-subset selection. The core of the library is
programmed in C++. For ease of use, a Python library is designed for
conveniently integrating with scikit-learn, and it can be installed from the
Python library Index. In addition, a user-friendly R library is available at
the Comprehensive R Archive Network. The source code is available at:
https://github.com/abess-team/abess.
Related papers
- $\texttt{skwdro}$: a library for Wasserstein distributionally robust machine learning [6.940992962425166]
skwdro is a Python library for training robust machine learning models.
It features both scikit-learn compatible estimators for popular objectives, as well as a wrapper for PyTorch modules.
arXiv Detail & Related papers (2024-10-28T17:16:00Z) - Causal-learn: Causal Discovery in Python [53.17423883919072]
Causal discovery aims at revealing causal relations from observational data.
$textitcausal-learn$ is an open-source Python library for causal discovery.
arXiv Detail & Related papers (2023-07-31T05:00:35Z) - Deepchecks: A Library for Testing and Validating Machine Learning Models
and Data [8.876608553825227]
Deepchecks is a Python library for comprehensively validating machine learning models and data.
Our goal is to provide an easy-to-use library comprising of many checks related to various types of issues.
arXiv Detail & Related papers (2022-03-16T09:37:22Z) - Submodlib: A Submodular Optimization Library [17.596860081700115]
Submodlib is an open-source, easy-to-use, efficient and scalable Python library for submodular optimization.
Submodlib finds its application in summarization, data subset selection, hyper parameter tuning, efficient training and more.
arXiv Detail & Related papers (2022-02-22T05:48:12Z) - MRCpy: A Library for Minimax Risk Classifiers [10.380882297891272]
Python library, MRCpy, implements minimax risk classifiers (MRCs) based on the robust risk minimization (RRM) approach.
MRCpy follows the standards of popular Python libraries, such as scikit-learn, facilitating readability and easy usage together with a seamless integration with other libraries.
arXiv Detail & Related papers (2021-08-04T10:31:20Z) - Solo-learn: A Library of Self-supervised Methods for Visual
Representation Learning [83.02597612195966]
solo-learn is a library of self-supervised methods for visual representation learning.
Implemented in Python, using Pytorch and Pytorch lightning, the library fits both research and industry needs.
arXiv Detail & Related papers (2021-08-03T22:19:55Z) - BOML: A Modularized Bilevel Optimization Library in Python for Meta
Learning [52.90643948602659]
BOML is a modularized optimization library that unifies several meta-learning algorithms into a common bilevel optimization framework.
It provides a hierarchical optimization pipeline together with a variety of iteration modules, which can be used to solve the mainstream categories of meta-learning methods.
arXiv Detail & Related papers (2020-09-28T14:21:55Z) - Picasso: A Sparse Learning Library for High Dimensional Data Analysis in
R and Python [77.33905890197269]
We describe a new library which implements a unified pathwise coordinate optimization for a variety of sparse learning problems.
The library is coded in R++ and has user-friendly sparse experiments.
arXiv Detail & Related papers (2020-06-27T02:39:24Z) - OPFython: A Python-Inspired Optimum-Path Forest Classifier [68.8204255655161]
This paper proposes a Python-based Optimum-Path Forest framework, denoted as OPFython.
As OPFython is a Python-based library, it provides a more friendly environment and a faster prototyping workspace than the C language.
arXiv Detail & Related papers (2020-01-28T15:46:19Z) - Multi-layer Optimizations for End-to-End Data Analytics [71.05611866288196]
We introduce Iterative Functional Aggregate Queries (IFAQ), a framework that realizes an alternative approach.
IFAQ treats the feature extraction query and the learning task as one program given in the IFAQ's domain-specific language.
We show that a Scala implementation of IFAQ can outperform mlpack, Scikit, and specialization by several orders of magnitude for linear regression and regression tree models over several relational datasets.
arXiv Detail & Related papers (2020-01-10T16:14:44Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.