Speeding Up OPFython with Numba
- URL: http://arxiv.org/abs/2106.11828v1
- Date: Tue, 22 Jun 2021 14:39:32 GMT
- Title: Speeding Up OPFython with Numba
- Authors: Gustavo H. de Rosa, Jo\~ao Paulo Papa
- Abstract summary: Optimum-Path Forest (OPF) has proven to be a state-of-the-art algorithm comparable to Logistic Regressors, Support Vector Machines.
Recently, its Python-based version, denoted as OPFython, has been proposed to provide a more friendly framework and a faster prototyping environment.
This paper proposes a simple yet highly efficient speed up using the Numba package, which accelerates Numpy-based calculations and attempts to increase the algorithm's overall performance.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: A graph-inspired classifier, known as Optimum-Path Forest (OPF), has proven
to be a state-of-the-art algorithm comparable to Logistic Regressors, Support
Vector Machines in a wide variety of tasks. Recently, its Python-based version,
denoted as OPFython, has been proposed to provide a more friendly framework and
a faster prototyping environment. Nevertheless, Python-based algorithms are
slower than their counterpart C-based algorithms, impacting their performance
when confronted with large amounts of data. Therefore, this paper proposed a
simple yet highly efficient speed up using the Numba package, which accelerates
Numpy-based calculations and attempts to increase the algorithm's overall
performance. Experimental results showed that the proposed approach achieved
better results than the na\"ive Python-based OPF and speeded up its distance
measurement calculation.
Related papers
- Faster WIND: Accelerating Iterative Best-of-$N$ Distillation for LLM Alignment [81.84950252537618]
This paper reveals a unified game-theoretic connection between iterative BOND and self-play alignment.
We establish a novel framework, WIN rate Dominance (WIND), with a series of efficient algorithms for regularized win rate dominance optimization.
arXiv Detail & Related papers (2024-10-28T04:47:39Z) - Sketch 'n Solve: An Efficient Python Package for Large-Scale Least Squares Using Randomized Numerical Linear Algebra [0.0]
We present Sketch 'n Solve, an open-source Python package that implements efficient randomized numerical linear algebra techniques.
We show that our implementation achieves up to 50x speedup over traditional LSQR while maintaining high accuracy, even for ill-conditioned matrices.
The package shows particular promise for applications in machine learning optimization, signal processing, and scientific computing.
arXiv Detail & Related papers (2024-09-22T04:29:51Z) - Poisson Process for Bayesian Optimization [126.51200593377739]
We propose a ranking-based surrogate model based on the Poisson process and introduce an efficient BO framework, namely Poisson Process Bayesian Optimization (PoPBO)
Compared to the classic GP-BO method, our PoPBO has lower costs and better robustness to noise, which is verified by abundant experiments.
arXiv Detail & Related papers (2024-02-05T02:54:50Z) - High Performance Computing Applied to Logistic Regression: A CPU and GPU
Implementation Comparison [0.0]
We present a versatile GPU-based parallel version of Logistic Regression (LR)
Our implementation is a direct translation of the parallel Gradient Descent Logistic Regression algorithm proposed by X. Zou et al.
Our method is particularly advantageous for real-time prediction applications like image recognition, spam detection, and fraud detection.
arXiv Detail & Related papers (2023-08-19T14:49:37Z) - PyBADS: Fast and robust black-box optimization in Python [11.4219428942199]
PyBADS is an implementation of the Adaptive Direct Search (BADS) algorithm for fast and robust black-box optimization.
It comes along with an easy-to-use Python interface for running the algorithm for running the results.
arXiv Detail & Related papers (2023-06-27T15:54:44Z) - Large-scale Optimization of Partial AUC in a Range of False Positive
Rates [51.12047280149546]
The area under the ROC curve (AUC) is one of the most widely used performance measures for classification models in machine learning.
We develop an efficient approximated gradient descent method based on recent practical envelope smoothing technique.
Our proposed algorithm can also be used to minimize the sum of some ranked range loss, which also lacks efficient solvers.
arXiv Detail & Related papers (2022-03-03T03:46:18Z) - Accelerated Stochastic Gradient for Nonnegative Tensor Completion and
Parallel Implementation [0.3670422696827525]
We adopt the alternating optimization framework and solve each nonnegative matrix completion problem via a variation of the gradient accelerated algorithm.
We develop a shared-memory implementation of our algorithm using the multi-threaded API OpenMP, which attains significant speedup.
arXiv Detail & Related papers (2021-09-20T13:32:12Z) - Towards Optimally Efficient Tree Search with Deep Learning [76.64632985696237]
This paper investigates the classical integer least-squares problem which estimates signals integer from linear models.
The problem is NP-hard and often arises in diverse applications such as signal processing, bioinformatics, communications and machine learning.
We propose a general hyper-accelerated tree search (HATS) algorithm by employing a deep neural network to estimate the optimal estimation for the underlying simplified memory-bounded A* algorithm.
arXiv Detail & Related papers (2021-01-07T08:00:02Z) - Efficient Computation of Expectations under Spanning Tree Distributions [67.71280539312536]
We propose unified algorithms for the important cases of first-order expectations and second-order expectations in edge-factored, non-projective spanning-tree models.
Our algorithms exploit a fundamental connection between gradients and expectations, which allows us to derive efficient algorithms.
arXiv Detail & Related papers (2020-08-29T14:58:26Z) - OPFython: A Python-Inspired Optimum-Path Forest Classifier [68.8204255655161]
This paper proposes a Python-based Optimum-Path Forest framework, denoted as OPFython.
As OPFython is a Python-based library, it provides a more friendly environment and a faster prototyping workspace than the C language.
arXiv Detail & Related papers (2020-01-28T15:46:19Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.