Large-scale Optimization of Partial AUC in a Range of False Positive
Rates
- URL: http://arxiv.org/abs/2203.01505v1
- Date: Thu, 3 Mar 2022 03:46:18 GMT
- Title: Large-scale Optimization of Partial AUC in a Range of False Positive
Rates
- Authors: Yao Yao, Qihang Lin, Tianbao Yang
- Abstract summary: The area under the ROC curve (AUC) is one of the most widely used performance measures for classification models in machine learning.
We develop an efficient approximated gradient descent method based on recent practical envelope smoothing technique.
Our proposed algorithm can also be used to minimize the sum of some ranked range loss, which also lacks efficient solvers.
- Score: 51.12047280149546
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: The area under the ROC curve (AUC) is one of the most widely used performance
measures for classification models in machine learning. However, it summarizes
the true positive rates (TPRs) over all false positive rates (FPRs) in the ROC
space, which may include the FPRs with no practical relevance in some
applications. The partial AUC, as a generalization of the AUC, summarizes only
the TPRs over a specific range of the FPRs and is thus a more suitable
performance measure in many real-world situations. Although partial AUC
optimization in a range of FPRs had been studied, existing algorithms are not
scalable to big data and not applicable to deep learning. To address this
challenge, we cast the problem into a non-smooth difference-of-convex (DC)
program for any smooth predictive functions (e.g., deep neural networks), which
allowed us to develop an efficient approximated gradient descent method based
on the Moreau envelope smoothing technique, inspired by recent advances in
non-smooth DC optimization. To increase the efficiency of large data
processing, we used an efficient stochastic block coordinate update in our
algorithm. Our proposed algorithm can also be used to minimize the sum of
ranked range loss, which also lacks efficient solvers. We established a
complexity of $\tilde O(1/\epsilon^6)$ for finding a nearly $\epsilon$-critical
solution. Finally, we numerically demonstrated the effectiveness of our
proposed algorithms for both partial AUC maximization and sum of ranked range
loss minimization.
Related papers
- Adaptive Federated Minimax Optimization with Lower Complexities [82.51223883622552]
We propose an efficient adaptive minimax optimization algorithm (i.e., AdaFGDA) to solve these minimax problems.
It builds our momentum-based reduced and localSGD techniques, and it flexibly incorporate various adaptive learning rates.
arXiv Detail & Related papers (2022-11-14T12:32:18Z) - Asymptotically Unbiased Instance-wise Regularized Partial AUC
Optimization: Theory and Algorithm [101.44676036551537]
One-way Partial AUC (OPAUC) and Two-way Partial AUC (TPAUC) measures the average performance of a binary classifier.
Most of the existing methods could only optimize PAUC approximately, leading to inevitable biases that are not controllable.
We present a simpler reformulation of the PAUC problem via distributional robust optimization AUC.
arXiv Detail & Related papers (2022-10-08T08:26:22Z) - Improved Algorithms for Neural Active Learning [74.89097665112621]
We improve the theoretical and empirical performance of neural-network(NN)-based active learning algorithms for the non-parametric streaming setting.
We introduce two regret metrics by minimizing the population loss that are more suitable in active learning than the one used in state-of-the-art (SOTA) related work.
arXiv Detail & Related papers (2022-10-02T05:03:38Z) - Optimizing ROC Curves with a Sort-Based Surrogate Loss Function for
Binary Classification and Changepoint Detection [1.332560004325655]
We propose a new convex loss function called the AUM, short for Under Min(FP, FN)
We show that our new AUM learning results in improved AUC and comparable relative to previous baselines.
arXiv Detail & Related papers (2021-07-02T21:21:19Z) - Stochastic Hard Thresholding Algorithms for AUC Maximization [49.00683387735522]
We develop a hard thresholding algorithm for AUC in distributiond classification.
We conduct experiments to show the efficiency and effectiveness of the proposed algorithms.
arXiv Detail & Related papers (2020-11-04T16:49:29Z) - Adaptive Sampling for Best Policy Identification in Markov Decision
Processes [79.4957965474334]
We investigate the problem of best-policy identification in discounted Markov Decision (MDPs) when the learner has access to a generative model.
The advantages of state-of-the-art algorithms are discussed and illustrated.
arXiv Detail & Related papers (2020-09-28T15:22:24Z) - Online AUC Optimization for Sparse High-Dimensional Datasets [32.77252579365118]
Area Under the ROC Curve (AUC) is a widely used performance measure for imbalanced classification.
Current online AUC optimization algorithms have high per-iteration cost $mathcalO(d)$.
We propose a new algorithm, textscFTRL-AUC, which can process data in an online fashion with a much cheaper per-iteration cost.
arXiv Detail & Related papers (2020-09-23T00:50:01Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.