Global Optimization with Parametric Function Approximation
- URL: http://arxiv.org/abs/2211.09100v3
- Date: Wed, 19 Jul 2023 23:13:50 GMT
- Title: Global Optimization with Parametric Function Approximation
- Authors: Chong Liu, Yu-Xiang Wang
- Abstract summary: We consider the problem of global optimization with noisy zeroth order oracles.
We propose a new algorithm GO-UCB that leverages a parametric family of functions instead.
We show that GO-UCB achieves a cumulative regret of $\tilde{O}(\sqrt{T})$ where $T$ is the time horizon.
- Score: 19.699902334787325
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: We consider the problem of global optimization with noisy zeroth order
oracles - a well-motivated problem useful for various applications ranging from
hyper-parameter tuning for deep learning to new material design. Existing work
relies on Gaussian processes or other non-parametric families, which suffer from
the curse of dimensionality. In this paper, we propose a new algorithm GO-UCB
that leverages a parametric family of functions (e.g., neural networks)
instead. Under a realizable assumption and a few other mild geometric
conditions, we show that GO-UCB achieves a cumulative regret of $\tilde{O}(\sqrt{T})$
where $T$ is the time horizon. At the core of GO-UCB is a carefully designed
uncertainty set over parameters based on gradients that allows optimistic
exploration. Synthetic and real-world experiments illustrate GO-UCB works
better than popular Bayesian optimization approaches, even if the model is
misspecified.
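To make the optimistic-exploration idea concrete, here is a minimal sketch in the spirit of GO-UCB: a parametric model is refit after every query, a gradient outer-product matrix tracks parameter uncertainty, and the next point maximizes the model value plus a linearized exploration bonus. This is not the authors' implementation; the toy model `f_model`, the confidence width `beta`, the random-search acquisition, and the regularized least-squares refit are all illustrative assumptions.

```python
# Minimal sketch of GO-UCB-style optimistic exploration with a parametric model.
# Not the authors' code: the model family, confidence width, and acquisition
# search below are illustrative assumptions.
import numpy as np

rng = np.random.default_rng(0)

def f_model(x, theta):
    """Toy parametric family: linear model over simple trigonometric features."""
    phi = np.concatenate([np.sin(x), np.cos(x), x])
    return phi @ theta

def grad_theta(x, theta):
    """Gradient of f_model w.r.t. theta (here just the feature map)."""
    return np.concatenate([np.sin(x), np.cos(x), x])

def oracle(x):
    """Noisy zeroth-order oracle for the unknown objective (assumed for the demo)."""
    return -np.sum((x - 0.3) ** 2) + 0.01 * rng.standard_normal()

d, p = 2, 6                      # input dim, parameter dim
lam, beta = 1.0, 1.0             # regularizer and (assumed) confidence width
Sigma = lam * np.eye(p)          # gradient outer-product design matrix
theta_hat = np.zeros(p)
X, y = [], []

for t in range(50):
    # Optimistic acquisition: model value plus an exploration bonus from the
    # parameter uncertainty ellipsoid {theta : ||theta - theta_hat||_Sigma <= beta},
    # linearized around theta_hat.
    cands = rng.uniform(-1.0, 1.0, size=(256, d))
    Sigma_inv = np.linalg.inv(Sigma)
    scores = []
    for x in cands:
        g = grad_theta(x, theta_hat)
        bonus = np.sqrt(beta * g @ Sigma_inv @ g)
        scores.append(f_model(x, theta_hat) + bonus)
    x_t = cands[int(np.argmax(scores))]

    # Query the noisy zeroth-order oracle and record the observation.
    y_t = oracle(x_t)
    X.append(x_t)
    y.append(y_t)

    # Refit theta by regularized least squares (a stand-in for the paper's
    # regression oracle) and update the gradient outer-product matrix.
    Phi = np.array([grad_theta(x, theta_hat) for x in X])
    theta_hat = np.linalg.solve(Phi.T @ Phi + lam * np.eye(p), Phi.T @ np.array(y))
    g_t = grad_theta(x_t, theta_hat)
    Sigma = Sigma + np.outer(g_t, g_t)

print("best observed value:", max(y))
```

The `bonus` term is the usual linearized surrogate for maximizing over a gradient-based ellipsoid of parameters; the paper's actual uncertainty set and regret analysis are more involved than this toy loop.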
Related papers
- Near-Optimal Parameter Tuning of Level-1 QAOA for Ising Models [3.390330377512402]
We show how to reduce the two-dimensional $(\gamma, \beta)$ search to a one-dimensional search over $\gamma$, with $\beta^*$ computed analytically.
This approach is validated using Recursive QAOA (RQAOA), where it consistently outperforms both coarsely optimised RQAOA and semidefinite programs.
arXiv Detail & Related papers (2025-01-27T19:00:00Z) - Accelerated zero-order SGD under high-order smoothness and overparameterized regime [79.85163929026146]
We present a novel gradient-free algorithm to solve convex optimization problems.
Such problems are encountered in medicine, physics, and machine learning.
We provide convergence guarantees for the proposed algorithm under both types of noise; a minimal two-point gradient-estimator sketch in this spirit appears after this list.
arXiv Detail & Related papers (2024-11-21T10:26:17Z) - Surrogate modeling for Bayesian optimization beyond a single Gaussian
process [62.294228304646516]
We propose a novel Bayesian surrogate model to balance exploration with exploitation of the search space.
To endow function sampling with scalability, random feature-based kernel approximation is leveraged per GP model.
To further establish convergence of the proposed EGP-TS to the global optimum, analysis is conducted based on the notion of Bayesian regret.
arXiv Detail & Related papers (2022-05-27T16:43:10Z) - STORM+: Fully Adaptive SGD with Momentum for Nonconvex Optimization [74.1615979057429]
We investigate stochastic non-convex optimization problems where the objective is an expectation over smooth loss functions.
Our work builds on the STORM algorithm, in conjunction with a novel approach to adaptively set the learning rate and momentum parameters.
arXiv Detail & Related papers (2021-11-01T15:43:36Z) - Bayesian Optimisation for Constrained Problems [0.0]
We propose a novel variant of the well-known Knowledge Gradient acquisition function that allows it to handle constraints.
We empirically compare the new algorithm with four other state-of-the-art constrained Bayesian optimisation algorithms and demonstrate its superior performance.
arXiv Detail & Related papers (2021-05-27T15:43:09Z) - Bayesian Optimistic Optimisation with Exponentially Decaying Regret [58.02542541410322]
The current practical BO algorithms have regret bounds ranging from $\mathcal{O}(\frac{\log N}{\sqrt{N}})$ to $\mathcal{O}(e^{-\sqrt{N}})$, where $N$ is the number of evaluations.
This paper explores the possibility of improving the regret bound in the noiseless setting by intertwining concepts from BO and tree-based optimistic optimisation.
We propose the BOO algorithm, a first practical approach which can achieve an exponential regret bound with order $\mathcal{O}(N^{-\sqrt{N}})$.
arXiv Detail & Related papers (2021-05-10T13:07:44Z) - Better Parameter-free Stochastic Optimization with ODE Updates for
Coin-Betting [31.60239268539764]
Parameter-free stochastic gradient descent (PFSGD) algorithms do not require setting learning rates while achieving optimal theoretical performance.
In this paper, we close the empirical gap with a new parameter-free algorithm based on continuous-time Coin-Betting on truncated models.
We show empirically that this new parameter-free algorithm outperforms algorithms with the "best default" learning rates and almost matches the performance of finely tuned baselines without anything to tune.
arXiv Detail & Related papers (2020-06-12T23:10:25Z) - Convergence of adaptive algorithms for weakly convex constrained
optimization [59.36386973876765]
We prove the $\tilde{\mathcal{O}}(t^{-1/4})$ rate of convergence for the norm of the gradient of the Moreau envelope.
Our analysis works with mini-batch size of $1$, constant first and second order moment parameters, and possibly smooth optimization domains.
arXiv Detail & Related papers (2020-06-11T17:43:19Z) - Explicit Gradient Learning [28.844181847562695]
Black-Box Optimization (BBO) methods can find optimal systems with no analytical representation.
Explicit Gradient Learning (EGL) trains a neural network to estimate the objective gradient directly.
arXiv Detail & Related papers (2020-06-09T08:56:24Z) - Global Optimization of Gaussian processes [52.77024349608834]
We propose a reduced-space formulation with Gaussian processes trained on a few data points.
The approach also leads to significantly smaller and computationally cheaper subproblems for lower bounding.
In total, we reduce the time to convergence by orders of magnitude with the proposed method.
arXiv Detail & Related papers (2020-05-21T20:59:11Z)
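The gradient-free setting shared by the main paper (noisy zeroth-order oracles) and the accelerated zero-order SGD entry above can be illustrated with a standard two-point estimator. This is a generic textbook construction, not the method of either paper; the objective, step size, and smoothing radius are assumptions.

```python
# Minimal sketch of a two-point zeroth-order gradient estimate, the kind of
# gradient-free update usable when only a noisy zeroth-order oracle is available.
# The objective, step size, and smoothing radius below are illustrative assumptions.
import numpy as np

rng = np.random.default_rng(1)

def oracle(x):
    """Noisy zeroth-order oracle for a smooth convex objective (assumed)."""
    return np.sum((x - 1.0) ** 2) + 0.01 * rng.standard_normal()

d = 5
x = np.zeros(d)
step, mu = 0.1, 1e-2             # step size and smoothing radius (assumed)

for t in range(500):
    u = rng.standard_normal(d)
    u /= np.linalg.norm(u)
    # Two oracle calls give a randomized directional-derivative estimate of the gradient.
    g_hat = d * (oracle(x + mu * u) - oracle(x - mu * u)) / (2 * mu) * u
    x = x - step / (t + 1) ** 0.5 * g_hat   # SGD step with decaying step size

print("final point:", np.round(x, 2))
```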