Global Optimization with Parametric Function Approximation
        - URL: http://arxiv.org/abs/2211.09100v3
- Date: Wed, 19 Jul 2023 23:13:50 GMT
- Title: Global Optimization with Parametric Function Approximation
- Authors: Chong Liu, Yu-Xiang Wang
- Abstract summary: We consider the problem of global optimization with noisy zeroth order oracles.
We propose a new algorithm GO-UCB that leverages a parametric family of functions instead.
We show that GO-UCB achieves a cumulative regret of O$(sqrtT)$ where $T$ is the time horizon.
- Score: 19.699902334787325
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract:   We consider the problem of global optimization with noisy zeroth order
oracles - a well-motivated problem useful for various applications ranging from
hyper-parameter tuning for deep learning to new material design. Existing work
relies on Gaussian processes or other non-parametric family, which suffers from
the curse of dimensionality. In this paper, we propose a new algorithm GO-UCB
that leverages a parametric family of functions (e.g., neural networks)
instead. Under a realizable assumption and a few other mild geometric
conditions, we show that GO-UCB achieves a cumulative regret of \~O$(\sqrt{T})$
where $T$ is the time horizon. At the core of GO-UCB is a carefully designed
uncertainty set over parameters based on gradients that allows optimistic
exploration. Synthetic and real-world experiments illustrate GO-UCB works
better than popular Bayesian optimization approaches, even if the model is
misspecified.
 
      
        Related papers
        - Decentralized Nonconvex Composite Federated Learning with Gradient   Tracking and Momentum [78.27945336558987]
 Decentralized server (DFL) eliminates reliance on client-client architecture.
Non-smooth regularization is often incorporated into machine learning tasks.
We propose a novel novel DNCFL algorithm to solve these problems.
 arXiv  Detail & Related papers  (2025-04-17T08:32:25Z)
- Quantum Non-Linear Bandit Optimization [2.7718037613443127]
 We study non-linear bandit optimization where the learner maximizes a black-box function with zeroth order function oracle.
We propose the new Q-NLB-UCB algorithm which uses the novel parametric function approximation technique.
We prove that the regret bound of Q-NLB-UCB is not only $O(mathrmpolylog T)$ but also input dimension-free.
 arXiv  Detail & Related papers  (2025-03-04T21:53:33Z)
- Gradient-free stochastic optimization for additive models [56.42455605591779]
 We address the problem of zero-order optimization from noisy observations for an objective function satisfying the Polyak-Lojasiewicz or the strong convexity condition.
We assume that the objective function has an additive structure and satisfies a higher-order smoothness property, characterized by the H"older family of functions.
 arXiv  Detail & Related papers  (2025-03-03T23:39:08Z)
- Near-Optimal Parameter Tuning of Level-1 QAOA for Ising Models [3.390330377512402]
 We show how to reduce the two-dimensional $(gamma, beta)$ search to a one-dimensional search over $gamma$, with $beta*$ computed analytically.
This approach is validated using Recursive QAOA (RQAOA), where it consistently outperforms both coarsely optimised RQAOA and semidefinite programs.
 arXiv  Detail & Related papers  (2025-01-27T19:00:00Z)
- Accelerated zero-order SGD under high-order smoothness and   overparameterized regime [79.85163929026146]
 We present a novel gradient-free algorithm to solve convex optimization problems.
Such problems are encountered in medicine, physics, and machine learning.
We provide convergence guarantees for the proposed algorithm under both types of noise.
 arXiv  Detail & Related papers  (2024-11-21T10:26:17Z)
- Surrogate modeling for Bayesian optimization beyond a single Gaussian
  process [62.294228304646516]
 We propose a novel Bayesian surrogate model to balance exploration with exploitation of the search space.
To endow function sampling with scalability, random feature-based kernel approximation is leveraged per GP model.
To further establish convergence of the proposed EGP-TS to the global optimum, analysis is conducted based on the notion of Bayesian regret.
 arXiv  Detail & Related papers  (2022-05-27T16:43:10Z)
- STORM+: Fully Adaptive SGD with Momentum for Nonconvex Optimization [74.1615979057429]
 We investigate non-batch optimization problems where the objective is an expectation over smooth loss functions.
Our work builds on the STORM algorithm, in conjunction with a novel approach to adaptively set the learning rate and momentum parameters.
 arXiv  Detail & Related papers  (2021-11-01T15:43:36Z)
- Bayesian Optimisation for Constrained Problems [0.0]
 We propose a novel variant of the well-known Knowledge Gradient acquisition function that allows it to handle constraints.
We empirically compare the new algorithm with four other state-of-the-art constrained Bayesian optimisation algorithms and demonstrate its superior performance.
 arXiv  Detail & Related papers  (2021-05-27T15:43:09Z)
- Bayesian Optimistic Optimisation with Exponentially Decaying Regret [58.02542541410322]
 The current practical BO algorithms have regret bounds ranging from $mathcalO(fraclogNsqrtN)$ to $mathcal O(e-sqrtN)$, where $N$ is the number of evaluations.
This paper explores the possibility of improving the regret bound in the noiseless setting by intertwining concepts from BO and tree-based optimistic optimisation.
We propose the BOO algorithm, a first practical approach which can achieve an exponential regret bound with order $mathcal O(N-sqrt
 arXiv  Detail & Related papers  (2021-05-10T13:07:44Z)
- Towards Tractable Optimism in Model-Based Reinforcement Learning [37.51073590932658]
 To be successful, an optimistic RL algorithm must over-estimate the true value function (optimism) but not by so much that it is inaccurate (estimation error)
We re-interpret these scalable optimistic model-based algorithms as solving a tractable noise augmented MDP.
We show that if this error is reduced, optimistic model-based RL algorithms can match state-of-the-art performance in continuous control problems.
 arXiv  Detail & Related papers  (2020-06-21T20:53:19Z)
- Better Parameter-free Stochastic Optimization with ODE Updates for
  Coin-Betting [31.60239268539764]
 PFSGD algorithms do not require setting learning rates while achieving optimal theoretical performance.
In this paper, we close the empirical gap with a new parameter-free algorithm based on continuous-time Coin-Betting on truncated models.
We show empirically that this new parameter-free algorithm outperforms algorithms with the "best default" learning rates and almost matches the performance of finely tuned baselines without anything to tune.
 arXiv  Detail & Related papers  (2020-06-12T23:10:25Z)
- Convergence of adaptive algorithms for weakly convex constrained
  optimization [59.36386973876765]
 We prove the $mathcaltilde O(t-1/4)$ rate of convergence for the norm of the gradient of Moreau envelope.
Our analysis works with mini-batch size of $1$, constant first and second order moment parameters, and possibly smooth optimization domains.
 arXiv  Detail & Related papers  (2020-06-11T17:43:19Z)
- Explicit Gradient Learning [28.844181847562695]
 Black-Box Optimization (BBO) methods can find optimal systems with no analytical representation.
EGL trains a NN to estimate the objective gradient directly.
 arXiv  Detail & Related papers  (2020-06-09T08:56:24Z)
- Global Optimization of Gaussian processes [52.77024349608834]
 We propose a reduced-space formulation with trained Gaussian processes trained on few data points.
The approach also leads to significantly smaller and computationally cheaper sub solver for lower bounding.
In total, we reduce time convergence by orders of orders of the proposed method.
 arXiv  Detail & Related papers  (2020-05-21T20:59:11Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
       
     
           This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.