Best Arm Identification in Spectral Bandits
- URL: http://arxiv.org/abs/2005.09841v1
- Date: Wed, 20 May 2020 04:12:04 GMT
- Title: Best Arm Identification in Spectral Bandits
- Authors: Tom\'a\v{s} Koc\'ak, Aur\'elien Garivier
- Abstract summary: Best Arm Identification (BAI) is an important challenge in many applications ranging from parameter tuning to clinical trials.
We study best-arm identification with fixed confidence in bandit models with graph smoothness constraint.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: We study best-arm identification with fixed confidence in bandit models with
graph smoothness constraint. We provide and analyze an efficient gradient
ascent algorithm to compute the sample complexity of this problem as a solution
of a non-smooth max-min problem (providing in passing a simplified analysis for
the unconstrained case). Building on this algorithm, we propose an
asymptotically optimal strategy. We furthermore illustrate by numerical
experiments both the strategy's efficiency and the impact of the smoothness
constraint on the sample complexity. Best Arm Identification (BAI) is an
important challenge in many applications ranging from parameter tuning to
clinical trials. It is now very well understood in vanilla bandit models, but
real-world problems typically involve some dependency between arms that
requires more involved models. Assuming a graph structure on the arms is an
elegant practical way to encompass this phenomenon, but this had been done so
far only for regret minimization. Addressing BAI with graph constraints
involves delicate optimization problems for which the present paper offers a
solution.
Related papers
- Optimal Multi-Fidelity Best-Arm Identification [65.23078799972188]
In bandit best-arm identification, an algorithm is tasked with finding the arm with highest mean reward with a specified accuracy as fast as possible.
We study multi-fidelity best-arm identification, in which the can choose to sample an arm at a lower fidelity (less accurate mean estimate) for a lower cost.
Several methods have been proposed for tackling this problem, but their optimality remain elusive, notably due to loose lower bounds on the total cost needed to identify the best arm.
arXiv Detail & Related papers (2024-06-05T08:02:40Z) - Best Arm Identification with Fixed Budget: A Large Deviation Perspective [54.305323903582845]
We present sred, a truly adaptive algorithm that can reject arms in it any round based on the observed empirical gaps between the rewards of various arms.
In particular, we present sred, a truly adaptive algorithm that can reject arms in it any round based on the observed empirical gaps between the rewards of various arms.
arXiv Detail & Related papers (2023-12-19T13:17:43Z) - Pure Exploration in Bandits with Linear Constraints [15.547603114649464]
We address the problem of identifying the optimal policy with a fixed confidence level in a multi-armed bandit setup.
We introduce twoally optimal algorithms for this setting, one based on the Track-and-Stop method and the other based on a game-theoretic approach.
We provide empirical results that validate our bounds and visualize how constraints change the hardness of the problem.
arXiv Detail & Related papers (2023-06-22T10:00:33Z) - Optimal Algorithms for Stochastic Complementary Composite Minimization [55.26935605535377]
Inspired by regularization techniques in statistics and machine learning, we study complementary composite minimization.
We provide novel excess risk bounds, both in expectation and with high probability.
Our algorithms are nearly optimal, which we prove via novel lower complexity bounds for this class of problems.
arXiv Detail & Related papers (2022-11-03T12:40:24Z) - Gaussian Graphical Model Selection for Huge Data via Minipatch Learning [1.2891210250935146]
We propose the Minipatch Graph (MPGraph) estimator to solve the problem of graphical model selection.
MPGraph is a generalization of thresholded graph estimators fit to tiny, random subsets of both the observations and the nodes.
We prove that our algorithm achieves finite-sample graph selection consistency.
arXiv Detail & Related papers (2021-10-22T21:06:48Z) - Differentiable Causal Discovery from Interventional Data [141.41931444927184]
We propose a theoretically-grounded method based on neural networks that can leverage interventional data.
We show that our approach compares favorably to the state of the art in a variety of settings.
arXiv Detail & Related papers (2020-07-03T15:19:17Z) - Optimal Best-arm Identification in Linear Bandits [79.3239137440876]
We devise a simple algorithm whose sampling complexity matches known instance-specific lower bounds.
Unlike existing best-arm identification strategies, our algorithm uses a stopping rule that does not depend on the number of arms.
arXiv Detail & Related papers (2020-06-29T14:25:51Z) - Quasi-Newton Solver for Robust Non-Rigid Registration [35.66014845211251]
We propose a formulation for robust non-rigid registration based on a globally smooth robust estimator for data fitting and regularization.
We apply the majorization-minimization algorithm to the problem, which reduces each iteration to solving a simple least-squares problem with L-BFGS.
arXiv Detail & Related papers (2020-04-09T01:45:05Z) - The Simulator: Understanding Adaptive Sampling in the
Moderate-Confidence Regime [52.38455827779212]
We propose a novel technique for analyzing adaptive sampling called the em Simulator.
We prove the first instance-based lower bounds the top-k problem which incorporate the appropriate log-factors.
Our new analysis inspires a simple and near-optimal for the best-arm and top-k identification, the first em practical of its kind for the latter problem.
arXiv Detail & Related papers (2017-02-16T23:42:02Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.