Almost Cost-Free Communication in Federated Best Arm Identification
- URL: http://arxiv.org/abs/2208.09215v1
- Date: Fri, 19 Aug 2022 08:37:09 GMT
- Title: Almost Cost-Free Communication in Federated Best Arm Identification
- Authors: Kota Srinivas Reddy, P. N. Karthik, and Vincent Y. F. Tan
- Abstract summary: We study the problem of best arm identification in a federated learning multi-armed bandit setup with a central server and multiple clients.
We propose a novel algorithm sc FedElim that is based on successive elimination and communicates only in exponential time steps.
- Score: 76.12303738941254
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: We study the problem of best arm identification in a federated learning
multi-armed bandit setup with a central server and multiple clients. Each
client is associated with a multi-armed bandit in which each arm yields {\em
i.i.d.}\ rewards following a Gaussian distribution with an unknown mean and
known variance. The set of arms is assumed to be the same at all the clients.
We define two notions of best arm -- local and global. The local best arm at a
client is the arm with the largest mean among the arms local to the client,
whereas the global best arm is the arm with the largest average mean across all
the clients. We assume that each client can only observe the rewards from its
local arms and thereby estimate its local best arm. The clients communicate
with a central server on uplinks that entail a cost of $C\ge0$ units per usage
per uplink. The global best arm is estimated at the server. The goal is to
identify the local best arms and the global best arm with minimal total cost,
defined as the sum of the total number of arm selections at all the clients and
the total communication cost, subject to an upper bound on the error
probability. We propose a novel algorithm {\sc FedElim} that is based on
successive elimination and communicates only in exponential time steps and
obtain a high probability instance-dependent upper bound on its total cost. The
key takeaway from our paper is that for any $C\geq 0$ and error probabilities
sufficiently small, the total number of arm selections (resp.\ the total cost)
under {\sc FedElim} is at most~$2$ (resp.~$3$) times the maximum total number
of arm selections under its variant that communicates in every time step.
Additionally, we show that the latter is optimal in expectation up to a
constant factor, thereby demonstrating that communication is almost cost-free
in {\sc FedElim}. We numerically validate the efficacy of {\sc FedElim}.
Related papers
- A General Framework for Clustering and Distribution Matching with Bandit Feedback [81.50716021326194]
We develop a general framework for clustering and distribution matching problems with bandit feedback.
We derive a non-asymptotic lower bound on the average number of arm pulls for any online algorithm with an error probability not exceeding $delta$.
arXiv Detail & Related papers (2024-09-08T12:19:12Z) - Representative Arm Identification: A fixed confidence approach to identify cluster representatives [7.459521930846415]
We study the representative arm identification problem in the multi-armed bandits (MAB) framework.
The RAI problem covers as special cases several well-studied MAB problems such as identifying the best arm or any $M$ out of the top $K$.
We propose two algorithms, based on the idea of confidence intervals, and provide high probability upper bounds on their sample complexity.
arXiv Detail & Related papers (2024-08-26T11:47:52Z) - Optimal Best Arm Identification with Fixed Confidence in Restless Bandits [66.700654953613]
We study best arm identification in a restless multi-armed bandit setting with finitely many arms.
The discrete-time data generated by each arm forms a homogeneous Markov chain taking values in a common, finite state space.
It is demonstrated that tracking the long-term behavior of a certain Markov decision process and its state-action visitation proportions are the key ingredients in analyzing the converse and achievability bounds.
arXiv Detail & Related papers (2023-10-20T10:04:05Z) - Federated Best Arm Identification with Heterogeneous Clients [62.36929749450298]
We study best arm identification in a federated multi-armed bandit setting with a central server and multiple clients.
We show that for any algorithm whose upper bound on the expected stopping time matches with the lower bound up to a multiplicative constant (em almost-optimal algorithm)
We propose a novel algorithm that communicates at exponential time instants, and demonstrate that it is almost-optimal.
arXiv Detail & Related papers (2022-10-14T13:09:11Z) - Max-Quantile Grouped Infinite-Arm Bandits [39.7239093424124]
bandit problem in which there are a number of groups each consisting of infinitely many arms.
We introduce a two-step algorithm that first requests a fixed number of arms from each group and then runs a finite-arm grouped max-quantile bandit algorithm.
We characterize both the instance-dependent and worst-case regret, and provide a matching lower bound for the latter.
arXiv Detail & Related papers (2022-10-04T00:46:49Z) - Best Arm Identification in Restless Markov Multi-Armed Bandits [85.55466536537293]
We study the problem of identifying the best arm in a multi-armed bandit environment.
A decision entity wishes to find the index of the best arm as quickly as possible, subject to an upper bound error probability.
We show that this policy achieves an upper bound that depends on $R$ and is monotonically non-increasing as $Rtoinfty$.
arXiv Detail & Related papers (2022-03-29T04:58:04Z) - Top-$k$ eXtreme Contextual Bandits with Arm Hierarchy [71.17938026619068]
We study the top-$k$ extreme contextual bandits problem, where the total number of arms can be enormous.
We first propose an algorithm for the non-extreme realizable setting, utilizing the Inverse Gap Weighting strategy.
We show that our algorithm has a regret guarantee of $O(ksqrt(A-k+1)T log (|mathcalF|T))$.
arXiv Detail & Related papers (2021-02-15T19:10:52Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.