Fast swap regret minimization and applications to approximate correlated
equilibria
- URL: http://arxiv.org/abs/2310.19647v2
- Date: Tue, 14 Nov 2023 17:39:27 GMT
- Title: Fast swap regret minimization and applications to approximate correlated
equilibria
- Authors: Binghui Peng and Aviad Rubinstein
- Abstract summary: For any constant $varepsilon>0$, our algorithm obtains $varepsilon T$-swap regret within only $T = mathsfpolylog(n)$ rounds.
Our algorithm has an exponential dependence on $varepsilon$, but we prove a new, matching lower bound.
- Score: 20.047624953965965
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: We give a simple and computationally efficient algorithm that, for any
constant $\varepsilon>0$, obtains $\varepsilon T$-swap regret within only $T =
\mathsf{polylog}(n)$ rounds; this is an exponential improvement compared to the
super-linear number of rounds required by the state-of-the-art algorithm, and
resolves the main open problem of [Blum and Mansour 2007]. Our algorithm has an
exponential dependence on $\varepsilon$, but we prove a new, matching lower
bound.
Our algorithm for swap regret implies faster convergence to
$\varepsilon$-Correlated Equilibrium ($\varepsilon$-CE) in several regimes: For
normal form two-player games with $n$ actions, it implies the first uncoupled
dynamics that converges to the set of $\varepsilon$-CE in polylogarithmic
rounds; a $\mathsf{polylog}(n)$-bit communication protocol for $\varepsilon$-CE
in two-player games (resolving an open problem mentioned by
[Babichenko-Rubinstein'2017, Goos-Rubinstein'2018, Ganor-CS'2018]); and an
$\tilde{O}(n)$-query algorithm for $\varepsilon$-CE (resolving an open problem
of [Babichenko'2020] and obtaining the first separation between
$\varepsilon$-CE and $\varepsilon$-Nash equilibrium in the query complexity
model).
For extensive-form games, our algorithm implies a PTAS for $\mathit{normal}$
$\mathit{form}$ $\mathit{correlated}$ $\mathit{equilibria}$, a solution concept
often conjectured to be computationally intractable (e.g. [Stengel-Forges'08,
Fujii'23]).
Related papers
- The complexity of approximate (coarse) correlated equilibrium for incomplete information games [16.96984593866157]
We study the complexity of decentralized learning of approximate correlated equilibria in incomplete information games.
Our lower bound holds even for the easier solution concept of $epsilon$-approximate $mathitco$ correlated equilibrium.
arXiv Detail & Related papers (2024-06-04T14:35:27Z) - Sample-Efficient Linear Regression with Self-Selection Bias [7.605563562103568]
We consider the problem of linear regression with self-selection bias in the unknown-index setting.
We provide a novel and near optimally sample-efficient (in terms of $k$) algorithm to recover $mathbfw_1,ldots,mathbfw_kin.
Our algorithm succeeds under significantly relaxed noise assumptions, and therefore also succeeds in the related setting of max-linear regression.
arXiv Detail & Related papers (2024-02-22T02:20:24Z) - Fast $(1+\varepsilon)$-Approximation Algorithms for Binary Matrix
Factorization [54.29685789885059]
We introduce efficient $(1+varepsilon)$-approximation algorithms for the binary matrix factorization (BMF) problem.
The goal is to approximate $mathbfA$ as a product of low-rank factors.
Our techniques generalize to other common variants of the BMF problem.
arXiv Detail & Related papers (2023-06-02T18:55:27Z) - Near-Optimal Learning of Extensive-Form Games with Imperfect Information [54.55092907312749]
We present the first line of algorithms that require only $widetildemathcalO((XA+YB)/varepsilon2)$ episodes of play to find an $varepsilon$-approximate Nash equilibrium in two-player zero-sum games.
This improves upon the best known sample complexity of $widetildemathcalO((X2A+Y2B)/varepsilon2)$ by a factor of $widetildemathcalO(maxX,
arXiv Detail & Related papers (2022-02-03T18:18:28Z) - Threshold Phenomena in Learning Halfspaces with Massart Noise [56.01192577666607]
We study the problem of PAC learning halfspaces on $mathbbRd$ with Massart noise under Gaussian marginals.
Our results qualitatively characterize the complexity of learning halfspaces in the Massart model.
arXiv Detail & Related papers (2021-08-19T16:16:48Z) - Private Stochastic Convex Optimization: Optimal Rates in $\ell_1$
Geometry [69.24618367447101]
Up to logarithmic factors the optimal excess population loss of any $(varepsilon,delta)$-differently private is $sqrtlog(d)/n + sqrtd/varepsilon n.$
We show that when the loss functions satisfy additional smoothness assumptions, the excess loss is upper bounded (up to logarithmic factors) by $sqrtlog(d)/n + (log(d)/varepsilon n)2/3.
arXiv Detail & Related papers (2021-03-02T06:53:44Z) - An Optimal Separation of Randomized and Quantum Query Complexity [67.19751155411075]
We prove that for every decision tree, the absolute values of the Fourier coefficients of a given order $ellsqrtbinomdell (1+log n)ell-1,$ sum to at most $cellsqrtbinomdell (1+log n)ell-1,$ where $n$ is the number of variables, $d$ is the tree depth, and $c>0$ is an absolute constant.
arXiv Detail & Related papers (2020-08-24T06:50:57Z) - Robust Gaussian Covariance Estimation in Nearly-Matrix Multiplication
Time [14.990725929840892]
We show an algorithm that runs in time $widetildeO(T(N, d) log kappa / mathrmpoly (varepsilon))$, where $T(N, d)$ is the time it takes to multiply a $d times N$ matrix by its transpose.
Our runtime matches that of the fastest algorithm for covariance estimation without outliers, up to poly-logarithmic factors.
arXiv Detail & Related papers (2020-06-23T20:21:27Z) - Model-Free Reinforcement Learning: from Clipped Pseudo-Regret to Sample
Complexity [59.34067736545355]
Given an MDP with $S$ states, $A$ actions, the discount factor $gamma in (0,1)$, and an approximation threshold $epsilon > 0$, we provide a model-free algorithm to learn an $epsilon$-optimal policy.
For small enough $epsilon$, we show an improved algorithm with sample complexity.
arXiv Detail & Related papers (2020-06-06T13:34:41Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.