A Generalized Extensive-Form Fictitious Play Algorithm
- URL: http://arxiv.org/abs/2310.09658v1
- Date: Sat, 14 Oct 2023 20:18:49 GMT
- Title: A Generalized Extensive-Form Fictitious Play Algorithm
- Authors: Tim P. Schulze
- Abstract summary: We introduce a simple extensive-form algorithm for finding equilibria of two-player, zero-sum games.
We compare its performance to that of a similar extensive-form fictitious play algorithm and a counter-factual regret minimization algorithm.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: We introduce a simple extensive-form algorithm for finding equilibria of
two-player, zero-sum games. The algorithm is realization equivalent to a
generalized form of Fictitious Play. We compare its performance to that of a
similar extensive-form fictitious play algorithm and a counter-factual regret
minimization algorithm. All three algorithms share the same advantages over
normal-form fictitious play in terms of reducing storage requirements and
computational complexity. The new algorithm is intuitive and straightforward to
implement, making it an appealing option for those looking for a quick and easy
game solving tool.
Related papers
- Online Learning and Solving Infinite Games with an ERM Oracle [20.1330044382824]
We propose an algorithm for online binary classification setting that relies solely on ERM oracle calls.
We show that it has finite regret in the realizable setting and sublinearly growing regret in the agnostic setting.
Our algorithms apply to both binary-valued and real-valued games and can be viewed as providing justification for the wide use of double oracle and multiple oracle algorithms in the practice of solving large games.
arXiv Detail & Related papers (2023-07-04T12:51:21Z) - A Fully Single Loop Algorithm for Bilevel Optimization without Hessian
Inverse [121.54116938140754]
We propose a new Hessian inverse free Fully Single Loop Algorithm for bilevel optimization problems.
We show that our algorithm converges with the rate of $O(epsilon-2)$.
arXiv Detail & Related papers (2021-12-09T02:27:52Z) - Practical, Provably-Correct Interactive Learning in the Realizable
Setting: The Power of True Believers [12.09273192079783]
We consider interactive learning in the realizable setting and develop a general framework to handle problems ranging from best arm identification to active classification.
We design novel computationally efficient algorithms for the realizable setting that match the minimax lower bound up to logarithmic factors.
arXiv Detail & Related papers (2021-11-09T02:33:36Z) - Towards General Function Approximation in Zero-Sum Markov Games [126.58493169301012]
This paper considers two-player zero-sum finite-horizon Markov games with simultaneous moves.
Provably efficient algorithms for both decoupled and coordinated settings are developed.
arXiv Detail & Related papers (2021-07-30T15:25:13Z) - Last-iterate Convergence in Extensive-Form Games [49.31256241275577]
We study last-iterate convergence of optimistic algorithms in sequential games.
We show that all of these algorithms enjoy last-iterate convergence, with some of them even converging exponentially fast.
arXiv Detail & Related papers (2021-06-27T22:02:26Z) - Minimax Optimization with Smooth Algorithmic Adversaries [59.47122537182611]
We propose a new algorithm for the min-player against smooth algorithms deployed by an adversary.
Our algorithm is guaranteed to make monotonic progress having no limit cycles, and to find an appropriate number of gradient ascents.
arXiv Detail & Related papers (2021-06-02T22:03:36Z) - Provably Efficient Policy Gradient Methods for Two-Player Zero-Sum
Markov Games [95.70078702838654]
This paper studies natural extensions of Natural Policy Gradient algorithm for solving two-player zero-sum games.
We thoroughly characterize the algorithms' performance in terms of the number of samples, number of iterations, concentrability coefficients, and approximation error.
arXiv Detail & Related papers (2021-02-17T17:49:57Z) - Single-Timescale Stochastic Nonconvex-Concave Optimization for Smooth
Nonlinear TD Learning [145.54544979467872]
We propose two single-timescale single-loop algorithms that require only one data point each step.
Our results are expressed in a form of simultaneous primal and dual side convergence.
arXiv Detail & Related papers (2020-08-23T20:36:49Z) - Follow the Perturbed Leader: Optimism and Fast Parallel Algorithms for
Smooth Minimax Games [33.9383996530254]
We consider the problem of online learning and its application to solving minimax games.
For the online learning problem, Follow Perturbed Leader is a widely perturbed oracle setting which computes the best response.
arXiv Detail & Related papers (2020-06-13T02:55:41Z) - Fast Complete Algorithm for Multiplayer Nash Equilibrium [1.7132914341329848]
We describe a new complete algorithm for computing Nash equilibrium in multiplayer general-sum games.
We demonstrate that the algorithm runs significantly faster than the prior fastest complete algorithm on several game classes previously studied.
arXiv Detail & Related papers (2020-02-11T23:42:14Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.