Revisiting Game Representations: The Hidden Costs of Efficiency in
Sequential Decision-making Algorithms
- URL: http://arxiv.org/abs/2112.10890v3
- Date: Tue, 5 Dec 2023 22:12:46 GMT
- Title: Revisiting Game Representations: The Hidden Costs of Efficiency in
Sequential Decision-making Algorithms
- Authors: Vojt\v{e}ch Kova\v{r}\'ik, David Milec, Michal \v{S}ustr, Dominik
Seitz, Viliam Lis\'y
- Abstract summary: Recent advancements in algorithms for sequential decision-making under imperfect information have shown remarkable success in large games.
These algorithms traditionally formalize the games using the extensive-form game formalism.
We show that a popular workaround involves using a specialized representation based on player specific information-state trees.
- Score: 0.6749750044497732
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Recent advancements in algorithms for sequential decision-making under
imperfect information have shown remarkable success in large games such as
limit- and no-limit poker. These algorithms traditionally formalize the games
using the extensive-form game formalism, which, as we show, while theoretically
sound, is memory-inefficient and computationally intensive in practice. To
mitigate these challenges, a popular workaround involves using a specialized
representation based on player specific information-state trees. However, as we
show, this alternative significantly narrows the set of games that can be
represented efficiently.
In this study, we identify the set of large games on which modern algorithms
have been benchmarked as being naturally represented by Sequential Bayesian
Games. We elucidate the critical differences between extensive-form game and
sequential Bayesian game representations, both theoretically and empirically.
We further argue that the impressive experimental results often cited in the
literature may be skewed, as they frequently stem from testing these algorithms
only on this restricted class of games. By understanding these nuances, we aim
to guide future research in developing more universally applicable and
efficient algorithms for sequential decision-making under imperfect
information.
Related papers
- History Filtering in Imperfect Information Games: Algorithms and
Complexity [16.23892847804002]
We introduce and analyze the computational aspects and tractability of filtering histories for subgame decomposition.
We show that constructing a single history from the root of the subgame is generally intractable.
We also introduce a novel Markov Chain Monte Carlo-based generation algorithm for trick-taking card games.
arXiv Detail & Related papers (2023-11-24T18:34:36Z) - The Update-Equivalence Framework for Decision-Time Planning [78.44953498421854]
We introduce an alternative framework for decision-time planning that is not based on solving subgames, but rather on update equivalence.
We derive a provably sound search algorithm for fully cooperative games based on mirror descent and a search algorithm for adversarial games based on magnetic mirror descent.
arXiv Detail & Related papers (2023-04-25T20:28:55Z) - Hardness of Independent Learning and Sparse Equilibrium Computation in
Markov Games [70.19141208203227]
We consider the problem of decentralized multi-agent reinforcement learning in Markov games.
We show that no algorithm attains no-regret in general-sum games when executed independently by all players.
We show that our lower bounds hold even for seemingly easier setting in which all agents are controlled by a centralized algorithm.
arXiv Detail & Related papers (2023-03-22T03:28:12Z) - No-Regret Learning in Time-Varying Zero-Sum Games [99.86860277006318]
Learning from repeated play in a fixed zero-sum game is a classic problem in game theory and online learning.
We develop a single parameter-free algorithm that simultaneously enjoys favorable guarantees under three performance measures.
Our algorithm is based on a two-layer structure with a meta-algorithm learning over a group of black-box base-learners satisfying a certain property.
arXiv Detail & Related papers (2022-01-30T06:10:04Z) - Public Information Representation for Adversarial Team Games [31.29335755664997]
adversarial team games reside in the asymmetric information available to the team members during the play.
Our algorithms convert a sequential team game with adversaries to a classical two-player zero-sum game.
Due to the NP-hard nature of the problem, the resulting Public Team game may be exponentially larger than the original one.
arXiv Detail & Related papers (2022-01-25T15:07:12Z) - Spatial State-Action Features for General Games [5.849736173068868]
We formulate a design and efficient implementation of spatial state-action features for general games.
These are patterns that can be trained to incentivise or disincentivise actions based on whether or not they match variables of the state in a local area.
We propose an efficient approach for evaluating active features for any given set of features.
arXiv Detail & Related papers (2022-01-17T13:34:04Z) - Student of Games: A unified learning algorithm for both perfect and
imperfect information games [22.97853623156316]
Student of Games is an algorithm that combines guided search, self-play learning, and game-theoretic reasoning.
We prove that Student of Games is sound, converging to perfect play as available computation and approximation capacity increases.
Student of Games reaches strong performance in chess and Go, beats the strongest openly available agent in heads-up no-limit Texas hold'em poker, and defeats the state-of-the-art agent in Scotland Yard.
arXiv Detail & Related papers (2021-12-06T17:16:24Z) - Last-iterate Convergence in Extensive-Form Games [49.31256241275577]
We study last-iterate convergence of optimistic algorithms in sequential games.
We show that all of these algorithms enjoy last-iterate convergence, with some of them even converging exponentially fast.
arXiv Detail & Related papers (2021-06-27T22:02:26Z) - Generating Diverse and Competitive Play-Styles for Strategy Games [58.896302717975445]
We propose Portfolio Monte Carlo Tree Search with Progressive Unpruning for playing a turn-based strategy game (Tribes)
We show how it can be parameterized so a quality-diversity algorithm (MAP-Elites) is used to achieve different play-styles while keeping a competitive level of play.
Our results show that this algorithm is capable of achieving these goals even for an extensive collection of game levels beyond those used for training.
arXiv Detail & Related papers (2021-04-17T20:33:24Z) - Learning to Play Sequential Games versus Unknown Opponents [93.8672371143881]
We consider a repeated sequential game between a learner, who plays first, and an opponent who responds to the chosen action.
We propose a novel algorithm for the learner when playing against an adversarial sequence of opponents.
Our results include algorithm's regret guarantees that depend on the regularity of the opponent's response.
arXiv Detail & Related papers (2020-07-10T09:33:05Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.