Three-Player Game Training Dynamics
- URL: http://arxiv.org/abs/2208.06531v1
- Date: Fri, 12 Aug 2022 23:57:44 GMT
- Title: Three-Player Game Training Dynamics
- Authors: Kenneth Christofferson and Fernando J. Yanez
- Abstract summary: We explore three-player game training dynamics using an extended version of a simplified bilinear smooth game.
We find that trilinear games do not converge on the Nash equilibrium in most cases.
In addition to alternating and simultaneous updates, we explore a new update order--maximizer-first---which is only possible in a three-player game.
- Score: 77.34726150561087
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: This work explores three-player game training dynamics, under what conditions
three-player games converge and the equilibria the converge on. In contrast to
prior work, we examine a three-player game architecture in which all players
explicitly interact with each other. Prior work analyzes games in which two of
three agents interact with only one other player, constituting dual two-player
games. We explore three-player game training dynamics using an extended version
of a simplified bilinear smooth game, called a simplified trilinear smooth
game. We find that trilinear games do not converge on the Nash equilibrium in
most cases, rather converging on a fixed point which is optimal for two
players, but not for the third. Further, we explore how the order of the
updates influences convergence. In addition to alternating and simultaneous
updates, we explore a new update order--maximizer-first--which is only possible
in a three-player game. We find that three-player games can converge on a Nash
equilibrium using maximizer-first updates. Finally, we experiment with
differing momentum values for each player in a trilinear smooth game under all
three update orders and show that maximizer-first updates achieve more optimal
results in a larger set of player-specific momentum value triads than other
update orders.
Related papers
- Leading the Pack: N-player Opponent Shaping [52.682734939786464]
We extend Opponent Shaping (OS) methods to environments involving multiple co-players and multiple shaping agents.
We find that when playing with a large number of co-players, OS methods' relative performance reduces, suggesting that in the limit OS methods may not perform well.
arXiv Detail & Related papers (2023-12-19T20:01:42Z) - On the Convergence of No-Regret Learning Dynamics in Time-Varying Games [89.96815099996132]
We characterize the convergence of optimistic gradient descent (OGD) in time-varying games.
Our framework yields sharp convergence bounds for the equilibrium gap of OGD in zero-sum games.
We also provide new insights on dynamic regret guarantees in static games.
arXiv Detail & Related papers (2023-01-26T17:25:45Z) - Learning Correlated Equilibria in Mean-Field Games [62.14589406821103]
We develop the concepts of Mean-Field correlated and coarse-correlated equilibria.
We show that they can be efficiently learnt in emphall games, without requiring any additional assumption on the structure of the game.
arXiv Detail & Related papers (2022-08-22T08:31:46Z) - Evolutionary Game-Theoretical Analysis for General Multiplayer
Asymmetric Games [22.753799819424785]
We fill the gap between payoff table and dynamic analysis without any inaccuracy.
We compare our method with the state-of-the-art in some classic games.
arXiv Detail & Related papers (2022-06-22T14:06:23Z) - Towards convergence to Nash equilibria in two-team zero-sum games [17.4461045395989]
Two-team zero-sum games are defined as multi-player games where players are split into two competing sets of agents.
We focus on the solution concept of Nash equilibria (NE)
We show that computing NE for this class of games is $textithard$ for the complexity class $mathrm$.
arXiv Detail & Related papers (2021-11-07T21:15:35Z) - Computing Nash Equilibria in Multiplayer DAG-Structured Stochastic Games
with Persistent Imperfect Information [1.7132914341329848]
We present an algorithm for approximating Nash equilibrium in multiplayer general-suming games with persistent imperfect information.
Using a new procedure, we are able to demonstrate that our algorithm computes a strategy that closely approximates Nash equilibrium in this game.
arXiv Detail & Related papers (2020-10-26T19:27:26Z) - Exponential Convergence of Gradient Methods in Concave Network Zero-sum
Games [6.129776019898013]
We study the computation of Nash equilibrium in concave network zero-sum games (NZSGs)
We show that various game theoretic properties of convex-concave two-player zero-sum games are preserved in this generalization.
arXiv Detail & Related papers (2020-07-10T16:56:56Z) - Chaos, Extremism and Optimism: Volume Analysis of Learning in Games [55.24050445142637]
We present volume analyses of Multiplicative Weights Updates (MWU) and Optimistic Multiplicative Weights Updates (OMWU) in zero-sum as well as coordination games.
We show that OMWU contracts volume, providing an alternative understanding for its known convergent behavior.
We also prove a no-free-lunch type of theorem, in the sense that when examining coordination games the roles are reversed: OMWU expands volume exponentially fast, whereas MWU contracts.
arXiv Detail & Related papers (2020-05-28T13:47:09Z) - From Poincar\'e Recurrence to Convergence in Imperfect Information
Games: Finding Equilibrium via Regularization [49.368421783733815]
We show how adapting the reward can give strong convergence guarantees in monotone games.
We also show how this reward adaptation technique can be leveraged to build algorithms that converge exactly to the Nash equilibrium.
arXiv Detail & Related papers (2020-02-19T21:36:58Z) - Empirical Analysis of Fictitious Play for Nash Equilibrium Computation in Multiplayer Games [0.4895118383237099]
fictitious play is guaranteed to converge to Nash equilibrium in certain game classes, such as two-player zero-sum games.
We show that fictitious play in fact leads to improved Nash equilibrium approximation over a variety of game classes and sizes.
arXiv Detail & Related papers (2020-01-30T03:47:09Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.