Bounded rationality for relaxing best response and mutual consistency:
An information-theoretic model of partial self-reference
- URL: http://arxiv.org/abs/2106.15844v1
- Date: Wed, 30 Jun 2021 06:56:56 GMT
- Title: Bounded rationality for relaxing best response and mutual consistency:
An information-theoretic model of partial self-reference
- Authors: Benjamin Patrick Evans, Mikhail Prokopenko
- Abstract summary: This work focuses on some of the assumptions underlying rationality such as mutual consistency and best-response.
We consider ways to relax these assumptions using concepts from level-$k$ reasoning and quantal response equilibrium (QRE) respectively.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: While game theory has been transformative for decision-making, the
assumptions made can be overly restrictive in certain instances. In this work,
we focus on some of the assumptions underlying rationality such as mutual
consistency and best-response, and consider ways to relax these assumptions
using concepts from level-$k$ reasoning and quantal response equilibrium (QRE)
respectively. Specifically, we provide an information-theoretic two-parameter
model that can relax both mutual consistency and best-response, but can recover
approximations of level-$k$, QRE, or typical Nash equilibrium behaviour in the
limiting cases. The proposed approach is based on a recursive form of the
variational free energy principle, representing self-referential games as
(pseudo) sequential decisions. Bounds in player processing abilities are
captured as information costs, where future chains of reasoning are discounted,
implying a hierarchy of players where lower-level players have fewer processing
resources.
Related papers
- Barriers to Welfare Maximization with No-Regret Learning [68.66209476382213]
We prove lower bounds for computing a near-optimal $T$-sparse CCE.
In particular, we show that the inapproximability of maximum clique precludes attaining any non-trivial sparsity in time.
arXiv Detail & Related papers (2024-11-04T00:34:56Z) - Sequential Manipulation Against Rank Aggregation: Theory and Algorithm [119.57122943187086]
We leverage an online attack on the vulnerable data collection process.
From the game-theoretic perspective, the confrontation scenario is formulated as a distributionally robust game.
The proposed method manipulates the results of rank aggregation methods in a sequential manner.
arXiv Detail & Related papers (2024-07-02T03:31:21Z) - Imperfect-Recall Games: Equilibrium Concepts and Their Complexity [74.01381499760288]
We investigate optimal decision making under imperfect recall, that is, when an agent forgets information it once held before.
In the framework of extensive-form games with imperfect recall, we analyze the computational complexities of finding equilibria in multiplayer settings.
arXiv Detail & Related papers (2024-06-23T00:27:28Z) - Exploiting hidden structures in non-convex games for convergence to Nash
equilibrium [62.88214569402201]
A wide array of modern machine learning applications can be formulated as non-cooperative Nashlibria.
We provide explicit convergence guarantees for both deterministic and deterministic environments.
arXiv Detail & Related papers (2023-12-27T15:21:25Z) - The Computational Complexity of Single-Player Imperfect-Recall Games [37.550554344575]
We study extensive-form games with imperfect recall, such as the Sleeping Beauty problem or the Absent Driver game.
For such games, two natural equilibrium concepts have been proposed as alternative solution concepts to ex-ante optimality.
arXiv Detail & Related papers (2023-05-28T19:41:25Z) - Soft-Bellman Equilibrium in Affine Markov Games: Forward Solutions and
Inverse Learning [37.176741793213694]
We formulate a class of Markov games, termed affine Markov games, where an affine reward function couples the players' actions.
We introduce a novel solution concept, the soft-Bellman equilibrium, where each player is boundedly rational and chooses a soft-Bellman policy.
We then solve the inverse game problem of inferring the players' reward parameters from observed state-action trajectories via a projected-gradient algorithm.
arXiv Detail & Related papers (2023-03-31T22:50:47Z) - Evolutionary Strategies with Analogy Partitions in p-guessing Games [0.0]
We introduce an evolutionary process of learning to investigate the dynamics of learning in unstable p-guessing games environments.
We show that our genetic algorithm behaves consistently with previous results in persistent environments, converging to the Nash equilibrium.
arXiv Detail & Related papers (2021-03-26T10:28:23Z) - Model-Free Online Learning in Unknown Sequential Decision Making
Problems and Games [114.90723492840499]
In large two-player zero-sum imperfect-information games, modern extensions of counterfactual regret minimization (CFR) are currently the practical state of the art for computing a Nash equilibrium.
We formalize an online learning setting in which the strategy space is not known to the agent.
We give an efficient algorithm that achieves $O(T3/4)$ regret with high probability for that setting, even when the agent faces an adversarial environment.
arXiv Detail & Related papers (2021-03-08T04:03:24Z) - Hindsight and Sequential Rationality of Correlated Play [18.176128899338433]
We look at algorithms that ensure strong performance in hindsight relative to what could have been achieved with modified behavior.
We develop and advocate for this hindsight framing of learning in general sequential decision-making settings.
We present examples illustrating the distinct strengths and weaknesses of each type of equilibrium in the literature.
arXiv Detail & Related papers (2020-12-10T18:30:21Z) - A modified axiomatic foundation of the analytic hierarchy process [7.827025090754844]
This paper reports a modified axiomatic foundation of the analytic hierarchy process (AHP)
The novel concept of reciprocal symmetry breaking is proposed to characterize the considered situation without reciprocal property.
Some results are derived from the new axioms involving the new concept of approximate consistency.
arXiv Detail & Related papers (2020-07-06T00:03:44Z) - Invariant Rationalization [84.1861516092232]
A typical rationalization criterion, i.e. maximum mutual information (MMI), finds the rationale that maximizes the prediction performance based only on the rationale.
We introduce a game-theoretic invariant rationalization criterion where the rationales are constrained to enable the same predictor to be optimal across different environments.
We show both theoretically and empirically that the proposed rationales can rule out spurious correlations, generalize better to different test scenarios, and align better with human judgments.
arXiv Detail & Related papers (2020-03-22T00:50:27Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.