Human strategic decision making in parametrized games
- URL: http://arxiv.org/abs/2104.14744v1
- Date: Fri, 30 Apr 2021 03:40:27 GMT
- Title: Human strategic decision making in parametrized games
- Authors: Sam Ganzfried
- Abstract summary: We present a new framework that enables human decision makers to make fast decisions without the aid of real-time solvers.
We demonstrate applicability to a variety of situations including settings with multiple players and imperfect information.
- Score: 4.264192013842095
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Many real-world games contain parameters which can affect payoffs, action
spaces, and information states. For fixed values of the parameters, the game
can be solved using standard algorithms. However, in many settings agents must
act without knowing the values of the parameters that will be encountered in
advance. Often the decisions must be made by a human under time and resource
constraints, and it is unrealistic to assume that a human can solve the game in
real time. We present a new framework that enables human decision makers to
make fast decisions without the aid of real-time solvers. We demonstrate
applicability to a variety of situations including settings with multiple
players and imperfect information.
Related papers
- People use fast, goal-directed simulation to reason about novel games [75.25089384921557]
We study how people reason about a range of simple but novel connect-n style board games.
We ask people to judge how fair and how fun the games are from very little experience.
arXiv Detail & Related papers (2024-07-19T07:59:04Z) - Imperfect-Recall Games: Equilibrium Concepts and Their Complexity [74.01381499760288]
We investigate optimal decision making under imperfect recall, that is, when an agent forgets information it once held before.
In the framework of extensive-form games with imperfect recall, we analyze the computational complexities of finding equilibria in multiplayer settings.
arXiv Detail & Related papers (2024-06-23T00:27:28Z) - Auto-Encoding Bayesian Inverse Games [36.06617326128679]
We consider the inverse game problem, in which some properties of the game are unknown a priori.
Existing maximum likelihood estimation approaches to solve inverse games provide only point estimates of unknown parameters.
We take a Bayesian perspective and construct posterior distributions of game parameters.
This structured VAE can be trained from an unlabeled dataset of observed interactions.
arXiv Detail & Related papers (2024-02-14T02:17:37Z) - Modeling Player Personality Factors from In-Game Behavior and Affective
Expression [17.01727448431269]
We explore possibilities to predict a series of player personality questionnaire metrics from recorded in-game behavior.
We predict a wide variety of personality metrics from seven established questionnaires across 62 players over 60 minute gameplay of a customized version of the role-playing game Fallout: New Vegas.
arXiv Detail & Related papers (2023-08-27T22:59:08Z) - Value-based Fast and Slow AI Nudging [37.53694593692918]
Nudging is a behavioral strategy aimed at influencing people's thoughts and actions.
In this paper, we propose and discuss a value-based AI-human collaborative framework where AI systems nudge humans by proposing decision recommendations.
arXiv Detail & Related papers (2023-07-14T20:57:27Z) - The Update-Equivalence Framework for Decision-Time Planning [78.44953498421854]
We introduce an alternative framework for decision-time planning that is not based on solving subgames, but rather on update equivalence.
We derive a provably sound search algorithm for fully cooperative games based on mirror descent and a search algorithm for adversarial games based on magnetic mirror descent.
arXiv Detail & Related papers (2023-04-25T20:28:55Z) - Decision-aid or Controller? Steering Human Decision Makers with
Algorithms [5.449173263947196]
We study a decision-aid algorithm that learns about the human decision maker and provides ''personalized recommendations'' to influence final decisions.
We discuss the potential applications of such algorithms and their social implications.
arXiv Detail & Related papers (2023-03-23T23:24:26Z) - Hardness of Independent Learning and Sparse Equilibrium Computation in
Markov Games [70.19141208203227]
We consider the problem of decentralized multi-agent reinforcement learning in Markov games.
We show that no algorithm attains no-regret in general-sum games when executed independently by all players.
We show that our lower bounds hold even for seemingly easier setting in which all agents are controlled by a centralized algorithm.
arXiv Detail & Related papers (2023-03-22T03:28:12Z) - Abstracting Imperfect Information Away from Two-Player Zero-Sum Games [85.27865680662973]
Nayyar et al. (2013) showed that imperfect information can be abstracted away from common-payoff games by having players publicly announce their policies as they play.
This work shows that certain regularized equilibria do not possess the aforementioned non-correspondence problem.
Because these regularized equilibria can be made arbitrarily close to Nash equilibria, our result opens the door to a new perspective to solving two-player zero-sum games.
arXiv Detail & Related papers (2023-01-22T16:54:06Z) - Model-Free Online Learning in Unknown Sequential Decision Making
Problems and Games [114.90723492840499]
In large two-player zero-sum imperfect-information games, modern extensions of counterfactual regret minimization (CFR) are currently the practical state of the art for computing a Nash equilibrium.
We formalize an online learning setting in which the strategy space is not known to the agent.
We give an efficient algorithm that achieves $O(T3/4)$ regret with high probability for that setting, even when the agent faces an adversarial environment.
arXiv Detail & Related papers (2021-03-08T04:03:24Z) - A Bandit Model for Human-Machine Decision Making with Private
Information and Opacity [16.665883787432858]
We show a two-player learning problem where one player is the machine and the other the human.
A lower bound quantifies the worst-case hardness of optimally advising a decision maker who is opaque.
An upper bound shows that a simple coordination strategy is nearly minimax optimal.
arXiv Detail & Related papers (2020-07-09T13:43:08Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.