Distributed No-Regret Learning in Multi-Agent Systems
- URL: http://arxiv.org/abs/2002.09047v1
- Date: Thu, 20 Feb 2020 22:30:17 GMT
- Title: Distributed No-Regret Learning in Multi-Agent Systems
- Authors: Xiao Xu, Qing Zhao
- Abstract summary: Four emerging game characteristics challenge canonical game models are explored.
For each of the four characteristics, we illuminate its implications and ramifications in game modeling, notions of regret, feasible game outcomes, and the design and analysis of distributed learning algorithms.
- Score: 12.111429383532888
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In this tutorial article, we give an overview of new challenges and
representative results on distributed no-regret learning in multi-agent systems
modeled as repeated unknown games. Four emerging game
characteristics---dynamicity, incomplete and imperfect feedback, bounded
rationality, and heterogeneity---that challenge canonical game models are
explored. For each of the four characteristics, we illuminate its implications
and ramifications in game modeling, notions of regret, feasible game outcomes,
and the design and analysis of distributed learning algorithms.
Related papers
- Reinforcement Learning, Collusion, and the Folk Theorem [0.0]
We explore the behaviour emerging from learning agents repeatedly interacting strategically for a wide range of learning dynamics.
We consider the setting of a general repeated game with finite recall, for different forms of monitoring.
We obtain a Folk Theorem-like result and characterise the set of payoff vectors that can be obtained by these dynamics.
arXiv Detail & Related papers (2024-11-19T18:45:55Z) - Polymath: A Challenging Multi-modal Mathematical Reasoning Benchmark [53.61633384281524]
PolyMATH is a benchmark aimed at evaluating the general cognitive reasoning abilities of MLLMs.
The best scores achieved on PolyMATH are 41%, 36%, and 27%, obtained by Claude-3.5 Sonnet, GPT-4o and Gemini-1.5 Pro respectively.
A further fine-grained error analysis reveals that these models struggle to understand spatial relations and perform drawn-out, high-level reasoning.
arXiv Detail & Related papers (2024-10-06T20:35:41Z) - Tractable Equilibrium Computation in Markov Games through Risk Aversion [12.980882140751895]
We show that risk-averse quantal response equilibria (RQE) become tractable to compute in all $n$-player matrix and finite-horizon Markov games.
RQE is independent of the underlying game structure and only depends on agents' degree of risk-aversion and bounded rationality.
arXiv Detail & Related papers (2024-06-20T09:53:56Z) - CNN-based explanation ensembling for dataset, representation and explanations evaluation [1.1060425537315088]
We explore the potential of ensembling explanations generated by deep classification models using convolutional model.
Through experimentation and analysis, we aim to investigate the implications of combining explanations to uncover a more coherent and reliable patterns of the model's behavior.
arXiv Detail & Related papers (2024-04-16T08:39:29Z) - Unified View of Grokking, Double Descent and Emergent Abilities: A
Perspective from Circuits Competition [83.13280812128411]
Recent studies have uncovered intriguing phenomena in deep learning, such as grokking, double descent, and emergent abilities in large language models.
We present a comprehensive framework that provides a unified view of these three phenomena, focusing on the competition between memorization and generalization circuits.
arXiv Detail & Related papers (2024-02-23T08:14:36Z) - Stochastic Delay Differential Games: Financial Modeling and Machine
Learning Algorithms [3.222802562733787]
We propose a numerical methodology for finding the closed-loop Nash equilibrium of delay differential games through deep learning.
These games are prevalent in finance and economics where multi-agent interaction and delayed effects are often desired features in a model.
arXiv Detail & Related papers (2023-07-12T21:02:45Z) - Accelerating exploration and representation learning with offline
pre-training [52.6912479800592]
We show that exploration and representation learning can be improved by separately learning two different models from a single offline dataset.
We show that learning a state representation using noise-contrastive estimation and a model of auxiliary reward can significantly improve the sample efficiency on the challenging NetHack benchmark.
arXiv Detail & Related papers (2023-03-31T18:03:30Z) - On the Convergence of No-Regret Learning Dynamics in Time-Varying Games [89.96815099996132]
We characterize the convergence of optimistic gradient descent (OGD) in time-varying games.
Our framework yields sharp convergence bounds for the equilibrium gap of OGD in zero-sum games.
We also provide new insights on dynamic regret guarantees in static games.
arXiv Detail & Related papers (2023-01-26T17:25:45Z) - MultiViz: An Analysis Benchmark for Visualizing and Understanding
Multimodal Models [103.9987158554515]
MultiViz is a method for analyzing the behavior of multimodal models by scaffolding the problem of interpretability into 4 stages.
We show that the complementary stages in MultiViz together enable users to simulate model predictions, assign interpretable concepts to features, perform error analysis on model misclassifications, and use insights from error analysis to debug models.
arXiv Detail & Related papers (2022-06-30T18:42:06Z) - Exploration-Exploitation in Multi-Agent Competition: Convergence with
Bounded Rationality [21.94743452608215]
We study smooth Q-learning, a prototypical learning model that captures the balance between game rewards and exploration costs.
We show that Q-learning always converges to the unique quantal-response equilibrium (QRE), the standard solution concept for games under bounded rationality.
arXiv Detail & Related papers (2021-06-24T11:43:38Z) - Explainable Recommender Systems via Resolving Learning Representations [57.24565012731325]
Explanations could help improve user experience and discover system defects.
We propose a novel explainable recommendation model through improving the transparency of the representation learning process.
arXiv Detail & Related papers (2020-08-21T05:30:48Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.