Fugu-MT paper translation (abstract): No-Regret Learning Dynamics for Extensive-Form Correlated Equilibrium

Paper summary: No-Regret Learning Dynamics for Extensive-Form Correlated Equilibrium

arxiv url: http://arxiv.org/abs/2004.00603v5
Date: Fri, 2 Sep 2022 16:09:00 GMT
Status: translation complete
Last updated in system: 2022-12-17 18:20:24.747452
Title: No-Regret Learning Dynamics for Extensive-Form Correlated Equilibrium
Title (reference translation): No-Regret Learning Dynamics for Extensive-Form Correlated Equilibrium
Authors: Andrea Celli, Alberto Marchesi, Gabriele Farina, Nicola Gatti
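Abstract summary: The paper gives the first uncoupled no-regret dynamics that converge to the set of extensive-form correlated equilibria (EFCEs) in extensive-form games. It introduces a notion of trigger regret for extensive-form games, which extends internal regret in normal-form games. The proposed algorithm decomposes trigger regret into local subproblems at each decision point and builds the player's global strategy from the local solutions.
Reference score (independently computed attention score): 76.78447814623665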
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: The existence of simple, uncoupled no-regret dynamics that converge to correlated equilibria in normal-form games is a celebrated result in the theory of multi-agent systems. Specifically, it has been known for more than 20 years that when all players seek to minimize their internal regret in a repeated normal-form game, the empirical frequency of play converges to a normal-form correlated equilibrium. Extensive-form (that is, tree-form) games generalize normal-form games by modeling both sequential and simultaneous moves, as well as private information. Because of the sequential nature and presence of partial information in the game, extensive-form correlation has significantly different properties than the normal-form counterpart, many of which are still open research directions. Extensive-form correlated equilibrium (EFCE) has been proposed as the natural extensive-form counterpart to normal-form correlated equilibrium. However, it was previously unknown whether EFCE emerges as the result of uncoupled agent dynamics. In this paper, we give the first uncoupled no-regret dynamics that converge to the set of EFCEs in $n$-player general-sum extensive-form games with perfect recall. First, we introduce a notion of trigger regret in extensive-form games, which extends that of internal regret in normal-form games. When each player has low trigger regret, the empirical frequency of play is close to an EFCE. Then, we give an efficient no-trigger-regret algorithm. Our algorithm decomposes trigger regret into local subproblems at each decision point for the player, and constructs a global strategy of the player from the local solutions at each decision point.
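Abstract (reference translation): The existence of simple, uncoupled no-regret dynamics that converge to correlated equilibria in normal-form games is a well-known result in the theory of multi-agent systems. In particular, it has been known for more than 20 years that when every player tries to minimize internal regret in a repeated normal-form game, the empirical frequency of play converges to a normal-form correlated equilibrium. Extensive-form games (that is, tree-form games) generalize normal-form games by modeling sequential and simultaneous moves as well as private information. Because of the sequential nature of the game and the presence of partial information, extensive-form correlation has properties that differ substantially from the normal-form case, and many of them remain open research directions. Extensive-form correlated equilibrium (EFCE) has been proposed as the natural extensive-form counterpart to normal-form correlated equilibrium. Until this work, however, it was unknown whether EFCE emerges as the result of uncoupled agent dynamics. This paper presents the first uncoupled no-regret dynamics that converge to the set of EFCEs in $n$-player general-sum extensive-form games with perfect recall. First, it introduces the notion of trigger regret in extensive-form games, which extends the notion of internal regret in normal-form games. When each player has low trigger regret, the empirical frequency of play is close to an EFCE. Next, it proposes an efficient no-trigger-regret algorithm. The algorithm decomposes trigger regret into local subproblems at each decision point and builds the player's global strategy from the local solutions at each decision point.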
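
As an illustration of the normal-form notion that the abstract builds on, the following minimal sketch computes a player's internal regret from a history of play in a repeated two-player normal-form game. It is not the paper's trigger-regret algorithm. The function names, the `coordination_utility` payoff, and the sample history are all hypothetical and chosen only for this example.

```python
from itertools import permutations


def internal_regret(history, utility, actions):
    """Cumulative internal regret for each ordered pair (played, alternative).

    history: list of (own_action, opponent_action) pairs, one per round.
    utility: function (own_action, opponent_action) -> payoff of this player.
    actions: the player's available actions.
    """
    regret = {pair: 0.0 for pair in permutations(actions, 2)}
    for own, opp in history:
        for alt in actions:
            if alt != own:
                # Gain from having played `alt` in the rounds where `own` was played.
                regret[(own, alt)] += utility(alt, opp) - utility(own, opp)
    return regret


def max_average_internal_regret(history, utility, actions):
    """Largest per-round internal regret over all swaps, floored at zero."""
    regret = internal_regret(history, utility, actions)
    return max(0.0, max(regret.values(), default=0.0)) / len(history)


# Hypothetical example: a coordination game where matching the opponent pays 1.
def coordination_utility(own, opp):
    return 1.0 if own == opp else 0.0


actions = ["L", "R"]
history = [("L", "R"), ("L", "R"), ("R", "R"), ("L", "L")]

print(internal_regret(history, coordination_utility, actions))
print(max_average_internal_regret(history, coordination_utility, actions))
```

In this sample history, swapping every `L` for `R` would have gained one unit of payoff over four rounds, so the average internal regret is 0.25. The abstract's claim concerns the regime where this quantity goes to zero for every player as the number of rounds grows.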

Related papers