Fugu-MT 論文翻訳(概要): Solving Random Parity Games in Polynomial Time

論文の概要: Solving Random Parity Games in Polynomial Time

arxiv url: http://arxiv.org/abs/2007.08387v1
Date: Thu, 16 Jul 2020 15:05:51 GMT
ステータス: 翻訳完了
システム内更新日: 2022-11-09 23:52:21.828426
Title: Solving Random Parity Games in Polynomial Time
Title（参考訳）: 多項式時間でランダムパリティゲームを解く
Authors: Richard Combes and Mikael Touati
Abstract要約: パリティゲームは高い確率で時間$O(cal V|)$で解けることを示す。また、非スパースゲームは高い確率で時間$O(cal V|)$で解けることを示し、$d=2$の場合の硬さに関する予想を出力する。
参考スコア（独自算出の注目度）: 4.715993310546794
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: We consider the problem of solving random parity games. We prove that parity games exibit a phase transition threshold above $d_P$, so that when the degree of the graph that defines the game has a degree $d > d_P$ then there exists a polynomial time algorithm that solves the game with high probability when the number of nodes goes to infinity. We further propose the SWCP (Self-Winning Cycles Propagation) algorithm and show that, when the degree is large enough, SWCP solves the game with high probability. Furthermore, the complexity of SWCP is polynomial $O\Big(|{\cal V}|^2 + |{\cal V}||{\cal E}|\Big)$. The design of SWCP is based on the threshold for the appearance of particular types of cycles in the players' respective subgraphs. We further show that non-sparse games can be solved in time $O(|{\cal V}|)$ with high probability, and emit a conjecture concerning the hardness of the $d=2$ case.
Abstract（参考訳）: ランダムなパリティゲームを解決する問題を考察する。我々は、パリティゲームが$d_p$以上の位相遷移しきい値を示すことを証明し、ゲームを定義するグラフの次数が $d > d_p$ であるとき、ノード数が無限大となると高い確率でゲームを解く多項式時間アルゴリズムが存在することを証明する。さらに,scp (self-winning cycles propagation) アルゴリズムを提案し,scpが十分に大きい場合には高い確率で解くことを示す。さらに、SWCP の複雑性は多項式 $O\Big(|{\cal V}|^2 + |{\cal V}||{\cal E}|\Big)$ である。 swcpの設計は、プレイヤーの各サブグラフにおける特定の種類のサイクルの出現のしきい値に基づいている。さらに、非スパースゲームは高い確率で時間$o(|{\cal v}|)$で解くことができ、$d=2$ ケースの硬さに関する推測を出力できることを示した。

関連論文リスト

Quantum strategies, error bounds, optimality, and duality gaps for multiplayer XOR, $\mathrm{XOR}^{*}$, compiled XOR, $\mathrm{XOR}^{*}$, and strong parallel repetiton of XOR, $\mathrm{XOR}^{*}$, and FFL games [0.0]
我々は、プレイヤーが量子戦略を用いて操作できるゲームの正確で近似的な最適性を特徴づける。我々は、量子優位性のための提案された情報源として、他の可能な戦略の変種を記述することで、この取り組みを締めくくる。
論文参考訳（メタデータ） (2025-05-09T03:47:41Z)
Instance-Dependent Regret Bounds for Learning Two-Player Zero-Sum Games with Bandit Feedback [60.610120215789976]
純粋な戦略ナッシュ均衡が存在するとき、$c$ は 0 となり、最適のインスタンス依存後悔境界となることを示す。また,本アルゴリズムは最終段階の収束性も享受し,ほぼ最適サンプルを用いて純粋な戦略ナッシュ均衡を同定することができる。
論文参考訳（メタデータ） (2025-02-24T20:20:06Z)
Measuring quantum relative entropy with finite-size effect [53.64687146666141]
相対エントロピー$D(rho|sigma)$を$sigma$が知られているときに推定する。我々の推定器は次元$d$が固定されたときにCram'er-Rao型境界に達する。
論文参考訳（メタデータ） (2024-06-25T06:07:20Z)
Fast Last-Iterate Convergence of Learning in Games Requires Forgetful Algorithms [71.73971094342349]
オンライン学習によるセルフプレイは、大規模な2人プレイのゼロサムゲームを解くための重要な方法の1つだ。我々は,OMWUが支払行列のサイズに対数依存するなど,いくつかの利点があることを示した。我々は、過去のことをすぐに忘れない幅広い種類のアルゴリズムが、すべて同じ問題に悩まされていることを証明している。
論文参考訳（メタデータ） (2024-06-15T13:26:17Z)
Nearly-Optimal Consensus Tolerating Adaptive Omissions: Why is a Lot of Randomness Needed? [4.465134753953128]
同期分散システムにおいて,通信リンクから障害当事者への通信がメッセージの送信を省略できる場合,$n$の自律的パーティによる合意に達するという問題について検討する。我々は、$O(sqrtnlog2 n)$ラウンドで動作し、$O(n2log3 n)$通信ビットを送信するランダム化アルゴリズムを設計する。 MCアルゴリズムが$O(R)$呼び出しを使用する場合、$Omega(fracn2maxR,nlog n)$ラウンドで動作しないことを示す。
論文参考訳（メタデータ） (2024-05-08T02:17:10Z)
Horizon-free Reinforcement Learning in Adversarial Linear Mixture MDPs [72.40181882916089]
我々のアルゴリズムが $tildeObig((d+log (|mathcalS|2 |mathcalA|))sqrtKbig)$ regret with full-information feedback, where $d$ is the dimension of a known feature mapping is linearly parametrizing the unknown transition kernel of the MDP, $K$ is the number of episodes, $|mathcalS|$ and $|mathcalA|$ is the standardities of the state and action space。
論文参考訳（メタデータ） (2023-05-15T05:37:32Z)
Predicting Winning Regions in Parity Games via Graph Neural Networks (Extended Abstract) [68.8204255655161]
グラフニューラルネットワークを用いてパリティゲームの勝利領域を決定するための不完全時間的アプローチを提案する。これは、データセットの60%の勝利領域を正しく決定し、残りの領域で小さなエラーしか発生しない。
論文参考訳（メタデータ） (2022-10-18T15:10:25Z)
Satisfiability Phase Transtion for Random Quantum 3XOR Games [0.0]
ランダムに生成された3XORゲームの動作を,多数の質問に対して数値的に検討する。我々の実験は、ランダムに生成されたゲームが擬テレパシーである確率が 1 から遠く離れていることを強く示唆している。また、ランダムに生成された3XORゲームが量子および古典的な「相転移」の両方を実行しているという強い証拠も見つかる。
論文参考訳（メタデータ） (2022-09-10T12:54:53Z)
Near-Optimal Learning of Extensive-Form Games with Imperfect Information [54.55092907312749]
本稿では,2プレイヤーゼロサムゲームにおいて,$widetildemathcalO((XA+YB)/varepsilon2)$プレイのエピソードのみを必要とするアルゴリズムの最初の行を,$varepsilon$-approximate Nash平衡を求める。これにより$widetildemathcalO((X2A+Y2B)/varepsilon2)$が$widetildemathcalO(maxX,
論文参考訳（メタデータ） (2022-02-03T18:18:28Z)
The Parallel Reversible Pebbling Game: Analyzing the Post-Quantum Security of iMHFs [6.305950347749109]
本稿では,量子コンピューティングにおけるNo-Deletion Theoremによる追加制約を捉える並列可逆ペブブリングゲームを提案する。我々は、いくつかの重要なグラフの可逆空間時間複雑性を解析するために、新しい可逆ペブブリングゲームを適用した。
論文参考訳（メタデータ） (2021-10-08T15:24:31Z)
When Can We Learn General-Sum Markov Games with a Large Number of Players Sample-Efficiently? [10.397170312149067]
本稿では,m$-player General-sum Markovゲームの設定において,学習目標がより優れたサンプル複雑性を許容するものについて検討する。まず,$epsilon$-Coarse Correlated Equilibrium (CCE)を$widetildemathcalO(H5Smax_ile m A_i / epsilon2)$ episodesで学習するアルゴリズムを設計する。第二に、マルコフポテンシャルゲームの重要な特別な場合を考え、$widet内で$epsilon$-approximate Nash平衡を学ぶアルゴリズムを設計する。
論文参考訳（メタデータ） (2021-10-08T15:06:22Z)
Near-Optimal Reinforcement Learning with Self-Play [50.29853537456737]
我々は,直接の監督なしに自己対決で最適な政策を学習するセルフプレイアルゴリズムに焦点をあてる。本稿では,サンプル複雑性を$tildemathcalO(SAB)$,サンプル複雑性を$tildemathcalO(S(A+B)$とする新しいemphNash Vラーニングアルゴリズムを提案する。
論文参考訳（メタデータ） (2020-06-22T05:00:13Z)
Quasi-polynomial time algorithms for free quantum games in bounded dimension [11.56707165033]
2プレイヤフリーゲームの値に対する加法$epsilon$-approximationsを計算するために、$exp(mathcalObig(T12(log2(AT)+log(Q)log(AT))/epsilon2big))という半定値プログラムを与える。量子分離性問題と接続し、線形制約を伴う改良された多部量子デ・フィネッティ定理を用いる。
論文参考訳（メタデータ） (2020-05-18T16:55:08Z)

関連論文リストは本サイト内にある論文のタイトル・アブストラクトから自動的に作成しています。

指定された論文の情報です。
本サイトの運営者は本サイト（すべての情報・翻訳含む）の品質を保証せず、本サイト（すべての情報・翻訳含む）を使用して発生したあらゆる結果について一切の責任を負いません。