Suphx: Mastering Mahjong with Deep Reinforcement Learning
- URL: http://arxiv.org/abs/2003.13590v2
- Date: Wed, 1 Apr 2020 03:46:55 GMT
- Title: Suphx: Mastering Mahjong with Deep Reinforcement Learning
- Authors: Junjie Li, Sotetsu Koyamada, Qiwei Ye, Guoqing Liu, Chao Wang, Ruihan
Yang, Li Zhao, Tao Qin, Tie-Yan Liu, Hsiao-Wuen Hon
- Abstract summary: We design an AI for Mahjong, named Suphx, based on deep reinforcement learning with some newly introduced techniques.
Suphx has demonstrated stronger performance than most top human players in terms of stable rank.
This is the first time that a computer program outperforms most top human players in Mahjong.
- Score: 114.68233321904623
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Artificial Intelligence (AI) has achieved great success in many domains, and
game AI is widely regarded as its beachhead since the dawn of AI. In recent
years, studies on game AI have gradually evolved from relatively simple
environments (e.g., perfect-information games such as Go, chess, shogi or
two-player imperfect-information games such as heads-up Texas hold'em) to more
complex ones (e.g., multi-player imperfect-information games such as
multi-player Texas hold'em and StartCraft II). Mahjong is a popular
multi-player imperfect-information game worldwide but very challenging for AI
research due to its complex playing/scoring rules and rich hidden information.
We design an AI for Mahjong, named Suphx, based on deep reinforcement learning
with some newly introduced techniques including global reward prediction,
oracle guiding, and run-time policy adaptation. Suphx has demonstrated stronger
performance than most top human players in terms of stable rank and is rated
above 99.99% of all the officially ranked human players in the Tenhou platform.
This is the first time that a computer program outperforms most top human
players in Mahjong.
Related papers
- DanZero+: Dominating the GuanDan Game through Reinforcement Learning [95.90682269990705]
We develop an AI program for an exceptionally complex and popular card game called GuanDan.
We first put forward an AI program named DanZero for this game.
In order to further enhance the AI's capabilities, we apply policy-based reinforcement learning algorithm to GuanDan.
arXiv Detail & Related papers (2023-12-05T08:07:32Z) - DanZero: Mastering GuanDan Game with Reinforcement Learning [121.93690719186412]
Card game AI has always been a hot topic in the research of artificial intelligence.
In this paper, we are devoted to developing an AI program for a more complex card game, GuanDan.
We propose the first AI program DanZero for GuanDan using reinforcement learning technique.
arXiv Detail & Related papers (2022-10-31T06:29:08Z) - Mastering the Game of No-Press Diplomacy via Human-Regularized
Reinforcement Learning and Planning [95.78031053296513]
No-press Diplomacy is a complex strategy game involving both cooperation and competition.
We introduce a planning algorithm we call DiL-piKL that regularizes a reward-maximizing policy toward a human imitation-learned policy.
We show that DiL-piKL can be extended into a self-play reinforcement learning algorithm we call RL-DiL-piKL.
arXiv Detail & Related papers (2022-10-11T14:47:35Z) - Mastering the Game of Stratego with Model-Free Multiagent Reinforcement
Learning [86.37438204416435]
Stratego is one of the few iconic board games that Artificial Intelligence (AI) has not yet mastered.
Decisions in Stratego are made over a large number of discrete actions with no obvious link between action and outcome.
DeepNash beats existing state-of-the-art AI methods in Stratego and achieved a yearly (2022) and all-time top-3 rank on the Gravon games platform.
arXiv Detail & Related papers (2022-06-30T15:53:19Z) - AI in Games: Techniques, Challenges and Opportunities [40.86375378643978]
Various game AI systems (AIs) have been developed such as Libratus, OpenAI Five and AlphaStar, beating professional human players.
In this paper, we survey recent successful game AIs, covering board game AIs, card game AIs, first-person shooting game AIs and real time strategy game AIs.
arXiv Detail & Related papers (2021-11-15T09:35:53Z) - A User-Centred Framework for Explainable Artificial Intelligence in
Human-Robot Interaction [70.11080854486953]
We propose a user-centred framework for XAI that focuses on its social-interactive aspect.
The framework aims to provide a structure for interactive XAI solutions thought for non-expert users.
arXiv Detail & Related papers (2021-09-27T09:56:23Z) - ScrofaZero: Mastering Trick-taking Poker Game Gongzhu by Deep
Reinforcement Learning [2.7178968279054936]
We study Gongzhu, a trick-taking game analogous to, but slightly simpler than contract bridge.
We train a strong Gongzhu AI ScrofaZero from textittabula rasa by deep reinforcement learning.
We introduce new techniques for imperfect information game including stratified sampling, importance weighting, integral over equivalent class, Bayesian inference, etc.
arXiv Detail & Related papers (2021-02-15T12:01:44Z) - Towards Playing Full MOBA Games with Deep Reinforcement Learning [34.153341961273554]
MOBA games, e.g., Honor of Kings, League of Legends, and Dota 2, pose grand challenges to AI systems.
We propose a MOBA AI learning paradigm that methodologically enables playing full MOBA games with deep reinforcement learning.
arXiv Detail & Related papers (2020-11-25T12:52:33Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.