Building a 3-Player Mahjong AI using Deep Reinforcement Learning
- URL: http://arxiv.org/abs/2202.12847v1
- Date: Fri, 25 Feb 2022 17:41:43 GMT
- Title: Building a 3-Player Mahjong AI using Deep Reinforcement Learning
- Authors: Xiangyu Zhao, Sean B. Holden
- Abstract summary: We present Meowjong, an AI for Sanma using deep reinforcement learning.
Meowjong's models achieve test accuracies comparable with AIs for 4-player Mahjong.
Being the first ever AI in Sanma, we claim that Meowjong stands as a state-of-the-art in this game.
- Score: 9.603486077267693
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Mahjong is a popular multi-player imperfect-information game developed in
China in the late 19th-century, with some very challenging features for AI
research. Sanma, being a 3-player variant of the Japanese Riichi Mahjong,
possesses unique characteristics including fewer tiles and, consequently, a
more aggressive playing style. It is thus challenging and of great research
interest in its own right, but has not yet been explored. In this paper, we
present Meowjong, an AI for Sanma using deep reinforcement learning. We define
an informative and compact 2-dimensional data structure for encoding the
observable information in a Sanma game. We pre-train 5 convolutional neural
networks (CNNs) for Sanma's 5 actions -- discard, Pon, Kan, Kita and Riichi,
and enhance the major action's model, namely the discard model, via self-play
reinforcement learning using the Monte Carlo policy gradient method. Meowjong's
models achieve test accuracies comparable with AIs for 4-player Mahjong through
supervised learning, and gain a significant further enhancement from
reinforcement learning. Being the first ever AI in Sanma, we claim that
Meowjong stands as a state-of-the-art in this game.
Related papers
- Mxplainer: Explain and Learn Insights by Imitating Mahjong Agents [0.8088999193162028]
This paper introduces Mxplainer, a parameterized search algorithm that can be converted into an equivalent neural network to learn the parameters of black-box agents.<n>Experiments conducted on AI and human player data demonstrate that the learned parameters provide human-understandable insights into these agents' characteristics and play styles.
arXiv Detail & Related papers (2025-06-17T07:07:13Z) - Reinforcement Learning for High-Level Strategic Control in Tower Defense Games [47.618236610219554]
In strategy games, one of the most important aspects of game design is maintaining a sense of challenge for players.
We propose an automated approach that combines traditional scripted methods with reinforcement learning.
Results show that combining a learned approach, such as reinforcement learning, with a scripted AI produces a higher-performing and more robust agent than using only AI.
arXiv Detail & Related papers (2024-06-12T08:06:31Z) - DanZero+: Dominating the GuanDan Game through Reinforcement Learning [95.90682269990705]
We develop an AI program for an exceptionally complex and popular card game called GuanDan.
We first put forward an AI program named DanZero for this game.
In order to further enhance the AI's capabilities, we apply policy-based reinforcement learning algorithm to GuanDan.
arXiv Detail & Related papers (2023-12-05T08:07:32Z) - DanZero: Mastering GuanDan Game with Reinforcement Learning [121.93690719186412]
Card game AI has always been a hot topic in the research of artificial intelligence.
In this paper, we are devoted to developing an AI program for a more complex card game, GuanDan.
We propose the first AI program DanZero for GuanDan using reinforcement learning technique.
arXiv Detail & Related papers (2022-10-31T06:29:08Z) - Mastering the Game of No-Press Diplomacy via Human-Regularized
Reinforcement Learning and Planning [95.78031053296513]
No-press Diplomacy is a complex strategy game involving both cooperation and competition.
We introduce a planning algorithm we call DiL-piKL that regularizes a reward-maximizing policy toward a human imitation-learned policy.
We show that DiL-piKL can be extended into a self-play reinforcement learning algorithm we call RL-DiL-piKL.
arXiv Detail & Related papers (2022-10-11T14:47:35Z) - DouZero: Mastering DouDizhu with Self-Play Deep Reinforcement Learning [65.00325925262948]
We propose a conceptually simple yet effective DouDizhu AI system, namely DouZero.
DouZero enhances traditional Monte-Carlo methods with deep neural networks, action encoding, and parallel actors.
It was ranked the first in the Botzone leaderboard among 344 AI agents.
arXiv Detail & Related papers (2021-06-11T02:45:51Z) - ScrofaZero: Mastering Trick-taking Poker Game Gongzhu by Deep
Reinforcement Learning [2.7178968279054936]
We study Gongzhu, a trick-taking game analogous to, but slightly simpler than contract bridge.
We train a strong Gongzhu AI ScrofaZero from textittabula rasa by deep reinforcement learning.
We introduce new techniques for imperfect information game including stratified sampling, importance weighting, integral over equivalent class, Bayesian inference, etc.
arXiv Detail & Related papers (2021-02-15T12:01:44Z) - Suphx: Mastering Mahjong with Deep Reinforcement Learning [114.68233321904623]
We design an AI for Mahjong, named Suphx, based on deep reinforcement learning with some newly introduced techniques.
Suphx has demonstrated stronger performance than most top human players in terms of stable rank.
This is the first time that a computer program outperforms most top human players in Mahjong.
arXiv Detail & Related papers (2020-03-30T16:18:16Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.