Deep Q-Network for AI Soccer
- URL: http://arxiv.org/abs/2209.09491v2
- Date: Wed, 21 Sep 2022 05:26:15 GMT
- Title: Deep Q-Network for AI Soccer
- Authors: Curie Kim, Yewon Hwang, and Jong-Hwan Kim
- Abstract summary: Deep Q-Network is designed to implement our original rewards, the state space, and the action space to train each agent.
Our algorithm successfully trained the agents, and its performance was preliminarily validated in a mini-competition.
With our algorithm, we advanced to the round of 16 in this international competition, which drew 130 teams from 39 countries.
- Score: 6.417982603606359
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Reinforcement learning has shown outstanding performance in
games, particularly in Atari games as well as Go. Based on
these successful examples, we attempt to apply one of the well-known
reinforcement learning algorithms, Deep Q-Network, to the AI Soccer game. AI
Soccer is a 5:5 robot soccer game where each participant develops an algorithm
that controls five robots in a team to defeat the opponent participant. Deep
Q-Network is designed to implement our original rewards, the state space, and
the action space to train each agent so that it can take proper actions in
different situations during the game. Our algorithm successfully trained
the agents, and its performance was preliminarily validated in a
mini-competition against 10 teams wishing to take part in the AI Soccer
international competition. The competition was organized by the AI World Cup
committee, in conjunction with the WCG 2019 Xi'an AI Masters. With our
algorithm, we advanced to the round of 16 in this international competition,
which drew 130 teams from 39 countries.
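The abstract names Deep Q-Network but does not give the rewards, state space, or action space the authors used. As a rough illustration only, the following sketch shows three generic DQN ingredients (epsilon-greedy action selection, the one-step TD target, and a replay buffer) in plain Python. Every name and value here, including `GAMMA`, is a hypothetical placeholder and not the authors' implementation.

```python
import random
from collections import deque

GAMMA = 0.99  # discount factor; assumed value, not taken from the paper


def epsilon_greedy(q_values, epsilon, rng=random):
    """Pick a random action with probability epsilon, else the greedy one."""
    if rng.random() < epsilon:
        return rng.randrange(len(q_values))
    return max(range(len(q_values)), key=lambda a: q_values[a])


def td_target(reward, next_q_values, done, gamma=GAMMA):
    """One-step DQN target: r if the episode ended, else r + gamma * max Q(s', a')."""
    if done:
        return reward
    return reward + gamma * max(next_q_values)


class ReplayBuffer:
    """Fixed-size store of (state, action, reward, next_state, done) transitions."""

    def __init__(self, capacity):
        self.buffer = deque(maxlen=capacity)  # oldest transitions are dropped first

    def push(self, transition):
        self.buffer.append(transition)

    def sample(self, batch_size, rng=random):
        # Uniform sampling without replacement breaks temporal correlation.
        return rng.sample(list(self.buffer), batch_size)

    def __len__(self):
        return len(self.buffer)
```

In a full agent, `next_q_values` would come from a separate target network evaluated on the next state, and the Q-network would be trained to reduce the squared difference between its prediction for the taken action and `td_target`.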
Related papers
- Brilla AI: AI Contestant for the National Science and Maths Quiz [0.7329200485567825]
This work describes and evaluates the first key output for the NSMQ AI Grand Challenge.
It proposes a robust, real-world benchmark for such an AI: "Build an AI to compete live in Ghana's National Science and Maths Quiz (NSMQ) competition and win."
In its debut, our AI answered one of the 4 riddles ahead of the 3 human contesting teams, unofficially placing second (tied).
arXiv Detail & Related papers (2024-03-04T03:24:18Z)
- DanZero+: Dominating the GuanDan Game through Reinforcement Learning [95.90682269990705]
We develop an AI program for an exceptionally complex and popular card game called GuanDan.
We first put forward an AI program named DanZero for this game.
To further enhance the AI's capabilities, we apply a policy-based reinforcement learning algorithm to GuanDan.
arXiv Detail & Related papers (2023-12-05T08:07:32Z)
- Teamwork under extreme uncertainty: AI for Pokemon ranks 33rd in the world [0.0]
This paper describes the mechanics of the game and presents a game analysis.
We propose unique AI algorithms based on our understanding that the two biggest challenges in the game are keeping a balanced team and dealing with three sources of uncertainty.
Our AI agent performed significantly better than all previous attempts and peaked at 33rd place in the world in one of the most popular battle formats, while running on only 4 single-socket servers.
arXiv Detail & Related papers (2022-12-27T01:52:52Z)
- DanZero: Mastering GuanDan Game with Reinforcement Learning [121.93690719186412]
Card game AI has always been a hot topic in the research of artificial intelligence.
In this paper, we are devoted to developing an AI program for a more complex card game, GuanDan.
We propose DanZero, the first AI program for GuanDan, built using reinforcement learning techniques.
arXiv Detail & Related papers (2022-10-31T06:29:08Z)
- Retrospective on the 2021 BASALT Competition on Learning from Human Feedback [92.37243979045817]
The goal of the competition was to promote research towards agents that use learning from human feedback (LfHF) techniques to solve open-world tasks.
Rather than mandating the use of LfHF techniques, we described four tasks in natural language to be accomplished in the video game Minecraft.
Teams developed a diverse range of LfHF algorithms across a variety of possible human feedback types.
arXiv Detail & Related papers (2022-04-14T17:24:54Z)
- AI in Games: Techniques, Challenges and Opportunities [40.86375378643978]
Various game AI systems (AIs), such as Libratus, OpenAI Five and AlphaStar, have been developed and have beaten professional human players.
In this paper, we survey recent successful game AIs, covering board game AIs, card game AIs, first-person shooting game AIs and real time strategy game AIs.
arXiv Detail & Related papers (2021-11-15T09:35:53Z)
- Snakes AI Competition 2020 and 2021 Report [65.7695644335859]
The Snakes AI Competition was held by Innopolis University.
It was part of the IEEE Conference on Games 2020 and 2021 editions.
It aimed to create a sandbox for learning and implementing artificial intelligence algorithms in agents.
arXiv Detail & Related papers (2021-08-11T10:27:11Z)
- A Game AI Competition to foster Collaborative AI research and development [5.682875185620577]
We present the Geometry Friends Game AI Competition.
The concept of the game is simple, though solving it has proven difficult.
We discuss the competition and the challenges it brings, and present an overview of the current solutions.
arXiv Detail & Related papers (2020-10-17T23:03:06Z)
- TotalBotWar: A New Pseudo Real-time Multi-action Game Challenge and Competition for AI [62.997667081978825]
TotalBotWar is a new pseudo real-time multi-action challenge for game AI.
The game is based on the popular TotalWar game series, in which players manage an army to defeat the opponent's army.
arXiv Detail & Related papers (2020-09-18T09:13:56Z)
- Suphx: Mastering Mahjong with Deep Reinforcement Learning [114.68233321904623]
We design an AI for Mahjong, named Suphx, based on deep reinforcement learning with some newly introduced techniques.
Suphx has demonstrated stronger performance than most top human players in terms of stable rank.
This is the first time that a computer program outperforms most top human players in Mahjong.
arXiv Detail & Related papers (2020-03-30T16:18:16Z)