Efficient Reasoning in Regular Boardgames
- URL: http://arxiv.org/abs/2006.08295v1
- Date: Mon, 15 Jun 2020 11:42:08 GMT
- Title: Efficient Reasoning in Regular Boardgames
- Authors: Jakub Kowalski, Rados{\l}aw Miernik, Maksymilian Mika, Wojciech
Pawlik, Jakub Sutowicz, Marek Szyku{\l}a, Andrzej Tkaczyk
- Abstract summary: We present the technical side of reasoning in Regular Boardgames (RBG) language.
RBG serves as a research tool that aims to aid in the development of generalized algorithms for knowledge inference, analysis, generation, learning, and playing games.
- Score: 2.909363382704072
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: We present the technical side of reasoning in Regular Boardgames (RBG)
language -- a universal General Game Playing (GGP) formalism for the class of
finite deterministic games with perfect information, encoding rules in the form
of regular expressions. RBG serves as a research tool that aims to aid in the
development of generalized algorithms for knowledge inference, analysis,
generation, learning, and playing games. In all these tasks, both generality
and efficiency are important.
In the first part, this paper describes optimizations used by the RBG
compiler. The impact of these optimizations ranges from 1.7 to even 33-fold
efficiency improvement when measuring the number of possible game playouts per
second. Then, we perform an in-depth efficiency comparison with three other
modern GGP systems (GDL, Ludii, Ai Ai). We also include our own highly
optimized game-specific reasoners to provide a point of reference of the
maximum speed. Our experiments show that RBG is currently the fastest among the
abstract general game playing languages, and its efficiency can be competitive
to common interface-based systems that rely on handcrafted game-specific
implementations. Finally, we discuss some issues and methodology of computing
benchmarks like this.
Related papers
- Fast and Knowledge-Free Deep Learning for General Game Playing (Student
Abstract) [1.9750759888062657]
We develop a method of adapting the AlphaZero model to General Game Playing (GGP)
The dataset generation uses MCTS playing instead of self-play; only the value network is used, and attention layers replace the convolutional ones.
arXiv Detail & Related papers (2023-12-21T18:44:19Z) - SPRING: Studying the Paper and Reasoning to Play Games [102.5587155284795]
We propose a novel approach, SPRING, to read the game's original academic paper and use the knowledge learned to reason and play the game through a large language model (LLM)
In experiments, we study the quality of in-context "reasoning" induced by different forms of prompts under the setting of the Crafter open-world environment.
Our experiments suggest that LLMs, when prompted with consistent chain-of-thought, have great potential in completing sophisticated high-level trajectories.
arXiv Detail & Related papers (2023-05-24T18:14:35Z) - The Update-Equivalence Framework for Decision-Time Planning [78.44953498421854]
We introduce an alternative framework for decision-time planning that is not based on solving subgames, but rather on update equivalence.
We derive a provably sound search algorithm for fully cooperative games based on mirror descent and a search algorithm for adversarial games based on magnetic mirror descent.
arXiv Detail & Related papers (2023-04-25T20:28:55Z) - Kernelized Multiplicative Weights for 0/1-Polyhedral Games: Bridging the
Gap Between Learning in Extensive-Form and Normal-Form Games [76.21916750766277]
We show that the Optimistic Multiplicative Weights Update (OMWU) algorithm can be simulated on the normal-form equivalent of an EFG in linear time per iteration in the game tree size using a kernel trick.
Specifically, KOMWU gives the first algorithm that guarantees at the same time last-iterate convergence.
arXiv Detail & Related papers (2022-02-01T06:28:51Z) - No-Regret Learning in Time-Varying Zero-Sum Games [99.86860277006318]
Learning from repeated play in a fixed zero-sum game is a classic problem in game theory and online learning.
We develop a single parameter-free algorithm that simultaneously enjoys favorable guarantees under three performance measures.
Our algorithm is based on a two-layer structure with a meta-algorithm learning over a group of black-box base-learners satisfying a certain property.
arXiv Detail & Related papers (2022-01-30T06:10:04Z) - Spatial State-Action Features for General Games [5.849736173068868]
We formulate a design and efficient implementation of spatial state-action features for general games.
These are patterns that can be trained to incentivise or disincentivise actions based on whether or not they match variables of the state in a local area.
We propose an efficient approach for evaluating active features for any given set of features.
arXiv Detail & Related papers (2022-01-17T13:34:04Z) - Revisiting Game Representations: The Hidden Costs of Efficiency in
Sequential Decision-making Algorithms [0.6749750044497732]
Recent advancements in algorithms for sequential decision-making under imperfect information have shown remarkable success in large games.
These algorithms traditionally formalize the games using the extensive-form game formalism.
We show that a popular workaround involves using a specialized representation based on player specific information-state trees.
arXiv Detail & Related papers (2021-12-20T22:34:19Z) - Optimised Playout Implementations for the Ludii General Game System [8.344476599818828]
The Ludii general game system can automatically infer, based on a game's description in its general game description language, whether any optimised implementations are applicable.
An empirical evaluation demonstrates major speedups over a standard implementation, with a median result of running playouts 5.08 times as fast, over 145 different games in Ludii.
arXiv Detail & Related papers (2021-11-04T12:59:53Z) - Deep Reinforcement Learning with Stacked Hierarchical Attention for
Text-based Games [64.11746320061965]
We study reinforcement learning for text-based games, which are interactive simulations in the context of natural language.
We aim to conduct explicit reasoning with knowledge graphs for decision making, so that the actions of an agent are generated and supported by an interpretable inference procedure.
We extensively evaluate our method on a number of man-made benchmark games, and the experimental results demonstrate that our method performs better than existing text-based agents.
arXiv Detail & Related papers (2020-10-22T12:40:22Z) - Learning Dynamic Belief Graphs to Generalize on Text-Based Games [55.59741414135887]
Playing text-based games requires skills in processing natural language and sequential decision making.
In this work, we investigate how an agent can plan and generalize in text-based games using graph-structured representations learned end-to-end from raw text.
arXiv Detail & Related papers (2020-02-21T04:38:37Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.