Word Play for Playing Othello (Reverses)
- URL: http://arxiv.org/abs/2207.08766v1
- Date: Mon, 18 Jul 2022 17:13:32 GMT
- Title: Word Play for Playing Othello (Reverses)
- Authors: Samantha E. Miller Noever, David Noever
- Abstract summary: The research applies both the larger (GPT-3) and smaller (GPT-2) language models to explore the complex strategies for the game of Othello (or Reverses).
The language model automatically captures or emulates championship-level strategies.
The fine-tuned GPT-2 model generates Othello games ranging from 13-71% completion, while the larger GPT-3 model reaches 41% of a complete game.
- License: http://creativecommons.org/licenses/by-sa/4.0/
- Abstract: Language models like OpenAI's Generative Pre-Trained Transformers (GPT-2/3)
capture the long-term correlations needed to generate text in a variety of
domains (such as language translators) and recently in gameplay (chess, Go, and
checkers). The present research applies both the larger (GPT-3) and smaller
(GPT-2) language models to explore the complex strategies for the game of
Othello (or Reverses). Given the game rules for rapid reversals of fortune, the
language model not only represents a candidate predictor of the next move based
on previous game moves but also avoids sparse rewards in gameplay. The language
model automatically captures or emulates championship-level strategies. The
fine-tuned GPT-2 model generates Othello games ranging from 13-71% completion,
while the larger GPT-3 model reaches 41% of a complete game. Like previous work
with chess and Go, these language models offer a novel way to generate
plausible game archives, particularly for comparing opening moves across a
larger sample than humanly possible to explore. A primary contribution of these
models is a two-fold magnification of the previous record for player archives
(120,000 human games collected over 45 years, 1977-2022), thus supplying the
research community with more diverse and original strategies for sampling with
other reinforcement learning techniques.
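The paper does not publish its scoring code, but the "13-71% completion" figures suggest measuring how far a generated move transcript gets through a full game. A full Othello game fills at most the 60 empty squares of the 8x8 board (four central discs are pre-placed), so one plausible sketch of such a metric, assuming a space-separated coordinate transcript and scoring only the valid prefix, is:

```python
import re

# A full Othello game covers at most the 60 empty squares of the
# 8x8 board (the four central discs are placed before play begins).
MAX_MOVES = 60
MOVE_PATTERN = re.compile(r"^[a-h][1-8]$")

def completion_percentage(transcript: str) -> float:
    """Score how much of a full game a generated transcript covers.

    The transcript is assumed to be a space-separated list of moves in
    standard coordinate notation (column a-h, row 1-8), e.g. "d3 c5 f6".
    Scanning stops at the first token that is not a well-formed
    coordinate, so malformed continuations do not inflate the score.
    """
    valid = 0
    for token in transcript.lower().split():
        if not MOVE_PATTERN.match(token):
            break
        valid += 1
    return 100.0 * min(valid, MAX_MOVES) / MAX_MOVES

print(completion_percentage("d3 c5 f6 f5"))  # 4 of 60 moves
```

This checks only syntactic well-formedness, not move legality; the paper's own evaluation may additionally replay moves against the board rules, which would lower scores further.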
Related papers
- Strategic Insights in Human and Large Language Model Tactics at Word Guessing Games
At the beginning of 2022, a simplistic word-guessing game took the world by storm.
We examine the strategies of daily word-guessing game players that have evolved during a period of over two years.
arXiv: 2024-09-17T12:06:05Z
- Instruction-Driven Game Engines on Large Language Models
The IDGE project aims to democratize game development by enabling a large language model to follow free-form game rules.
We train the IDGE in a curriculum manner that progressively increases the model's exposure to complex scenarios.
Our initial progress lies in developing an IDGE for Poker, a universally cherished card game.
arXiv: 2024-03-30T08:02:16Z
- Retrieval is Accurate Generation
We introduce a novel method that selects context-aware phrases from a collection of supporting documents.
Our model achieves the best performance and the lowest latency among several retrieval-augmented baselines.
arXiv: 2024-02-27T14:16:19Z
- Promptable Game Models: Text-Guided Game Simulation via Masked Diffusion Models
We present a Promptable Game Model (PGM) for neural video game simulators.
It allows a user to play the game by prompting it with high- and low-level action sequences.
Most captivatingly, our PGM unlocks the director's mode, where the game is played by specifying goals for the agents in the form of a prompt.
Our method significantly outperforms existing neural video game simulators in terms of rendering quality and unlocks applications beyond the capabilities of the current state of the art.
arXiv: 2023-03-23T17:43:17Z
- Robust Preference Learning for Storytelling via Contrastive Reinforcement Learning
Controlled automated story generation seeks to generate natural language stories satisfying constraints from natural language critiques or preferences.
We train a contrastive bi-encoder model to align stories with human critiques, building a general purpose preference model.
We further fine-tune the contrastive reward model using a prompt-learning technique to increase story generation robustness.
arXiv: 2022-10-14T13:21:33Z
- Bidirectional Language Models Are Also Few-shot Learners
We present SAP (Sequential Autoregressive Prompting), a technique that enables the prompting of bidirectional models.
We show SAP is effective on question answering and summarization.
For the first time, our results demonstrate prompt-based learning is an emergent property of a broader class of language models.
arXiv: 2022-09-29T01:35:57Z
- Keep CALM and Explore: Language Models for Action Generation in Text-based Games
We propose the Contextual Action Language Model (CALM) to generate a compact set of action candidates at each game state.
We combine CALM with a reinforcement learning agent which re-ranks the generated action candidates to maximize in-game rewards.
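The generate-then-re-rank pattern described here can be illustrated with a minimal sketch; the candidate actions and the scoring dictionary below are toy stand-ins for the language model's proposals and the RL agent's learned value estimates, not the paper's actual code:

```python
def rerank_actions(candidates, score_fn, top_k=3):
    """Re-rank a language model's action candidates by an external score.

    Sketch of a CALM-style pipeline: the LM proposes a compact candidate
    set for the current game state, and an RL agent's value estimate
    (score_fn here) selects the actions worth executing.
    """
    return sorted(candidates, key=score_fn, reverse=True)[:top_k]

# Toy stand-in for a learned value function over actions.
candidates = ["go north", "take key", "open door", "eat lamp"]
scores = {"take key": 0.9, "open door": 0.7, "go north": 0.2, "eat lamp": 0.0}
best = rerank_actions(candidates, scores.get, top_k=2)
print(best)  # ['take key', 'open door']
```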
arXiv: 2020-10-06T17:36:29Z
- Navigating Human Language Models with Synthetic Agents
We train a version of GPT-2 on a corpus of historical chess games, and then "launch" clusters of synthetic agents into the model.
We find that the percentages of moves by piece generated by the model are substantially similar to human patterns.
arXiv: 2020-08-10T14:39:53Z
- The Chess Transformer: Mastering Play using Generative Language Models
This work demonstrates that natural language transformers can support more generic strategic modeling.
In addition to learning natural language skills, the abstract transformer architecture can generate meaningful moves on a chessboard.
We anticipate future work will build on this transformer's promise, particularly in other strategy games.
arXiv: 2020-08-02T18:04:36Z
- The Go Transformer: Natural Language Modeling for Game Play
This work applies natural language modeling to generate plausible strategic moves in the ancient game of Go.
We train the Generative Pretrained Transformer (GPT-2) to mimic the style of Go champions as archived in Smart Game Format.
The trained model further generates valid but previously unseen strategies for Go.
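Fine-tuning a language model on Smart Game Format (SGF) archives requires flattening each game record into a plain move sequence. A minimal sketch of that preprocessing step (the function name and text encoding are illustrative, not from the paper) could look like:

```python
import re

def sgf_to_move_text(sgf: str) -> str:
    """Flatten the move sequence of an SGF game record into plain text.

    SGF stores Go moves as ';B[pd]' / ';W[dd]' properties; a language
    model such as GPT-2 can then be fine-tuned on the resulting
    space-separated move string. Setup and metadata properties
    (GM, SZ, comments, ...) are ignored.
    """
    moves = re.findall(r";([BW])\[([a-s]{2})\]", sgf)
    return " ".join(f"{color}[{coord}]" for color, coord in moves)

sample = "(;GM[1]SZ[19];B[pd];W[dd];B[pq])"
print(sgf_to_move_text(sample))  # B[pd] W[dd] B[pq]
```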
arXiv: 2020-07-07T14:37:27Z
- Optimus: Organizing Sentences via Pre-trained Modeling of a Latent Space
Variational Autoencoder (VAE) can be both a powerful generative model and an effective representation learning framework for natural language.
In this paper, we propose the first large-scale language VAE model, Optimus.
arXiv: 2020-04-05T06:20:18Z