Fugu-MT 論文翻訳(概要): Boardwalk: Towards a Framework for Creating Board Games with LLMs

論文の概要: Boardwalk: Towards a Framework for Creating Board Games with LLMs

arxiv url: http://arxiv.org/abs/2508.16447v1
Date: Fri, 22 Aug 2025 15:02:07 GMT
ステータス: 翻訳完了
システム内更新日: 2025-08-25 16:42:36.424017
Title: Boardwalk: Towards a Framework for Creating Board Games with LLMs
Title（参考訳）: ボードウォーク: LLMでボードゲームを作るためのフレームワークを目指す
Authors: Álvaro Guglielmin Becker, Gabriel Bauer de Oliveira, Lana Bertoldo Rossato, Anderson Rocha Tavares,
Abstract要約: 我々は,自然言語で記述されたルールから,大規模言語モデルがボードゲームのデジタル版を実装できるかどうかを検討することを目的とする。我々は,ボードウォーク内およびボードウォーク内において,人気ゲーム12選をコーディングするために,最先端の3つのLSMを課題とする。我々のアプローチは、最高のパフォーマンスモデルであるClaude 3.7 Sonnetで、エラーなく55.6%のゲームが得られることを証明している。
参考スコア（独自算出の注目度）: 0.0
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Implementing board games in code can be a time-consuming task. However, Large Language Models (LLMs) have been proven effective at generating code for domain-specific tasks with simple contextual information. We aim to investigate whether LLMs can implement digital versions of board games from rules described in natural language. This would be a step towards an LLM-assisted framework for quick board game code generation. We expect to determine the main challenges for LLMs to implement the board games, and how different approaches and models compare to one another. We task three state-of-the-art LLMs (Claude, DeepSeek and ChatGPT) with coding a selection of 12 popular and obscure games in free-form and within Boardwalk, our proposed General Game Playing API. We anonymize the games and components to avoid evoking pre-trained LLM knowledge. The implementations are tested for playability and rule compliance. We evaluate success rate and common errors across LLMs and game popularity. Our approach proves viable, with the best performing model, Claude 3.7 Sonnet, yielding 55.6\% of games without any errors. While compliance with the API increases error frequency, the severity of errors is more significantly dependent on the LLM. We outline future steps for creating a framework to integrate this process, making the elaboration of board games more accessible.
Abstract（参考訳）: ボードゲームをコードで実装するのは時間を要する作業です。しかし、Large Language Models (LLM) は、シンプルな文脈情報を持つドメイン固有のタスクのコードを生成するのに有効であることが証明されている。我々は,LLMが自然言語で記述されたルールから,ボードゲームのデジタル版を実装できるかどうかを検討することを目的とする。これは、高速ボードゲームコード生成のためのLCM支援フレームワークへのステップとなるだろう。 LLMがボードゲームを実装する上での主な課題と、異なるアプローチとモデルが相互にどのように比較されるかを決定することを期待しています。提案するGeneral Game Playing APIであるClaude,DeepSeek,ChatGPTの3つの最先端 LLM を,ボードウォークおよびボードウォークの12種類の人気ゲームの選択をコーディングする。ゲームやコンポーネントを匿名化して,事前学習したLLM知識を回避します。実装はプレイ容易性とルールコンプライアンスのためにテストされる。 LLMにおける成功率と共通誤差とゲーム人気を評価する。我々のアプローチは、最高のパフォーマンスモデルであるClaude 3.7 Sonnetで、エラーなく55.6\%のゲームが得られることを証明している。 APIへの準拠はエラー頻度を増大させるが、エラーの深刻度はLLMに大きく依存する。このプロセスを統合するためのフレームワークを作成するための今後のステップについて概説する。

論文の概要: Boardwalk: Towards a Framework for Creating Board Games with LLMs

関連論文リスト