Fugu-MT 論文翻訳(概要): UrzaGPT: LoRA-Tuned Large Language Models for Card Selection in Collectible Card Games

論文の概要: UrzaGPT: LoRA-Tuned Large Language Models for Card Selection in Collectible Card Games

arxiv url: http://arxiv.org/abs/2508.08382v1
Date: Mon, 11 Aug 2025 18:09:15 GMT
ステータス: 翻訳完了
システム内更新日: 2025-08-13 21:07:34.203228
Title: UrzaGPT: LoRA-Tuned Large Language Models for Card Selection in Collectible Card Games
Title（参考訳）: UrzaGPT: カードゲームにおけるカード選択のための LoRA-Tuned Large Language Models
Authors: Timo Bertram,
Abstract要約: 我々は、リアルタイムのドラフト決定を推奨するドメイン適応の大規模言語モデルである$textitUrzaGPT$を紹介します。ゼロショット LLM や最先端のドメイン固有モデルと比較して $textitUrzaGPT$ をベンチマークする。 UrzaGPTを使って小さなモデルを微調整し、わずか10,000ステップで6.2%の精度を得る。
参考スコア（独自算出の注目度）: 0.0
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Collectible card games (CCGs) are a difficult genre for AI due to their partial observability, long-term decision-making, and evolving card sets. Due to this, current AI models perform vastly worse than human players at CCG tasks such as deckbuilding and gameplay. In this work, we introduce $\textit{UrzaGPT}$, a domain-adapted large language model that recommends real-time drafting decisions in $\textit{Magic: The Gathering}$. Starting from an open-weight LLM, we use Low-Rank Adaptation fine-tuning on a dataset of annotated draft logs. With this, we leverage the language modeling capabilities of LLM, and can quickly adapt to different expansions of the game. We benchmark $\textit{UrzaGPT}$ in comparison to zero-shot LLMs and the state-of-the-art domain-specific model. Untuned, small LLMs like Llama-3-8B are completely unable to draft, but the larger GPT-4o achieves a zero-shot performance of $43\%$. Using UrzaGPT to fine-tune smaller models, we achieve an accuracy of $66.2\%$ using only 10,000 steps. Despite this not reaching the capability of domain-specific models, we show that solely using LLMs to draft is possible and conclude that using LLMs can enable performant, general, and update-friendly drafting AIs in the future.
Abstract（参考訳）: 集合カードゲーム(CCG)は、部分的な可観測性、長期的な意思決定、進化するカードセットのために、AIにとって難しいジャンルである。このため、現在のAIモデルは、デッキビルディングやゲームプレイといったCGタスクにおいて、人間のプレイヤーよりもはるかにパフォーマンスが悪くなっている。本稿では、$\textit{Magic: The Gathering}$でリアルタイムのドラフト決定を推奨するドメイン適応の大規模言語モデルである$\textit{UrzaGPT}$を紹介します。オープンウェイト LLM から始めて,注釈付きドラフトログのデータセット上で低ランク適応を微調整する。これにより、LLMの言語モデリング能力を活用し、ゲームの様々な拡張に迅速に適応できる。ゼロショットLLMや最先端のドメイン固有モデルと比較して、$\textit{UrzaGPT}$をベンチマークする。 Llama-3-8Bのような小型のLLMは完全にドラフトできないが、より大きなGPT-4oは43\%のゼロショット性能を実現している。 UrzaGPTを使って小さなモデルを微調整し、わずか10,000ステップで6.2 %の精度を得る。このことはドメイン固有モデルの能力に届かなかったが、LLMをドラフトにのみ使用することは可能であり、将来的にはLLMを使用することで、パフォーマンス、汎用、アップデートフレンドリなドラフトAIを実現できると結論付けている。

論文の概要: UrzaGPT: LoRA-Tuned Large Language Models for Card Selection in Collectible Card Games

関連論文リスト