Fugu-MT Paper Translation (Summary): Dream-Cubed: Controllable Generative Modeling in Minecraft by Training on Billions of Cubes

Paper overview: Dream-Cubed: Controllable Generative Modeling in Minecraft by Training on Billions of Cubes

arxiv url: http://arxiv.org/abs/2604.22847v1
Date: Wed, 22 Apr 2026 00:46:12 GMT
Status: Translation complete
Last updated in system: 2026-04-28 17:12:06.984903
Title: Dream-Cubed: Controllable Generative Modeling in Minecraft by Training on Billions of Cubes
Title (reference translation): Dream-Cubed: Controllable Generative Modeling in Minecraft by Training on Billions of Cubes
Authors: Tim Merino, Sam Earle, Ryunosuke Iwai, Julian Togelius, Edoardo Cetin
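Abstract summary: We introduce Dream-Cubed, a large-scale dataset of Minecraft worlds at voxel resolution. Dream-Cubed comprises tens of billions of tokens from a mixture of procedural biome terrain and human-authored maps. Using this dataset, we conduct the first large-scale study of 3D diffusion models for voxel generation.
Reference score (independently computed attention score): 14.861822164650967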
License: http://creativecommons.org/licenses/by-nc-sa/4.0/
Abstract: We introduce Dream-Cubed, a large-scale dataset of Minecraft worlds at voxel resolution, and a family of models using cubes as powerful compositional units for efficient generation of interactive 3D environments. Dream-Cubed comprises tens of billions of tokens from a carefully curated mixture of procedural biome terrain and high-quality human-authored maps. We use this dataset to conduct the first large-scale study of 3D diffusion models for voxel generation, analyzing discrete and continuous diffusion formulations, data compositions, and architectural design choices. Our models operate directly in the space of blocks, enabling efficient and semantically grounded generation while supporting interactive user workflows such as inpainting and outpainting from user-authored blocks. To quantitatively evaluate our models, we adapt the FID metric to assess semantic differences between real and generated world renderings, and validate generation quality through a human preference study. We release the full dataset, code, and all our pretrained models, which we hope will provide a foundation for future research in efficient generative modeling for structured, interactive 3D environments.
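Abstract (reference translation): We introduce Dream-Cubed, a large-scale dataset of Minecraft worlds at voxel resolution, and a family of models that use cubes as powerful compositional units for efficiently generating interactive 3D environments. Dream-Cubed comprises tens of billions of tokens from a carefully curated mixture of procedural biome terrain and high-quality human-authored maps. Using this dataset, we conduct the first large-scale study of 3D diffusion models for voxel generation, analyzing discrete and continuous diffusion formulations, data compositions, and architectural design choices. Our models operate directly in the space of blocks, enabling efficient, semantically grounded generation while supporting interactive user workflows such as inpainting and outpainting from user-authored blocks. We adapt the FID metric to quantitatively assess semantic differences between real and generated world renderings, and validate generation quality through a human preference study. We hope this will provide a foundation for future research in efficient generative modeling for structured, interactive 3D environments.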

Related papers