Fugu-MT Paper Translation (Abstract): SceneCode: Executable World Programs for Editable Indoor Scenes with Articulated Objects

Paper overview: SceneCode: Executable World Programs for Editable Indoor Scenes with Articulated Objects

arxiv url: http://arxiv.org/abs/2605.19587v1
Date: Tue, 19 May 2026 09:31:04 GMT
Status: Translation complete
Last updated in system: 2026-05-20 15:03:09.249766
Title: SceneCode: Executable World Programs for Editable Indoor Scenes with Articulated Objects
Authors: Puyi Wang, Yuhao Wang, Linjie Li, Zhengyuan Yang, Kevin Qinghong Lin, Yangguang Li, Yu Cheng
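Abstract summary: Indoor scene synthesis underpins embodied AI, robotic manipulation, and simulation-based policy evaluation. Existing pipelines represent generated content as static meshes and inherit articulation only from curated asset libraries. We introduce SceneCode, a framework that compiles natural language prompts into executable, code-driven indoor worlds.
Reference score (independently calculated attention metric): 69.20984454755512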
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Indoor scene synthesis underpins embodied AI, robotic manipulation, and simulation-based policy evaluation, where a useful scene must specify not only what the environment looks like, but also how its objects are structured. Existing pipelines, however, typically represent generated content as static meshes and inherit articulation only from curated asset libraries, which limits object-level controllability and prevents new interactable assets from being produced on demand. We address this gap by formulating physically interactable indoor scene synthesis as programmatic world generation, and present SceneCode, a framework that compiles a natural language prompt into an executable, code-driven indoor world rather than a collection of opaque meshes. A room-level agentic backbone first turns the prompt into a structured house layout and emits per-object AssetRequests through a planner-designer-critic loop. Each request is then routed to one of five code-generation strategies and converted into synthesized part-wise Blender Python programs that are validated through an execution-guided repair-and-refine loop. The resulting programs are compiled into simulation-ready assets and exported as SDF for physics simulation. A persistent scene-state registry links object requests, executable programs, rendered geometry, and simulation assets, turning scene assembly into a traceable and locally editable world-building process. We evaluate SceneCode across scene-level synthesis, object-level asset quality, human judgment, and downstream robot interaction. Results show that executable world programs improve prompt-faithful indoor scene generation and produce assets with cleaner mesh structure and simulator-loadable articulation metadata. Project page: https://scene-code.github.io/.
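The execution-guided repair loop and the registry described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: `RegistryEntry`, `repair_loop`, `toy_generator`, and the `build()` entry point are hypothetical names chosen for illustration, the generator is a hard-coded stand-in for a code-generating model, and plain `exec()` stands in for running a program inside Blender.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class RegistryEntry:
    """Hypothetical record tying one object request to its artifacts."""
    request: str                                 # per-object request text
    program: Optional[str] = None                # source of the accepted program
    attempts: int = 0                            # candidates executed so far
    errors: list = field(default_factory=list)   # error messages observed


def repair_loop(request, generate, max_attempts=3):
    """Generate a program, execute it, and feed errors back on failure.

    `generate(request, last_error)` is an assumed interface for the
    code generator; it receives the previous error message, or None.
    """
    entry = RegistryEntry(request=request)
    last_error = None
    for _ in range(max_attempts):
        source = generate(request, last_error)
        entry.attempts += 1
        try:
            # Run the candidate in a fresh namespace. A real pipeline
            # would execute it inside Blender instead of plain exec().
            namespace = {}
            exec(source, namespace)
            if "build" not in namespace:
                raise ValueError("program must define build()")
            namespace["build"]()
        except Exception as exc:
            # Record the failure and pass it to the next generation call.
            last_error = f"{type(exc).__name__}: {exc}"
            entry.errors.append(last_error)
            continue
        entry.program = source  # accept the first candidate that runs
        break
    return entry


def toy_generator(request, last_error):
    """Toy generator: the first candidate is buggy, the retry is fixed."""
    if last_error is None:
        return "def build():\n    return undefined_name\n"
    return "def build():\n    return {'parts': ['door', 'frame']}\n"


# The registry maps object ids to entries, so one object can be
# regenerated without touching the rest of the scene.
registry = {
    "cabinet_01": repair_loop("a cabinet with a hinged door", toy_generator),
}
```

Keeping the request, the accepted program, and the error history in one record is what makes an edit local in this sketch: regenerating `cabinet_01` replaces a single entry and leaves every other object untouched.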


Related papers