Fugu-MT 論文翻訳(概要): LivingWorld: Interactive 4D World Generation with Environmental Dynamics

論文の概要: LivingWorld: Interactive 4D World Generation with Environmental Dynamics

arxiv url: http://arxiv.org/abs/2604.01641v1
Date: Thu, 02 Apr 2026 05:38:48 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-03 14:21:10.3778
Title: LivingWorld: Interactive 4D World Generation with Environmental Dynamics
Title（参考訳）: LivingWorld: インタラクティブな4Dワールドジェネレーションと環境ダイナミクス
Authors: Hyeongju Mun, In-Hwan Jin, Sohyeong Kim, Kyeongbo Kong,
Abstract要約: リビングワールド(LivingWorld)は、1つの画像から環境動態を持つ4次元世界を生成するインタラクティブなフレームワークである。 LivingWorldはこの課題に対処し、シーンが拡大するにつれて、グローバルなコヒーレントなモーションフィールドを徐々に構築する。我々はさらに、コンパクトなハッシュベースの運動場を用いて動きを表現し、シーン全体にわたって効率的なクエリと安定した動的伝播を可能にする。
参考スコア（独自算出の注目度）: 8.868060488503847
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: We introduce LivingWorld, an interactive framework for generating 4D worlds with environmental dynamics from a single image. While recent advances in 3D scene generation enable large-scale environment creation, most approaches focus primarily on reconstructing static geometry, leaving scene-scale environmental dynamics such as clouds, water, or smoke largely unexplored. Modeling such dynamics is challenging because motion must remain coherent across an expanding scene while supporting low-latency user feedback. LivingWorld addresses this challenge by progressively constructing a globally coherent motion field as the scene expands. To maintain global consistency during expansion, we introduce a geometry-aware alignment module that resolves directional and scale ambiguities across views. We further represent motion using a compact hash-based motion field, enabling efficient querying and stable propagation of dynamics throughout the scene. This representation also supports bidirectional motion propagation during rendering, producing long and temporally coherent 4D sequences without relying on expensive video-based refinement. On a single RTX 5090 GPU, generating each new scene expansion step requires 9 seconds, followed by 3 seconds for motion alignment and motion field updates, enabling interactive 4D world generation with globally coherent environmental dynamics. Video demonstrations are available at cvsp-lab.github.io/LivingWorld.
Abstract（参考訳）: リビングワールド(LivingWorld)は、1つの画像から環境動態を持つ4次元世界を生成するインタラクティブなフレームワークである。近年の3Dシーン生成の進歩は大規模な環境生成を可能にしているが、ほとんどのアプローチは、主に静的な幾何学の再構築に焦点を当てており、雲や水、煙といったシーンスケールの環境ダイナミクスは、ほとんど探索されていない。このようなダイナミクスのモデリングは、低レイテンシのユーザフィードバックをサポートしながら、動きは拡大するシーン全体で一貫性を保たなければならないため、難しい。 LivingWorldはこの課題に対処し、シーンが拡大するにつれて、グローバルなコヒーレントなモーションフィールドを徐々に構築する。拡張時のグローバルな整合性を維持するため,ビュー間の方向やスケールのあいまいさを解消する幾何対応アライメントモジュールを導入する。我々はさらに、コンパクトなハッシュベースの運動場を用いて動きを表現し、シーン全体にわたって効率的なクエリと安定した動的伝播を可能にする。この表現はまた、レンダリング中の双方向のモーション伝搬をサポートし、高価なビデオベースの精細化に頼ることなく、長時間かつ時間的にコヒーレントな4Dシーケンスを生成する。 1つのRTX 5090 GPUでは、新しいシーン展開ステップを生成するのに9秒を必要とし、その後3秒で動きのアライメントとモーションフィールドが更新され、グローバルなコヒーレントな環境ダイナミクスを備えたインタラクティブな4Dワールドジェネレーションが可能になる。ビデオデモはcvsp-lab.github.io/LivingWorldで公開されている。

論文の概要: LivingWorld: Interactive 4D World Generation with Environmental Dynamics

関連論文リスト