Fugu-MT 論文翻訳(概要): Causal Reasoning Elicits Controllable 3D Scene Generation

論文の概要: Causal Reasoning Elicits Controllable 3D Scene Generation

arxiv url: http://arxiv.org/abs/2509.15249v1
Date: Thu, 18 Sep 2025 01:03:21 GMT
ステータス: 翻訳完了
システム内更新日: 2025-09-22 18:18:10.818072
Title: Causal Reasoning Elicits Controllable 3D Scene Generation
Title（参考訳）: 因果推論による3次元シーン生成の制御
Authors: Shen Chen, Ruiyu Zhao, Jiale Zhou, Zongkai Wu, Jenq-Neng Hwang, Lei Li,
Abstract要約: CausalStructは3Dシーン生成に因果推論を組み込む新しいフレームワークである。ノードがオブジェクトや属性を表現する因果グラフを構築し、エッジが因果依存性と物理的制約をエンコードする。提案手法では,3次元ガウス切削およびスコア蒸留サンプリングにより形状精度とレンダリング安定性を向上し,3次元シーンにおけるオブジェクト配置とレイアウトの誘導にテキストや画像を用いる。
参考スコア（独自算出の注目度）: 35.22855710229319
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Existing 3D scene generation methods often struggle to model the complex logical dependencies and physical constraints between objects, limiting their ability to adapt to dynamic and realistic environments. We propose CausalStruct, a novel framework that embeds causal reasoning into 3D scene generation. Utilizing large language models (LLMs), We construct causal graphs where nodes represent objects and attributes, while edges encode causal dependencies and physical constraints. CausalStruct iteratively refines the scene layout by enforcing causal order to determine the placement order of objects and applies causal intervention to adjust the spatial configuration according to physics-driven constraints, ensuring consistency with textual descriptions and real-world dynamics. The refined scene causal graph informs subsequent optimization steps, employing a Proportional-Integral-Derivative(PID) controller to iteratively tune object scales and positions. Our method uses text or images to guide object placement and layout in 3D scenes, with 3D Gaussian Splatting and Score Distillation Sampling improving shape accuracy and rendering stability. Extensive experiments show that CausalStruct generates 3D scenes with enhanced logical coherence, realistic spatial interactions, and robust adaptability.
Abstract（参考訳）: 既存の3Dシーン生成手法は、複雑な論理的依存関係とオブジェクト間の物理的な制約をモデル化するのに苦労し、動的で現実的な環境に適応する能力を制限する。因果推論を3次元シーン生成に組み込む新しいフレームワークCausalStructを提案する。大規模言語モデル(LLM)を用いて,ノードがオブジェクトや属性を表現する因果グラフを構築し,エッジが因果依存性や物理的制約をエンコードする。 CausalStructは、オブジェクトの配置順序を決定するために因果順序を強制することにより、シーンレイアウトを反復的に洗練し、物理駆動的な制約に従って空間構成を調整するために因果介入を適用し、テキスト記述や実世界のダイナミクスとの整合性を確保する。改良されたシーン因果グラフは、オブジェクトのスケールと位置を反復的にチューニングするために、Proportional-Integral-Derivative(PID)コントローラを使用する、その後の最適化手順を通知する。提案手法では,3次元ガウス切削およびスコア蒸留サンプリングにより形状精度とレンダリング安定性を向上し,3次元シーンにおけるオブジェクト配置とレイアウトの誘導にテキストや画像を用いる。大規模な実験により、CausalStructは、拡張された論理コヒーレンス、現実的な空間的相互作用、堅牢な適応性を持つ3Dシーンを生成することが示された。

論文の概要: Causal Reasoning Elicits Controllable 3D Scene Generation

関連論文リスト