Fugu-MT 論文翻訳(概要): PhysGM: Large Physical Gaussian Model for Feed-Forward 4D Synthesis

論文の概要: PhysGM: Large Physical Gaussian Model for Feed-Forward 4D Synthesis

arxiv url: http://arxiv.org/abs/2508.13911v1
Date: Tue, 19 Aug 2025 15:10:30 GMT
ステータス: 翻訳完了
システム内更新日: 2025-08-20 15:36:31.978664
Title: PhysGM: Large Physical Gaussian Model for Feed-Forward 4D Synthesis
Title（参考訳）: PhysGM:フィードフォワード4D合成のための大規模物理ガウスモデル
Authors: Chunji Lv, Zequn Chen, Donglin Di, Weinan Zhang, Hao Li, Wei Chen, Changsheng Li,
Abstract要約: PhysGMはフィードフォワードフレームワークであり、単一の画像から3次元ガウス表現とその物理的特性を共同で予測する。 1分で1枚の画像から高忠実度4Dシミュレーションを効果的に生成する。
参考スコア（独自算出の注目度）: 37.21119648359889
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: While physics-grounded 3D motion synthesis has seen significant progress, current methods face critical limitations. They typically rely on pre-reconstructed 3D Gaussian Splatting (3DGS) representations, while physics integration depends on either inflexible, manually defined physical attributes or unstable, optimization-heavy guidance from video models. To overcome these challenges, we introduce PhysGM, a feed-forward framework that jointly predicts a 3D Gaussian representation and its physical properties from a single image, enabling immediate, physical simulation and high-fidelity 4D rendering. We first establish a base model by jointly optimizing for Gaussian reconstruction and probabilistic physics prediction. The model is then refined with physically plausible reference videos to enhance both rendering fidelity and physics prediction accuracy. We adopt the Direct Preference Optimization (DPO) to align its simulations with reference videos, circumventing Score Distillation Sampling (SDS) optimization which needs back-propagating gradients through the complex differentiable simulation and rasterization. To facilitate the training, we introduce a new dataset PhysAssets of over 24,000 3D assets, annotated with physical properties and corresponding guiding videos. Experimental results demonstrate that our method effectively generates high-fidelity 4D simulations from a single image in one minute. This represents a significant speedup over prior works while delivering realistic rendering results. Our project page is at:https://hihixiaolv.github.io/PhysGM.github.io/
Abstract（参考訳）: 物理地上の3Dモーション合成は大きな進歩を遂げているが、現在の手法には限界がある。それらは通常、事前に再構成された3Dガウススプラッティング(3DGS)表現に依存し、物理積分は、柔軟で手動で定義された物理的特性または不安定で最適化性の高いビデオモデルからのガイダンスに依存する。これらの課題を克服するために、フィードフォワードフレームワークであるPhysGMを導入し、単一の画像から3次元ガウス表現とその物理特性を共同で予測し、即時、物理シミュレーションと高忠実度4Dレンダリングを可能にする。まず,ガウスの復元と確率論的物理予測を共同で最適化し,基礎モデルを確立する。モデルには物理的に妥当な参照ビデオが組み込まれ、レンダリングの忠実さと物理予測の精度が向上する。我々はDPO(Direct Preference Optimization)を採用し、複雑な微分可能シミュレーションとラスタライゼーションによる逆伝播勾配を必要とするスコア蒸留サンプリング(SDS)最適化を回避する。トレーニングを容易にするために,24,000以上の3Dアセットからなる新しいデータセットPhysAssetを紹介した。実験の結果,1分で1枚の画像から高忠実度4Dシミュレーションを効果的に生成できることがわかった。これは、リアルなレンダリング結果を提供しながら、以前の作業よりも大幅にスピードアップしている。私たちのプロジェクトページは以下の通りです。

論文の概要: PhysGM: Large Physical Gaussian Model for Feed-Forward 4D Synthesis

関連論文リスト