Fugu-MT 論文翻訳(概要): Physical Simulator In-the-Loop Video Generation

論文の概要: Physical Simulator In-the-Loop Video Generation

arxiv url: http://arxiv.org/abs/2603.06408v1
Date: Fri, 06 Mar 2026 15:48:25 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-09 13:17:46.174479
Title: Physical Simulator In-the-Loop Video Generation
Title（参考訳）: 物理シミュレータインザループ映像生成
Authors: Lin Geng Foo, Mark He Huang, Alexandros Lattas, Stylianos Moschoglou, Thabo Beeler, Christian Theobalt,
Abstract要約: Physical Simulator In-the-loop Video Generation (PSIVG)は、物理シミュレータをビデオ拡散プロセスに統合する新しいフレームワークである。 PSIVGは、視覚的品質と多様性を保ちながら、現実世界の物理に忠実なビデオを制作する。
参考スコア（独自算出の注目度）: 96.87054314612142
License: http://creativecommons.org/licenses/by-nc-sa/4.0/
Abstract: Recent advances in diffusion-based video generation have achieved remarkable visual realism but still struggle to obey basic physical laws such as gravity, inertia, and collision. Generated objects often move inconsistently across frames, exhibit implausible dynamics, or violate physical constraints, limiting the realism and reliability of AI-generated videos. We address this gap by introducing Physical Simulator In-the-loop Video Generation (PSIVG), a novel framework that integrates a physical simulator into the video diffusion process. Starting from a template video generated by a pre-trained diffusion model, PSIVG reconstructs the 4D scene and foreground object meshes, initializes them within a physical simulator, and generates physically consistent trajectories. These simulated trajectories are then used to guide the video generator toward spatio-temporally physically coherent motion. To further improve texture consistency during object movement, we propose a Test-Time Texture Consistency Optimization (TTCO) technique that adapts text and feature embeddings based on pixel correspondences from the simulator. Comprehensive experiments demonstrate that PSIVG produces videos that better adhere to real-world physics while preserving visual quality and diversity. Project Page: https://vcai.mpi-inf.mpg.de/projects/PSIVG/
Abstract（参考訳）: 拡散に基づくビデオ生成の最近の進歩は目覚ましい視覚的リアリズムを達成しているが、重力や慣性、衝突といった基本的な物理法則に従うのに苦慮している。生成されたオブジェクトは、しばしばフレームを無矛盾に移動し、不可解なダイナミクスを示したり、物理的制約に違反して、AI生成ビデオのリアリズムと信頼性を制限する。物理シミュレータをビデオ拡散プロセスに統合する新しいフレームワークPSIVGを導入することで、このギャップに対処する。事前訓練された拡散モデルによって生成されたテンプレートビデオから、PSIVGは4Dシーンと前景のオブジェクトメッシュを再構成し、それらを物理シミュレータ内で初期化し、物理的に一貫した軌道を生成する。これらのシミュレートされた軌道は、ビデオジェネレータを時空間的に物理的にコヒーレントな動きへと導くのに使用される。オブジェクト移動時のテクスチャの整合性を改善するため,シミュレータからの画素対応に基づいたテキストと特徴埋め込みを適応するTTCO(Test-Time Texture Consistency Optimization)技術を提案する。総合的な実験により、PSIVGは視覚的品質と多様性を保ちながら、現実世界の物理に忠実なビデオを生成する。プロジェクトページ:https://vcai.mpi-inf.mpg.de/projects/PSIVG/

論文の概要: Physical Simulator In-the-Loop Video Generation

関連論文リスト