Fugu-MT 論文翻訳(概要): The Pulse of Motion: Measuring Physical Frame Rate from Visual Dynamics

論文の概要: The Pulse of Motion: Measuring Physical Frame Rate from Visual Dynamics

arxiv url: http://arxiv.org/abs/2603.14375v1
Date: Sun, 15 Mar 2026 13:29:31 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-17 16:19:35.780996
Title: The Pulse of Motion: Measuring Physical Frame Rate from Visual Dynamics
Title（参考訳）: 動きのパルス:視覚力学から物理フレーム速度を測定する
Authors: Xiangbo Gao, Mingyang Wu, Siyuan Yang, Jiongze Yu, Pardis Taghavi, Fangzhou Lin, Zhengzhong Tu,
Abstract要約: 入力ビデオの視覚力学から秒間物理フレームを直接復元する予測器であるビジュアルクロノメーターを提案する。我々の評価では、最先端のビデオジェネレータが深刻なPhyFPSのミスアライメントと時間的不安定に悩まされているという厳しい現実が明らかになっている。 PhyFPS補正を適用することで、AI生成ビデオの人間の知覚する自然さが大幅に向上する。
参考スコア（独自算出の注目度）: 18.3026562815791
License: http://creativecommons.org/licenses/by/4.0/
Abstract: While recent generative video models have achieved remarkable visual realism and are being explored as world models, true physical simulation requires mastering both space and time. Current models can produce visually smooth kinematics, yet they lack a reliable internal motion pulse to ground these motions in a consistent, real-world time scale. This temporal ambiguity stems from the common practice of indiscriminately training on videos with vastly different real-world speeds, forcing them into standardized frame rates. This leads to what we term chronometric hallucination: generated sequences exhibit ambiguous, unstable, and uncontrollable physical motion speeds. To address this, we propose Visual Chronometer, a predictor that recovers the Physical Frames Per Second (PhyFPS) directly from the visual dynamics of an input video. Trained via controlled temporal resampling, our method estimates the true temporal scale implied by the motion itself, bypassing unreliable metadata. To systematically quantify this issue, we establish two benchmarks, PhyFPS-Bench-Real and PhyFPS-Bench-Gen. Our evaluations reveal a harsh reality: state-of-the-art video generators suffer from severe PhyFPS misalignment and temporal instability. Finally, we demonstrate that applying PhyFPS corrections significantly improves the human-perceived naturalness of AI-generated videos. Our project page is https://xiangbogaobarry.github.io/Visual_Chronometer/.
Abstract（参考訳）: 最近の生成ビデオモデルは目覚ましいビジュアルリアリズムを達成し、世界モデルとして探求されているが、真の物理シミュレーションは空間と時間の両方をマスターする必要がある。現在のモデルは視覚的に滑らかなキネマティックスを生成することができるが、これらの動きを一貫した実世界の時間スケールでグラウンドする信頼性のある内部運動パルスは欠如している。この時間的曖昧さは、異なる現実世界の速度で動画を無差別に訓練する一般的な習慣に起因し、それらを標準化されたフレームレートに強制する。これはクロノメトリ幻覚(chronometric hallucination)と呼ばれるもので、生成シーケンスは曖昧で不安定で、制御不能な物理運動速度を示す。そこで我々は,入力ビデオの視覚力学から直接物理フレーム/秒(PhyFPS)を復元する予測器であるビジュアルクロノメーターを提案する。制御された時間的リサンプリングによってトレーニングされた本手法は,動作自体が入力する真の時間的スケールを推定し,信頼性の低いメタデータをバイパスする。この問題を体系的に定量化するために、PhyFPS-Bench-RealとPhyFPS-Bench-Genという2つのベンチマークを構築した。我々の評価では、最先端のビデオジェネレータが深刻なPhyFPSのミスアライメントと時間的不安定に悩まされているという厳しい現実が明らかになっている。最後に、PhyFPS補正を適用することで、AI生成ビデオの人間の知覚自然性を大幅に改善することを示す。私たちのプロジェクトページはhttps://xiangbogaobarry.github.io/Visual_Chronometer/です。

論文の概要: The Pulse of Motion: Measuring Physical Frame Rate from Visual Dynamics

関連論文リスト