Fugu-MT 論文翻訳(概要): Motion-Aware Caching for Efficient Autoregressive Video Generation

論文の概要: Motion-Aware Caching for Efficient Autoregressive Video Generation

arxiv url: http://arxiv.org/abs/2605.01725v1
Date: Sun, 03 May 2026 05:49:27 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-05 20:33:49.905872
Title: Motion-Aware Caching for Efficient Autoregressive Video Generation
Title（参考訳）: 効率的な自己回帰映像生成のための動き認識キャッシング
Authors: Jing Xu, Yuexiao Ma, Songwei Liu, Xuzhe Zheng, Shiwei Liu, Chenqian Yan, Xiawu Zheng, Rongrong Ji, Fei Chao, Xing Wang,
Abstract要約: MotionCacheは、ピクセルレベルのモーション特性の軽量プロキシとして、フレーム間の差異を利用する。 SkyReels-V2やMAGI-1のような最先端モデルの実験では、MotionCacheが大幅なスピードアップを実現している。
参考スコア（独自算出の注目度）: 73.27762884730272
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Autoregressive video generation paradigms offer theoretical promise for long video synthesis, yet their practical deployment is hindered by the computational burden of sequential iterative denoising. While cache reuse strategies can accelerate generation by skipping redundant denoising steps, existing methods rely on coarse-grained chunk-level skipping that fails to capture fine-grained pixel dynamics. This oversight is critical: pixels with high motion require more denoising steps to prevent error accumulation, while static pixels tolerate aggressive skipping. We formalize this insight theoretically by linking cache errors to residual instability, and propose MotionCache, a motion-aware cache framework that exploits inter-frame differences as a lightweight proxy for pixel-level motion characteristics. MotionCache employs a coarse-to-fine strategy: an initial warm-up phase establishes semantic coherence, followed by motion-weighted cache reuse that dynamically adjusts update frequencies per token. Extensive experiments on state-of-the-art models like SkyReels-V2 and MAGI-1 demonstrate that MotionCache achieves significant speedups of $\textbf{6.28}\times$ and $\textbf{1.64}\times$ respectively, while effectively preserving generation quality (VBench: $1\%\downarrow$ and $0.01\%\downarrow$ respectively). The code is available at https://github.com/ywlq/MotionCache.
Abstract（参考訳）: 自己回帰ビデオ生成のパラダイムは、長いビデオ合成の理論的な約束を提供するが、それらの実践的な展開は、逐次反復的復調の計算負担によって妨げられる。キャッシュ再利用戦略は冗長なデノジングステップをスキップすることで生成を加速するが、既存の方法は粗いチャンクレベルのスキップに依存しており、微細なピクセルのダイナミックスをキャプチャできない。高い動きを持つピクセルは、エラーの蓄積を防ぐためにより多くのデノイングステップを必要とし、静的なピクセルはアグレッシブなスキップを許容する。我々は、キャッシュエラーと残像の不安定性をリンクすることで理論的にこの知見を定式化し、フレーム間の差を利用したモーション対応キャッシュフレームワークであるMotionCacheを画素レベルの動作特性の軽量プロキシとして提案する。初期ウォームアップフェーズはセマンティックコヒーレンスを確立し、その後にトークン毎に更新頻度を動的に調整するモーション重み付けキャッシュを再使用する。 SkyReels-V2 や MAGI-1 のような最先端モデルに対する大規模な実験は、MotionCache がそれぞれ$\textbf{6.28}\times$ と $\textbf{1.64}\times$ の大幅なスピードアップを達成し、生成品質を効果的に保ちながら(VBench: $1\%\downarrow$ と $0.01\%\downarrow$ )。コードはhttps://github.com/ywlq/MotionCacheで入手できる。

論文の概要: Motion-Aware Caching for Efficient Autoregressive Video Generation

関連論文リスト