Fugu-MT 論文翻訳(概要): LightCache: Memory-Efficient, Training-Free Acceleration for Video Generation

論文の概要: LightCache: Memory-Efficient, Training-Free Acceleration for Video Generation

arxiv url: http://arxiv.org/abs/2510.05367v1
Date: Mon, 06 Oct 2025 20:54:44 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-08 17:57:07.980013
Title: LightCache: Memory-Efficient, Training-Free Acceleration for Video Generation
Title（参考訳）: LightCache: ビデオ生成のためのメモリ効率が高く、トレーニング不要なアクセラレーション
Authors: Yang Xiao, Gen Li, Kaiyuan Deng, Yushu Wu, Zheng Zhan, Yanzhi Wang, Xiaolong Ma, Bo Hui,
Abstract要約: 拡散モデルに基づくビデオ生成の先進的な研究領域として、トレーニングフリー加速が出現している。本稿では,推論過程を符号化・復号化・復号化段階に分解する。本稿では,メモリ消費を減らすためのステージ固有の戦略を提案する。
参考スコア（独自算出の注目度）: 40.968338980157846
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Training-free acceleration has emerged as an advanced research area in video generation based on diffusion models. The redundancy of latents in diffusion model inference provides a natural entry point for acceleration. In this paper, we decompose the inference process into the encoding, denoising, and decoding stages, and observe that cache-based acceleration methods often lead to substantial memory surges in the latter two stages. To address this problem, we analyze the characteristics of inference across different stages and propose stage-specific strategies for reducing memory consumption: 1) Asynchronous Cache Swapping. 2) Feature chunk. 3) Slicing latents to decode. At the same time, we ensure that the time overhead introduced by these three strategies remains lower than the acceleration gains themselves. Compared with the baseline, our approach achieves faster inference speed and lower memory usage, while maintaining quality degradation within an acceptable range. The Code is available at https://github.com/NKUShaw/LightCache .
Abstract（参考訳）: 拡散モデルに基づくビデオ生成の先進的な研究領域として、トレーニングフリー加速が出現している。拡散モデル推論における潜伏剤の冗長性は、加速度の自然なエントリポイントを提供する。本稿では,推論過程を符号化,復号化,復号化段階に分解し,キャッシュベースの高速化手法が後半2段階のメモリ急増の原因となる場合が多いことを観察する。この問題に対処するために、異なる段階にわたる推論の特徴を分析し、メモリ消費を減らすためのステージ固有の戦略を提案する。 1)非同期キャッシュスワッピング。 2)特徴チャンク。 3) 遅延を復号化するためのスライシング。同時に、これらの3つの戦略によってもたらされる時間オーバーヘッドが加速よりも低いことを保証します。ベースラインと比較すると,提案手法は許容範囲内で品質劣化を維持しつつ,推論速度の向上とメモリ使用量の削減を実現している。コードはhttps://github.com/NKUShaw/LightCacheで入手できる。

論文の概要: LightCache: Memory-Efficient, Training-Free Acceleration for Video Generation

関連論文リスト