Fugu-MT 論文翻訳(概要): DiCache: Let Diffusion Model Determine Its Own Cache

論文の概要: DiCache: Let Diffusion Model Determine Its Own Cache

arxiv url: http://arxiv.org/abs/2508.17356v2
Date: Thu, 02 Oct 2025 14:42:41 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-03 16:59:20.032831
Title: DiCache: Let Diffusion Model Determine Its Own Cache
Title（参考訳）: DiCache: 拡散モデルが独自のキャッシュを決定する
Authors: Jiazi Bu, Pengyang Ling, Yujie Zhou, Yibin Wang, Yuhang Zang, Dahua Lin, Jiaqi Wang,
Abstract要約: DiCacheは、実行時に拡散モデルを加速するためのトレーニング不要のアダプティブキャッシュ戦略である。 Online Probe Profiling Schemeは浅層オンラインプローブを利用して,キャッシュエラーのオンザフライインジケータをリアルタイムで取得する。 Dynamic Cache Trajectory Alignmentは、マルチステップの履歴キャッシュから出力されるディープ層の特徴を近似する。
参考スコア（独自算出の注目度）: 62.954717254728166
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Recent years have witnessed the rapid development of acceleration techniques for diffusion models, especially caching-based acceleration methods. These studies seek to answer two fundamental questions: "When to cache" and "How to use cache", typically relying on predefined empirical laws or dataset-level priors to determine caching timings and adopting handcrafted rules for multi-step cache utilization. However, given the highly dynamic nature of the diffusion process, they often exhibit limited generalizability and fail to cope with diverse samples. In this paper, a strong sample-specific correlation is revealed between the variation patterns of the shallow-layer feature differences in the diffusion model and those of deep-layer features. Moreover, we have observed that the features from different model layers form similar trajectories. Based on these observations, we present DiCache, a novel training-free adaptive caching strategy for accelerating diffusion models at runtime, answering both when and how to cache within a unified framework. Specifically, DiCache is composed of two principal components: (1) Online Probe Profiling Scheme leverages a shallow-layer online probe to obtain an on-the-fly indicator for the caching error in real time, enabling the model to dynamically customize the caching schedule for each sample. (2) Dynamic Cache Trajectory Alignment adaptively approximates the deep-layer feature output from multi-step historical caches based on the shallow-layer feature trajectory, facilitating higher visual quality. Extensive experiments validate DiCache's capability in achieving higher efficiency and improved fidelity over state-of-the-art approaches on various leading diffusion models including WAN 2.1, HunyuanVideo and Flux.
Abstract（参考訳）: 近年,拡散モデル,特にキャッシングに基づく加速法の開発が急速に進んでいる。これらの研究は、“キャッシュのタイミング”と“キャッシュの使い方”という2つの基本的な疑問に答えようとしている。しかし、拡散過程の非常にダイナミックな性質を考えると、それらはしばしば限定的な一般化性を示し、多様なサンプルに対処できない。本稿では, 拡散モデルにおける浅層特徴差の変動パターンと深層特徴の変動パターンとの間に, 試料比の強い相関関係を明らかにする。さらに,異なるモデル層の特徴が類似した軌道を形成することも確認した。これらの観測に基づいて、我々は、実行時に拡散モデルを加速するための新しいトレーニングフリー適応キャッシュ戦略であるDiCacheを紹介し、いつ、どのように統合されたフレームワーク内でキャッシュするかを問う。オンラインプローブプロファイリングスキームは浅層オンラインプローブを利用してキャッシュエラーのオンザフライインジケータをリアルタイムで取得し、各サンプルのキャッシュスケジュールを動的にカスタマイズする。 2)動的キャッシュトラジェクトリアライメントは,浅層機能トラジェクトリに基づいて複数ステップの履歴キャッシュから出力されるディープレイヤ機能を適応的に近似し,視覚的品質を向上させる。広範にわたる実験により、WAN 2.1、HunyuanVideo、Fluxなど、様々な主要な拡散モデルに対する最先端のアプローチに対するDiCacheの高効率化と忠実性の向上が検証された。

論文の概要: DiCache: Let Diffusion Model Determine Its Own Cache

関連論文リスト