Fugu-MT 論文翻訳(概要): SpecDiff: Accelerating Diffusion Model Inference with Self-Speculation

論文の概要: SpecDiff: Accelerating Diffusion Model Inference with Self-Speculation

arxiv url: http://arxiv.org/abs/2509.13848v1
Date: Wed, 17 Sep 2025 09:24:40 GMT
ステータス: 翻訳完了
システム内更新日: 2025-09-18 18:41:50.800617
Title: SpecDiff: Accelerating Diffusion Model Inference with Self-Speculation
Title（参考訳）: SpecDiff:自己スペキュレーションによる拡散モデル推論の高速化
Authors: Jiayi Pan, Jiaming Xu, Yongkang Zhou, Guohao Dai,
Abstract要約: 拡散モデル推論のためのトレーニング不要なマルチレベル特徴キャッシング戦略である textitSpecDiff を提案する。 textitSpecDiffは、特徴重要度スコアの違いを活用してトークンを分類し、マルチレベルの特徴計算戦略を導入する。 textitSpecDiffは、NVIDIA A800-80GB GPUのRFlowと比較して、安定拡散3、3.5、FLUXの無視できる品質損失で平均2.80回、2.74回、3.17回のスピードアップを達成する。
参考スコア（独自算出の注目度）: 13.700117460027164
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Feature caching has recently emerged as a promising method for diffusion model acceleration. It effectively alleviates the inefficiency problem caused by high computational requirements by caching similar features in the inference process of the diffusion model. In this paper, we analyze existing feature caching methods from the perspective of information utilization, and point out that relying solely on historical information will lead to constrained accuracy and speed performance. And we propose a novel paradigm that introduces future information via self-speculation based on the information similarity at the same time step across different iteration times. Based on this paradigm, we present \textit{SpecDiff}, a training-free multi-level feature caching strategy including a cached feature selection algorithm and a multi-level feature classification algorithm. (1) Feature selection algorithm based on self-speculative information. \textit{SpecDiff} determines a dynamic importance score for each token based on self-speculative information and historical information, and performs cached feature selection through the importance score. (2) Multi-level feature classification algorithm based on feature importance scores. \textit{SpecDiff} classifies tokens by leveraging the differences in feature importance scores and introduces a multi-level feature calculation strategy. Extensive experiments show that \textit{SpecDiff} achieves average 2.80 \times, 2.74 \times , and 3.17\times speedup with negligible quality loss in Stable Diffusion 3, 3.5, and FLUX compared to RFlow on NVIDIA A800-80GB GPU. By merging speculative and historical information, \textit{SpecDiff} overcomes the speedup-accuracy trade-off bottleneck, pushing the Pareto frontier of speedup and accuracy in the efficient diffusion model inference.
Abstract（参考訳）: 拡散モデル加速のための有望な方法として、最近フィーチャーキャッシングが登場した。拡散モデルの推論プロセスにおいて、同様の特徴をキャッシュすることで、高い計算要求に起因する効率の低下を効果的に軽減する。本稿では,情報利用の観点から既存の特徴キャッシング手法を解析し,過去の情報のみに頼れば,精度や性能の制約が生じることを指摘する。そこで本研究では,異なる反復時間にまたがる情報類似度に基づいて,自己記述による将来の情報導入を行う新しいパラダイムを提案する。このパラダイムに基づいて、キャッシュされた特徴選択アルゴリズムとマルチレベル特徴分類アルゴリズムを含むトレーニング不要なマルチレベル特徴キャッシング戦略である「textit{SpecDiff}」を提案する。 1)自己推測情報に基づく特徴選択アルゴリズム。 \textit{SpecDiff} は自己推測情報と履歴情報に基づいてトークンごとの動的重要度スコアを決定し、重要度スコアを通じてキャッシュされた特徴選択を行う。 2)特徴重要度スコアに基づく多段階特徴分類アルゴリズム \textit{SpecDiff}は、特徴重要度スコアの違いを活用してトークンを分類し、マルチレベルの特徴計算戦略を導入する。大規模な実験では、NVIDIA A800-80GB GPUのRFlowと比較して、安定拡散3,3.5,FLUXにおいて、平均2.80 \times、2.74 \times、3.17\timesのスピードアップが無視できる品質損失を達成している。投機的および歴史的情報をマージすることで、 \textit{SpecDiff} はスピードアップ・精度のトレードオフボトルネックを克服し、効率的な拡散モデル推論におけるスピードアップと精度のパレートフロンティアを押し上げる。

論文の概要: SpecDiff: Accelerating Diffusion Model Inference with Self-Speculation

関連論文リスト