Fugu-MT 論文翻訳(概要): SGMD: Score Gradient Matching Distillation for Few-Step Video Diffusion Distillation

論文の概要: SGMD: Score Gradient Matching Distillation for Few-Step Video Diffusion Distillation

arxiv url: http://arxiv.org/abs/2605.30116v1
Date: Thu, 28 May 2026 15:50:55 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-30 02:45:56.453666
Title: SGMD: Score Gradient Matching Distillation for Few-Step Video Diffusion Distillation
Title（参考訳）: SGMD-Score Gradient Matching Distillation for Few-Step Video Diffusion Distillation (特集:SGMDとバイオサイバネティックス)
Authors: Zhuguanyu Wu, Ruihao Gong, Yang Yong, Yushi Huang, Xiangyu Fan, Lei Yang, Dahua Lin, Xianglong Liu,
Abstract要約: 分散マッチング蒸留(DMD)は、数ステップのビデオ拡散モデルにおいて、推論を加速するための広く使われているパラダイムである。 textbfScore Gradient Matching Distillation (SGMD)を提案する。教師の停止段階のフィッシャーを安定した分布マッチングの目的として使用しながら、教師に対して偽スコアを直接最適化することで、偽スコアの視点を採用する。時間的一貫性を維持しつつ、4段階蒸留モデルの運動力学を大幅に改善する。
参考スコア（独自算出の注目度）: 57.297118390628384
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Distribution Matching Distillation (DMD) is a widely used paradigm for accelerating inference in few-step video diffusion models. However, DMD-style video distillation faces two coupled challenges: the fake score must track a continuously evolving generator, making training costly when frequent updates are required, while reverse-KL-style matching can be mode-seeking and conservative for preserving strong motion dynamics. To address these issues, we propose \textbf{Score Gradient Matching Distillation (SGMD)}. SGMD adopts a fake-score perspective by directly optimizing the fake score toward the teacher, while using teacher stop-gradient Fisher as a stable distribution-matching objective. We provide a gradient analysis that motivates this objective choice under ideal tracking. Building on this, SGMD introduces a pair of dual potentials: negative-residual (NR) for outer-loop correction and residual-contraction (RC) for inner-loop tracking. Empirically, compared to DMD2, SGMD achieves an approximately $\sim 3\times$ training speedup and substantially improves motion dynamics for 4-step distilled models while preserving temporal consistency. A human study confirms that SGMD is preferred in motion quality and overall preference, while visual quality and text alignment remain comparable. Code is available at https://github.com/ModelTC/LightX2V.
Abstract（参考訳）: 分散マッチング蒸留(Distributed Matching Distillation, DMD)は, 数段階のビデオ拡散モデルにおいて, 推論の高速化に広く用いられているパラダイムである。しかし、DMDスタイルのビデオ蒸留は、2つの複合的な課題に直面している: 偽のスコアは継続的に進化するジェネレータを追跡し、頻繁な更新が必要なときにトレーニングをコストで行わなければならない。これらの問題に対処するため、我々はtextbf{Score Gradient Matching Distillation (SGMD)を提案する。 SGMDは、教師の停止段階フィッシャーを安定した分布マッチング目的として使用しながら、教師に対して偽スコアを直接最適化することで、偽スコアの視点を採用する。我々は、理想的なトラッキングの下で、この客観的選択を動機付ける勾配解析を提供する。これに基づいてSGMDは、外ループ補正のための負残差(NR)と内ループ追跡のための残留収縮(RC)の2つの双対ポテンシャルを導入した。実証的には、SGMDはMDD2と比較して約$\sim 3\times$のトレーニングスピードアップを実現し、時間的一貫性を維持しながら4段階蒸留モデルの運動ダイナミクスを大幅に改善する。人間の研究では、SGMDは動きの質と全体的な好みで好まれる一方で、視覚的品質とテキストアライメントは相容れないことが確認されている。コードはhttps://github.com/ModelTC/LightX2Vで入手できる。

論文の概要: SGMD: Score Gradient Matching Distillation for Few-Step Video Diffusion Distillation

関連論文リスト