Fugu-MT 論文翻訳(概要): Teacher-Feature Drifting: One-Step Diffusion Distillation with Pretrained Diffusion Representations

論文の概要: Teacher-Feature Drifting: One-Step Diffusion Distillation with Pretrained Diffusion Representations

arxiv url: http://arxiv.org/abs/2605.07327v1
Date: Fri, 08 May 2026 06:33:33 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-11 19:43:38.861171
Title: Teacher-Feature Drifting: One-Step Diffusion Distillation with Pretrained Diffusion Representations
Title（参考訳）: 教師機能ドリフト:1ステップ拡散蒸留と事前拡散表現
Authors: Yuan Zhang, Chenyi Li, Guoqing Ma, Jiajun Zha, Yuanming Yang, Bo Wang, Wei Tang, Wenbo Li, Haoyang Huang, Nan Duan,
Abstract要約: また, 一段蒸留を簡略化するために, 一段のドリフト損失を直接利用できることが示唆された。追加の事前訓練された特徴抽出器に依存するオリジナルのドリフトモデルとは異なり、事前訓練された教師モデルの中間的隠れ状態が特徴表現として使用される。提案手法は、画像品質と多様性の競争力のあるワンステップ生成を実現し、ImageNet-64$times$64およびSDXL18.4のFIDスコアを1.58とする。
参考スコア（独自算出の注目度）: 48.0169045294604
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Sampling from pretrained diffusion and flow-matching models typically requires many forward passes to generate diverse and high-fidelity images. Existing distillation methods often rely on multiple auxiliary networks, carefully designed training stages, or complex optimization pipelines. In this work, we revisit the recently proposed Drifting Model objective and show that a single drifting loss can be directly used to simplify one step distillation. A key observation is that the pretrained diffusion teacher itself already provides a strong representation space. Unlike the original Drifting Model, which relies on an additional pretrained feature extractor, we use intermediate hidden states of the pretrained teacher model as the feature representation. This removes the need for training or introducing an extra representation network while preserving a semantically meaningful feature geometry for drifting. Furthermore, we introduce a lightweight mode coverage loss to mitigate mode collapse during distillation and encourage the student generator to cover diverse teacher-supported regions. Extensive experiments on ImageNet and SDXL demonstrate that our method achieves efficient one step generation with competitive image quality and diversity, achieving FID scores of 1.58 on ImageNet-64$\times$64 and 18.4 on SDXL, while substantially simplifying the overall distillation framework.
Abstract（参考訳）: 事前訓練された拡散と流れマッチングモデルからのサンプリングは通常、多彩で高忠実な画像を生成するために多くの前方通過を必要とする。既存の蒸留法は、しばしば複数の補助的なネットワーク、慎重に設計された訓練段階、複雑な最適化パイプラインに依存している。本研究では,最近提案したドリフトモデル目標を再検討し,一段蒸留を簡略化するために,単一のドリフト損失を直接利用できることを示す。鍵となる観察は、事前訓練された拡散教師自身が既に強力な表現空間を提供していることである。追加の事前訓練された特徴抽出器に依存するオリジナルのドリフトモデルとは異なり、事前訓練された教師モデルの中間的隠れ状態が特徴表現として使用される。これにより、ドリフトのための意味論的に意味のある特徴幾何学を維持しながら、トレーニングや余分な表現ネットワークの導入が不要になる。さらに,蒸留時のモード崩壊を軽減し,多様な教師支援地域をカバーすることを促すために,軽量モードカバレッジ損失を導入する。 ImageNet-64$\times$64, SDXL$18.4ではFIDスコア1.58, SDXLではFIDスコア1.58, SDXLでは18.4, 全体の蒸留フレームワークは大幅に簡略化された。

論文の概要: Teacher-Feature Drifting: One-Step Diffusion Distillation with Pretrained Diffusion Representations

関連論文リスト