Fugu-MT 論文翻訳(概要): OBS-Diff: Accurate Pruning For Diffusion Models in One-Shot

論文の概要: OBS-Diff: Accurate Pruning For Diffusion Models in One-Shot

arxiv url: http://arxiv.org/abs/2510.06751v1
Date: Wed, 08 Oct 2025 08:19:15 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-09 16:41:20.363043
Title: OBS-Diff: Accurate Pruning For Diffusion Models in One-Shot
Title（参考訳）: OBS-Diff:1ショットで拡散モデルの正確なプルーニング
Authors: Junhan Zhu, Hesong Wang, Mingluo Su, Zefang Wang, Huan Wang,
Abstract要約: OBS-Diffは、大規模テキスト・画像拡散モデルの正確かつトレーニング不要な圧縮を可能にする、新しいワンショットプルーニングフレームワークである。広汎な実験により、OBS-Diffは拡散モデルに対する最先端のワンショットプルーニングを実現し、視覚的品質の最小限の劣化を伴う推論加速を実現する。
参考スコア（独自算出の注目度）: 4.990334603434127
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Large-scale text-to-image diffusion models, while powerful, suffer from prohibitive computational cost. Existing one-shot network pruning methods can hardly be directly applied to them due to the iterative denoising nature of diffusion models. To bridge the gap, this paper presents OBS-Diff, a novel one-shot pruning framework that enables accurate and training-free compression of large-scale text-to-image diffusion models. Specifically, (i) OBS-Diff revitalizes the classic Optimal Brain Surgeon (OBS), adapting it to the complex architectures of modern diffusion models and supporting diverse pruning granularity, including unstructured, N:M semi-structured, and structured (MHA heads and FFN neurons) sparsity; (ii) To align the pruning criteria with the iterative dynamics of the diffusion process, by examining the problem from an error-accumulation perspective, we propose a novel timestep-aware Hessian construction that incorporates a logarithmic-decrease weighting scheme, assigning greater importance to earlier timesteps to mitigate potential error accumulation; (iii) Furthermore, a computationally efficient group-wise sequential pruning strategy is proposed to amortize the expensive calibration process. Extensive experiments show that OBS-Diff achieves state-of-the-art one-shot pruning for diffusion models, delivering inference acceleration with minimal degradation in visual quality.
Abstract（参考訳）: 大規模テキスト・画像拡散モデルは強力ではあるが、計算コストの禁止に悩まされている。既存のワンショット・ネットワーク・プルーニング法は拡散モデルの反復的復調性のために直接適用することは困難である。このギャップを埋めるために,大規模なテキスト・画像拡散モデルの正確かつトレーニング不要な圧縮を可能にする,新しいワンショットプルーニングフレームワークOBS-Diffを提案する。具体的には i)OBS-Diffは、古典的な最適脳サージオン(OBS)を再活性化し、近代拡散モデルの複雑なアーキテクチャに適応し、非構造化、N:M半構造化、構造化(MHAヘッドとFFNニューロン)の空間性を含む多様なプルーニング粒度をサポートする。二拡散過程の反復力学と整合させるため、誤差蓄積の観点から問題を検証し、対数-減少重み付け方式を取り入れた新しい時間ステップ対応ヘッセン構成を提案し、潜在的なエラーの蓄積を軽減するために、早期の時間ステップにより高い重要性を割り当てる。さらに, 高額なキャリブレーションプロセスを改善するために, 計算効率のよいグループワイド・シーケンシャル・プルーニング戦略を提案する。広汎な実験により、OBS-Diffは拡散モデルに対する最先端のワンショットプルーニングを実現し、視覚的品質の最小限の劣化を伴う推論加速を実現する。

論文の概要: OBS-Diff: Accurate Pruning For Diffusion Models in One-Shot

関連論文リスト