Fugu-MT 論文翻訳(概要): PeLAP-A: Adaptive Latent Pruning for Lightweight Latent Diffusion Models

論文の概要: PeLAP-A: Adaptive Latent Pruning for Lightweight Latent Diffusion Models

arxiv url: http://arxiv.org/abs/2606.23086v1
Date: Mon, 22 Jun 2026 09:33:19 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-25 00:18:04.511266
Title: PeLAP-A: Adaptive Latent Pruning for Lightweight Latent Diffusion Models
Title（参考訳）: PeLAP-A:軽量潜伏拡散モデルのための適応潜伏プルーニング
Authors: Kissa Zahra, Zaib Un Nisa,
Abstract要約: PeLAP-A(Adaptive Latent Pruning for Diffusion)は、標準的な潜伏拡散パイプラインを学習可能なチャネルワイドな予測器で拡張する軽量フレームワークである。アグレッシブなスパシティ規則化の下では、重要予測器は全ての潜伏チャネルをほぼゼロに駆動するが、デノナイズされたUNetは拡散損失を低くする。これらの知見は、潜伏拡散訓練におけるスパーシティダイナミクスの探索的研究であり、潜伏流路抑制に対してかなり頑健であることを示すものである。
参考スコア（独自算出の注目度）: 0.0
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Latent diffusion models achieve strong generative performance by operating in a compressed latent space produced by a variational autoencoder (VAE). However, it remains unclear whether all latent channels contribute equally to the diffusion process, or whether significant redundancy exists. We introduce PeLAP-A (Adaptive Latent Pruning for Diffusion), a lightweight framework that augments a standard latent diffusion pipeline with a learnable channel-wise importance predictor. A two-layer MLP operating on globally pooled latent features produces a soft mask that suppresses unimportant latent channels before they enter the denoising UNet. The entire system is trained jointly on CIFAR-10 under a combined diffusion, reconstruction, and sparsity loss. Experiments reveal a striking result: under aggressive sparsity regularization (lambda = 0.01), the importance predictor drives all latent channels to near-zero yet the denoising UNet achieves lower diffusion loss (0.0236 vs. 0.0240) and lower VAE reconstruction MSE (22.59 vs. 24.67) compared to the unpruned baseline. We term this the sparsity collapse phenomenon and provide an analysis of why it occurs and what it reveals about the information requirements of latent diffusion models. These findings constitute an exploratory study of sparsity dynamics in latent diffusion training, and demonstrate that denoising UNets can remain remarkably robust to latent channel suppression even under aggressive regularization. Code is available at: https://github.com/kissasium/PeLAP-A.git.
Abstract（参考訳）: 変分オートエンコーダ(VAE)によって生成された圧縮潜時空間で演算することで、潜時拡散モデルによる強い生成性能を実現する。しかしながら、全ての潜伏チャネルが拡散過程に等しく寄与するか、あるいは大きな冗長性が存在するかは、まだ不明である。 PeLAP-A(Adaptive Latent Pruning for Diffusion)は、標準的な潜伏拡散パイプラインを学習可能なチャネルワイドな予測器で拡張する軽量フレームワークである。グローバルプールされた潜在機能で動作する2層MLPは、デノイングUNetに入る前に重要でない潜在チャネルを抑えるソフトマスクを生成する。システム全体はCIFAR-10で共同で、拡散、再構築、空間的損失の複合で訓練される。攻撃的なスパーシリティ正規化(lambda = 0.01)の下では、重要予測器は全ての潜伏チャネルをほぼゼロに駆動するが、デノナイズされたUNetは拡散損失(0.0236 vs. 0.0240)を低くし、VAE再構成MSE(22.59 vs. 24.67)を未実行ベースラインと比較する。これを空間崩壊現象と呼び、なぜそれが起こるのか、また潜伏拡散モデルの情報要求について何を明らかにするのかを解析する。これらの知見は潜伏拡散訓練におけるスパーシティダイナミクスの探索的研究であり、アグレッシブ・レギュレーションの下でも潜伏チャネルの抑制に対してかなり頑健であることを示す。コードは、https://github.com/kissasium/PeLAP-A.git.comで入手できる。

論文の概要: PeLAP-A: Adaptive Latent Pruning for Lightweight Latent Diffusion Models

関連論文リスト