Fugu-MT 論文翻訳(概要): Post-Training Quantization via Residual Truncation and Zero Suppression for Diffusion Models

論文の概要: Post-Training Quantization via Residual Truncation and Zero Suppression for Diffusion Models

arxiv url: http://arxiv.org/abs/2509.26436v1
Date: Tue, 30 Sep 2025 15:55:42 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-01 14:45:00.191749
Title: Post-Training Quantization via Residual Truncation and Zero Suppression for Diffusion Models
Title（参考訳）: 拡散モデルにおける残留トラニケーションとゼロ抑制による後学習量子化
Authors: Donghoon Kim, Dongyoung Lee, Ik Joon Chang, Sung-Ho Bae,
Abstract要約: 拡散モデルは、高品質な画像生成を実現するが、高い計算要求のため、デプロイメントの課題に直面している。拡散モデルのための4ビットPTQスキームであるResidual Truncation and Zero Suppression (QuaRTZ) による量子化を提案する。提案手法は,外乱保存とLSB精度のバランスをとることにより,丸め誤差を低減し,量子化効率を向上させる。
参考スコア（独自算出の注目度）: 10.000323762676633
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Diffusion models achieve high-quality image generation but face deployment challenges due to their high computational requirements. Although 8-bit outlier-aware post-training quantization (PTQ) matches full-precision performance, extending PTQ to 4 bits remains challenging. Larger step sizes in 4-bit quantization amplify rounding errors in dense, low-magnitude activations, leading to the loss of fine-grained textures. We hypothesize that not only outliers but also small activations are critical for texture fidelity. To this end, we propose Quantization via Residual Truncation and Zero Suppression (QuaRTZ), a 4-bit PTQ scheme for diffusion models. QuaRTZ applies 8-bit min-max quantization for outlier handling and compresses to 4 bits via leading-zero suppression to retain LSBs, thereby preserving texture details. Our approach reduces rounding errors and improves quantization efficiency by balancing outlier preservation and LSB precision. Both theoretical derivations and empirical evaluations demonstrate the generalizability of QuaRTZ across diverse activation distributions. Notably, 4-bit QuaRTZ achieves an FID of 6.98 on FLUX.1-schnell, outperforming SVDQuant that requires auxiliary FP16 branches.
Abstract（参考訳）: 拡散モデルは、高品質な画像生成を実現するが、高い計算要求のため、デプロイメントの課題に直面している。 8ビットのアウトリー・アウェア・ポストトレーニング量子化(PTQ)は完全精度に適合するが、PTQを4ビットに拡張することは依然として困難である。 4ビット量子化におけるより大きなステップサイズは、密度の低い高次の活性化における丸め誤差を増幅し、きめ細かいテクスチャが失われる。我々は、アウトレーヤだけでなく、小さなアクティベーションもテクスチャの忠実度に重要であると仮定する。そこで本研究では,拡散モデルのための4ビットPTQスキームであるResidual Truncation and Zero Suppression (QuaRTZ) による量子化を提案する。 QuaRTZ は8ビットの min-max 量子化を外周処理に適用し、LSB を保持するために先行ゼロの抑圧によって4ビットに圧縮し、テクスチャの詳細を保存する。提案手法は,外乱保存とLSB精度のバランスをとることにより,丸め誤差を低減し,量子化効率を向上させる。理論的導出と経験的評価の両方が、様々な活性化分布における QuaRTZ の一般化可能性を示している。 4ビットのQuaRTZはFLUX.1-schnell上で6.98のFIDを達成し、補助的なFP16ブランチを必要とするSVDQuantを上回っている。

論文の概要: Post-Training Quantization via Residual Truncation and Zero Suppression for Diffusion Models

関連論文リスト