Fugu-MT 論文翻訳(概要): SlimDiffSR: Toward Lightweight and Efficient Remote Sensing Image Super-Resolution via Diffusion Model Distillation

論文の概要: SlimDiffSR: Toward Lightweight and Efficient Remote Sensing Image Super-Resolution via Diffusion Model Distillation

arxiv url: http://arxiv.org/abs/2605.02198v1
Date: Mon, 04 May 2026 03:54:54 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-05 20:33:50.129537
Title: SlimDiffSR: Toward Lightweight and Efficient Remote Sensing Image Super-Resolution via Diffusion Model Distillation
Title（参考訳）: SlimDiffSR:拡散モデル蒸留による軽量・高効率リモートセンシング画像超解像に向けて
Authors: Ce Wang, Zhenyu Hu, Wanjie Sun,
Abstract要約: SlimDiffSRは、現実世界のリモートセンシング画像の超高解像度化のための軽量で効率的な拡散ベースのフレームワークである。推論アクセラレーションは最大200ドル、モデルパラメータはマルチステップの拡散モデルと比較して20ドルまで削減できる。
参考スコア（独自算出の注目度）: 10.518150741228862
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Diffusion models have recently achieved remarkable performance in image super-resolution (SR), but their high computational cost limits practical deployment in remote sensing applications. To address this issue, we propose SlimDiffSR, a lightweight and efficient diffusion-based framework for real-world remote sensing image super-resolution. Unlike existing single-step diffusion methods that rely on fixed timesteps, we first introduce an uncertainty-guided timestep assignment strategy to construct a stronger single-step teacher model, where reconstruction difficulty is explicitly linked to diffusion timesteps, enabling adaptive generative strength. Building upon this teacher, we further present a structured pruning strategy tailored to remote sensing imagery, which systematically removes redundant semantic modules and replaces standard operations with lightweight designs, including frequency-separable convolution, direction-separable convolution, and a query-driven global aggregation module. These components explicitly exploit the unique characteristics of remote sensing data, such as sparse high-frequency details, strong directional patterns, and long-range spatial dependencies. To enhance knowledge transfer, we incorporate Maximum Mean Discrepancy (MMD) into the distillation process to align feature distributions between the teacher and student models. Extensive experiments on multiple remote sensing benchmarks demonstrate that SlimDiffSR achieves a favorable balance between efficiency and reconstruction quality. In particular, it attains up to $200\times$ inference acceleration and a $20\times$ reduction in model parameters compared with multi-step diffusion models, while achieving competitive perceptual quality and clearly outperforming existing lightweight diffusion baselines in efficiency. The code is available at: https://github.com/wwangcece/SlimDiffSR.
Abstract（参考訳）: 拡散モデルは最近、画像超解像(SR)において顕著な性能を達成しているが、その高い計算コストはリモートセンシングアプリケーションにおける実用的な展開を制限している。この問題に対処するために,実世界のリモートセンシング画像の超高解像度化のための軽量かつ効率的な拡散ベースフレームワークであるSlimDiffSRを提案する。固定時間ステップに依存する既存の単段階拡散法とは異なり、我々はまず、より強力な単段階教師モデルを構築するための不確実性誘導時間ステップ割り当て戦略を導入する。この教師を基盤として、リモートセンシング画像に適した構造化プルーニング戦略を提案し、冗長なセマンティックモジュールを体系的に除去し、周波数分離可能な畳み込み、方向分離可能な畳み込み、クエリ駆動のグローバルアグリゲーションモジュールを含む軽量な設計で標準操作を置き換える。これらのコンポーネントは、細かな高周波の詳細、強い方向パターン、長距離空間依存など、リモートセンシングデータのユニークな特性を明示的に活用する。知識伝達の促進を目的として, 最大平均離散度(MMD)を蒸留プロセスに組み込んで, 教師モデルと学生モデルの特徴分布を整合させる。複数のリモートセンシングベンチマークの大規模な実験により、SlimDiffSRは効率と再構築品質のバランスが良好であることが示された。特に、最大200\times$推論アクセラレーションと20\times$モデルパラメータをマルチステップ拡散モデルと比較し、競争力のある知覚品質を実現し、既存の軽量拡散ベースラインを効率良く上回る。コードは、https://github.com/wwangcece/SlimDiffSR.comで入手できる。

論文の概要: SlimDiffSR: Toward Lightweight and Efficient Remote Sensing Image Super-Resolution via Diffusion Model Distillation

関連論文リスト