Fugu-MT 論文翻訳(概要): UnfoldLDM: Deep Unfolding-based Blind Image Restoration with Latent Diffusion Priors

論文の概要: UnfoldLDM: Deep Unfolding-based Blind Image Restoration with Latent Diffusion Priors

arxiv url: http://arxiv.org/abs/2511.18152v1
Date: Sat, 22 Nov 2025 18:44:01 GMT
ステータス: 翻訳完了
システム内更新日: 2025-11-25 18:34:24.656036
Title: UnfoldLDM: Deep Unfolding-based Blind Image Restoration with Latent Diffusion Priors
Title（参考訳）: UnfoldLDM:遅延拡散前処理による深部展開型ブラインド画像復元
Authors: Chunming He, Rihan Zhang, Zheng Chen, Bowen Yang, CHengyu Fang, Yunlong Lin, Fengyang Xiao, Sina Farsiu,
Abstract要約: そこで我々は,UnfoldLDMを提案し,深部展開ネットワークと潜時拡散モデル(LDM)を統合してブラインド画像復元を行う。各段階において、UnfoldLDMは勾配降下ステップとして多粒度劣化認識モジュール(MGDA)を用いる。本研究では, 劣化抑制型LCM (DR-LDM) を設計し, コンパクトな劣化不変前駆体を抽出する。このユニークな組み合わせは、最終的な結果が劣化のない、視覚的に豊かなものであることを保証します。
参考スコア（独自算出の注目度）: 21.35552789229871
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Deep unfolding networks (DUNs) combine the interpretability of model-based methods with the learning ability of deep networks, yet remain limited for blind image restoration (BIR). Existing DUNs suffer from: (1) \textbf{Degradation-specific dependency}, as their optimization frameworks are tied to a known degradation model, making them unsuitable for BIR tasks; and (2) \textbf{Over-smoothing bias}, resulting from the direct feeding of gradient descent outputs, dominated by low-frequency content, into the proximal term, suppressing fine textures. To overcome these issues, we propose UnfoldLDM to integrate DUNs with latent diffusion model (LDM) for BIR. In each stage, UnfoldLDM employs a multi-granularity degradation-aware (MGDA) module as the gradient descent step. MGDA models BIR as an unknown degradation estimation problem and estimates both the holistic degradation matrix and its decomposed forms, enabling robust degradation removal. For the proximal step, we design a degradation-resistant LDM (DR-LDM) to extract compact degradation-invariant priors from the MGDA output. Guided by this prior, an over-smoothing correction transformer (OCFormer) explicitly recovers high-frequency components and enhances texture details. This unique combination ensures the final result is degradation-free and visually rich. Experiments show that our UnfoldLDM achieves a leading place on various BIR tasks and benefits downstream tasks. Moreover, our design is compatible with existing DUN-based methods, serving as a plug-and-play framework. Code will be released.
Abstract（参考訳）: ディープ・アンフォールディング・ネットワーク(DUN)は、モデルベースの手法の解釈可能性とディープ・ネットワークの学習能力を組み合わせるが、ブラインド・イメージ・リカバリ(BIR)に限られる。既存の DUN には、(1) \textbf{Degradation-specific dependency} という最適化フレームワークが既知の劣化モデルに結びついているので、BIR タスクには適さない。これらの課題を克服するため、我々はUnfoldLDMを提案し、BIRのためのDUNと潜在拡散モデル(LDM)を統合する。各段階において、UnfoldLDMは勾配降下ステップとして多粒度劣化認識モジュール(MGDA)を用いる。 MGDAは、BIRを未知の劣化推定問題としてモデル化し、全体分解行列とその分解形式の両方を推定し、堅牢な劣化除去を可能にする。近位ステップでは、MGDA出力からコンパクトな劣化不変前駆体を抽出する耐劣化性LDM(DR-LDM)を設計する。この前もって、OCFormer(Over-smoothing correct transformer)は、高周波数成分を明示的に回収し、テクスチャの詳細を強化する。このユニークな組み合わせは、最終的な結果が劣化のない、視覚的に豊かなものであることを保証します。実験の結果、UnfoldLDMは様々なBIRタスクや下流タスクにおいて主要な位置を占めます。さらに、DUNベースの既存のメソッドと互換性があり、プラグイン・アンド・プレイのフレームワークとして機能します。コードはリリースされる。

論文の概要: UnfoldLDM: Deep Unfolding-based Blind Image Restoration with Latent Diffusion Priors

関連論文リスト