Fugu-MT 論文翻訳(概要): Unleashing the Potential of the Semantic Latent Space in Diffusion Models for Image Dehazing

論文の概要: Unleashing the Potential of the Semantic Latent Space in Diffusion Models for Image Dehazing

arxiv url: http://arxiv.org/abs/2509.20091v1
Date: Wed, 24 Sep 2025 13:11:37 GMT
ステータス: 翻訳完了
システム内更新日: 2025-09-25 20:53:19.818401
Title: Unleashing the Potential of the Semantic Latent Space in Diffusion Models for Image Dehazing
Title（参考訳）: 画像デハジング用拡散モデルにおけるセマンティック潜時空間の可能性
Authors: Zizheng Yang, Hu Yu, Bing Li, Jinghao Zhang, Jie Huang, Feng Zhao,
Abstract要約: DiffLI$2$Dという画像デハジングのための拡散遅延インスパイアネットワークを提案する。まず、事前学習した拡散モデルのセマンティック潜時空間が、画像の内容とヘイズ特性を表現することができることを明らかにした。異なる時間ステップでの拡散遅延表現を繊細に設計したデハジングネットワークに統合し、画像デハジングの指示を与える。
参考スコア（独自算出の注目度）: 25.138589492384654
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Diffusion models have recently been investigated as powerful generative solvers for image dehazing, owing to their remarkable capability to model the data distribution. However, the massive computational burden imposed by the retraining of diffusion models, coupled with the extensive sampling steps during the inference, limit the broader application of diffusion models in image dehazing. To address these issues, we explore the properties of hazy images in the semantic latent space of frozen pre-trained diffusion models, and propose a Diffusion Latent Inspired network for Image Dehazing, dubbed DiffLI$^2$D. Specifically, we first reveal that the semantic latent space of pre-trained diffusion models can represent the content and haze characteristics of hazy images, as the diffusion time-step changes. Building upon this insight, we integrate the diffusion latent representations at different time-steps into a delicately designed dehazing network to provide instructions for image dehazing. Our DiffLI$^2$D avoids re-training diffusion models and iterative sampling process by effectively utilizing the informative representations derived from the pre-trained diffusion models, which also offers a novel perspective for introducing diffusion models to image dehazing. Extensive experiments on multiple datasets demonstrate that the proposed method achieves superior performance to existing image dehazing methods. Code is available at https://github.com/aaaasan111/difflid.
Abstract（参考訳）: 近年,拡散モデルが画像デハージングの強力な生成的解法として研究されている。しかし,拡散モデルの再訓練によって課される膨大な計算負担は,画像デハジングにおける拡散モデルの適用範囲を制限している。これらの問題に対処するため、凍結事前学習拡散モデルのセマンティック潜在空間におけるハズ画像の特性を探索し、DiffLI$2$Dと呼ばれる画像デハジングのための拡散潜在インスパイアされたネットワークを提案する。具体的には,事前学習した拡散モデルのセマンティック潜時空間が,拡散時間の経過とともにヘイズ画像の内容とヘイズ特性を表現できることを明らかにする。この知見に基づいて、異なる時間ステップでの拡散潜時表現を微妙に設計された脱ハージングネットワークに統合し、脱ハージングの指示を与える。我々のDiffLI$^2$Dは、事前学習した拡散モデルから得られた情報表現を効果的に活用することにより、再学習拡散モデルと反復サンプリングプロセスを回避し、画像デハジングに拡散モデルを導入する新しい視点を提供する。複数のデータセットに対する大規模な実験により,提案手法は既存の画像復調法よりも優れた性能を発揮することが示された。コードはhttps://github.com/aaasan111/difflid.comで入手できる。

論文の概要: Unleashing the Potential of the Semantic Latent Space in Diffusion Models for Image Dehazing

関連論文リスト