Fugu-MT 論文翻訳(概要): Diffusion Models in Low-Level Vision: A Survey

論文の概要: Diffusion Models in Low-Level Vision: A Survey

arxiv url: http://arxiv.org/abs/2406.11138v1
Date: Mon, 17 Jun 2024 01:49:27 GMT
ステータス: 翻訳完了
システム内更新日: 2024-06-18 18:43:55.341742
Title: Diffusion Models in Low-Level Vision: A Survey
Title（参考訳）: 低レベル視における拡散モデル:サーベイ
Authors: Chunming He, Yuqi Shen, Chengyu Fang, Fengyang Xiao, Longxiang Tang, Yulun Zhang, Wangmeng Zuo, Zhenhua Guo, Xiu Li,
Abstract要約: 拡散モデルに基づくソリューションは、優れた品質と多様性のサンプルを作成する能力で広く称賛されている。本稿では,3つの一般化拡散モデリングフレームワークを提案し,それらと他の深層生成モデルとの相関関係について検討する。医療、リモートセンシング、ビデオシナリオなど、他のタスクに適用された拡張拡散モデルについて要約する。
参考スコア（独自算出の注目度）: 82.77962165415153
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Deep generative models have garnered significant attention in low-level vision tasks due to their generative capabilities. Among them, diffusion model-based solutions, characterized by a forward diffusion process and a reverse denoising process, have emerged as widely acclaimed for their ability to produce samples of superior quality and diversity. This ensures the generation of visually compelling results with intricate texture information. Despite their remarkable success, a noticeable gap exists in a comprehensive survey that amalgamates these pioneering diffusion model-based works and organizes the corresponding threads. This paper proposes the comprehensive review of diffusion model-based techniques. We present three generic diffusion modeling frameworks and explore their correlations with other deep generative models, establishing the theoretical foundation. Following this, we introduce a multi-perspective categorization of diffusion models, considering both the underlying framework and the target task. Additionally, we summarize extended diffusion models applied in other tasks, including medical, remote sensing, and video scenarios. Moreover, we provide an overview of commonly used benchmarks and evaluation metrics. We conduct a thorough evaluation, encompassing both performance and efficiency, of diffusion model-based techniques in three prominent tasks. Finally, we elucidate the limitations of current diffusion models and propose seven intriguing directions for future research. This comprehensive examination aims to facilitate a profound understanding of the landscape surrounding denoising diffusion models in the context of low-level vision tasks. A curated list of diffusion model-based techniques in over 20 low-level vision tasks can be found at https://github.com/ChunmingHe/awesome-diffusion-models-in-low-level-vision.
Abstract（参考訳）: 深部生成モデルは、その生成能力のため、低レベルの視覚タスクに大きな注目を集めている。このうち, 拡散モデルに基づく解法は, 高い品質と多様性のサンプルを生成できることで広く評価されている。これにより、複雑なテクスチャ情報によって視覚的に説得力のある結果が生成される。その顕著な成功にもかかわらず、これらの先駆的な拡散モデルに基づく研究とそれに対応するスレッドを整理する、包括的な調査に顕著なギャップが存在する。本稿では拡散モデルに基づく手法の総合的なレビューを提案する。本稿では,3つの一般化拡散モデルフレームワークを提案し,それらの関係を他の深層生成モデルと検討し,理論基盤を確立する。次に、基礎となるフレームワークと対象タスクの両方を考慮して、拡散モデルの多重パースペクティブな分類を導入する。さらに、医療、リモートセンシング、ビデオシナリオなど、他のタスクに適用された拡張拡散モデルについても要約する。さらに、よく使われるベンチマークと評価指標の概要について述べる。 3つの課題において拡散モデルに基づく手法の性能と効率の両面を網羅的に評価する。最後に,現在の拡散モデルの限界を解明し,今後の研究に興味深い7つの方向を提案する。この総合的な調査は,低レベル視覚タスクの文脈における認知拡散モデルを取り巻く景観の深い理解を促進することを目的としている。 20以上の低レベルビジョンタスクにおける拡散モデルベースのテクニックのキュレートされたリストは、https://github.com/ChunmingHe/awesome-diffusion-models-in-low-level-visionにある。

論文の概要: Diffusion Models in Low-Level Vision: A Survey

関連論文リスト