When Cloud Removal Meets Diffusion Model in Remote Sensing
- URL: http://arxiv.org/abs/2504.14785v1
- Date: Mon, 21 Apr 2025 00:56:57 GMT
- Title: When Cloud Removal Meets Diffusion Model in Remote Sensing
- Authors: Zhenyu Yu, Mohd Yamani Idna Idris, Pei Wang,
- Abstract summary: We propose DC4CR (Diffusion Control for Cloud Removal), a novel framework for cloud removal in remote sensing imagery.<n>Our method introduces prompt-driven control, allowing selective removal of thin and thick clouds without relying on pre-generated cloud masks.<n>Experiments on the RICE and CUHK-CR datasets demonstrate state-of-the-art performance, achieving superior cloud removal across diverse conditions.
- Score: 4.824120664293887
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Cloud occlusion significantly hinders remote sensing applications by obstructing surface information and complicating analysis. To address this, we propose DC4CR (Diffusion Control for Cloud Removal), a novel multimodal diffusion-based framework for cloud removal in remote sensing imagery. Our method introduces prompt-driven control, allowing selective removal of thin and thick clouds without relying on pre-generated cloud masks, thereby enhancing preprocessing efficiency and model adaptability. Additionally, we integrate low-rank adaptation for computational efficiency, subject-driven generation for improved generalization, and grouped learning to enhance performance on small datasets. Designed as a plug-and-play module, DC4CR seamlessly integrates into existing cloud removal models, providing a scalable and robust solution. Extensive experiments on the RICE and CUHK-CR datasets demonstrate state-of-the-art performance, achieving superior cloud removal across diverse conditions. This work presents a practical and efficient approach for remote sensing image processing with broad real-world applications.
Related papers
- Multimodal Diffusion Bridge with Attention-Based SAR Fusion for Satellite Image Cloud Removal [15.014162404892557]
Diffusion Bridges for Cloud Removal, DB-CR, bridges between cloudy and cloud-free image distributions.<n>DB-CR achieves high-fidelity results while being computationally efficient.
arXiv Detail & Related papers (2025-04-04T17:25:49Z) - Patch-GAN Transfer Learning with Reconstructive Models for Cloud Removal [17.690698736544626]
Cloud removal plays a crucial role in enhancing remote sensing image analysis, yet accurately reconstructing cloud-obscured regions remains a significant challenge.<n>Recent advancements in generative models have made the generation of realistic images increasingly accessible.<n>We propose a deep transfer learning approach built on a generative adversarial network (GAN) framework to explore the potential of the novel masked autoencoder (MAE) image reconstruction model in cloud removal.
arXiv Detail & Related papers (2025-01-09T14:19:46Z) - IDF-CR: Iterative Diffusion Process for Divide-and-Conquer Cloud Removal in Remote-sensing Images [55.40601468843028]
We present an iterative diffusion process for cloud removal (IDF-CR)
IDF-CR is divided into two-stage models that address pixel space and latent space.
In the latent space stage, the diffusion model transforms low-quality cloud removal into high-quality clean output.
arXiv Detail & Related papers (2024-03-18T15:23:48Z) - Towards Robust and Efficient Cloud-Edge Elastic Model Adaptation via Selective Entropy Distillation [56.79064699832383]
We establish a Cloud-Edge Elastic Model Adaptation (CEMA) paradigm in which the edge models only need to perform forward propagation.
In our CEMA, to reduce the communication burden, we devise two criteria to exclude unnecessary samples from uploading to the cloud.
arXiv Detail & Related papers (2024-02-27T08:47:19Z) - Diffusion Enhancement for Cloud Removal in Ultra-Resolution Remote
Sensing Imagery [48.14610248492785]
Cloud layers severely compromise the quality and effectiveness of optical remote sensing (RS) images.
Existing deep-learning (DL)-based Cloud Removal (CR) techniques encounter difficulties in accurately reconstructing the original visual authenticity and detailed semantic content of the images.
This work proposes enhancements at the data and methodology fronts to tackle this challenge.
arXiv Detail & Related papers (2024-01-25T13:14:17Z) - Fast Training of Diffusion Transformer with Extreme Masking for 3D Point
Clouds Generation [64.99362684909914]
We propose FastDiT-3D, a novel masked diffusion transformer tailored for efficient 3D point cloud generation.
We also propose a novel voxel-aware masking strategy to adaptively aggregate background/foreground information from voxelized point clouds.
Our method achieves state-of-the-art performance with an extreme masking ratio of nearly 99%.
arXiv Detail & Related papers (2023-12-12T12:50:33Z) - PMAA: A Progressive Multi-scale Attention Autoencoder Model for
High-performance Cloud Removal from Multi-temporal Satellite Imagery [26.694734522423797]
This study introduces a high-performance cloud removal architecture, termed Progressive Multi-scale Attention Autoencoder (PMAA)
PMAA harnesses global and local information to construct robust contextual dependencies using a novel Multi-scale Attention Module (MAM) and a novel Local Interaction Module (LIM)
PMAA consistently outperforms the previous state-of-the-art model CTGAN on two benchmark datasets.
arXiv Detail & Related papers (2023-03-29T09:47:48Z) - Implicit Diffusion Models for Continuous Super-Resolution [65.45848137914592]
This paper introduces an Implicit Diffusion Model (IDM) for high-fidelity continuous image super-resolution.
IDM integrates an implicit neural representation and a denoising diffusion model in a unified end-to-end framework.
The scaling factor regulates the resolution and accordingly modulates the proportion of the LR information and generated features in the final output.
arXiv Detail & Related papers (2023-03-29T07:02:20Z) - Multi-Head Linear Attention Generative Adversarial Network for Thin
Cloud Removal [5.753245638190626]
thin cloud removal is an indispensable procedure to enhance the utilization of remote sensing images.
We propose a Multi-Head Linear Attention Generative Adversarial Network (MLAGAN) for Thin Cloud Removal.
Compared with six deep learning-based thin cloud removal benchmarks, the experimental results on the RICE1 and RICE2 datasets demonstrate that the proposed framework MLA-GAN has dominant advantages in thin cloud removal.
arXiv Detail & Related papers (2020-12-20T11:50:54Z) - Pseudo-LiDAR Point Cloud Interpolation Based on 3D Motion Representation
and Spatial Supervision [68.35777836993212]
We propose a Pseudo-LiDAR point cloud network to generate temporally and spatially high-quality point cloud sequences.
By exploiting the scene flow between point clouds, the proposed network is able to learn a more accurate representation of the 3D spatial motion relationship.
arXiv Detail & Related papers (2020-06-20T03:11:04Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.