RDDM: Practicing RAW Domain Diffusion Model for Real-world Image Restoration
- URL: http://arxiv.org/abs/2508.19154v1
- Date: Tue, 26 Aug 2025 16:06:17 GMT
- Title: RDDM: Practicing RAW Domain Diffusion Model for Real-world Image Restoration
- Authors: Yan Chen, Yi Wen, Wei Li, Junchao Liu, Yong Guo, Jie Hu, Xinghao Chen,
- Abstract summary: We present an end-to-end diffusion model that restores photo-realistic images directly from the sensor RAW data.<n>We develop a scalable degradation pipeline RAW LQ-HQ pairs from existing sRGB datasets for large-scale training.
- Score: 27.387521556174104
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: We present the RAW domain diffusion model (RDDM), an end-to-end diffusion model that restores photo-realistic images directly from the sensor RAW data. While recent sRGB-domain diffusion methods achieve impressive results, they are caught in a dilemma between high fidelity and realistic generation. As these models process lossy sRGB inputs and neglect the accessibility of the sensor RAW images in many scenarios, e.g., in image and video capturing in edge devices, resulting in sub-optimal performance. RDDM bypasses this limitation by directly restoring images in the RAW domain, replacing the conventional two-stage image signal processing (ISP) + IR pipeline. However, a simple adaptation of pre-trained diffusion models to the RAW domain confronts the out-of-distribution (OOD) issues. To this end, we propose: (1) a RAW-domain VAE (RVAE) learning optimal latent representations, (2) a differentiable Post Tone Processing (PTP) module enabling joint RAW and sRGB space optimization. To compensate for the deficiency in the dataset, we develop a scalable degradation pipeline synthesizing RAW LQ-HQ pairs from existing sRGB datasets for large-scale training. Furthermore, we devise a configurable multi-bayer (CMB) LoRA module handling diverse RAW patterns such as RGGB, BGGR, etc. Extensive experiments demonstrate RDDM's superiority over state-of-the-art sRGB diffusion methods, yielding higher fidelity results with fewer artifacts.
Related papers
- Towards RAW Object Detection in Diverse Conditions [65.30190654593842]
We introduce the AODRaw dataset, which offers 7,785 high-resolution real RAW images with 135,601 annotated instances spanning 62 categories.
We find that sRGB pre-training constrains the potential of RAW object detection due to the domain gap between sRGB and RAW.
We distill the knowledge from an off-the-shelf model pre-trained on the sRGB domain to assist RAW pre-training.
arXiv Detail & Related papers (2024-11-24T01:23:04Z) - RAW-Diffusion: RGB-Guided Diffusion Models for High-Fidelity RAW Image Generation [4.625376287612609]
We propose a novel diffusion-based method for generating RAW images guided by RGB images.
This approach yields high-fidelity RAW images, enabling the creation of camera-specific RAW datasets.
We extend our method to create BDD100K-RAW and Cityscapes-RAW datasets, revealing its effectiveness for object detection in RAW imagery.
arXiv Detail & Related papers (2024-11-20T09:40:12Z) - Unveiling Hidden Details: A RAW Data-Enhanced Paradigm for Real-World Super-Resolution [56.98910228239627]
Real-world image super-resolution (Real SR) aims to generate high-fidelity, detail-rich high-resolution (HR) images from low-resolution (LR) counterparts.
Existing Real SR methods primarily focus on generating details from the LR RGB domain, often leading to a lack of richness or fidelity in fine details.
We pioneer the use of details hidden in RAW data to complement existing RGB-only methods, yielding superior outputs.
arXiv Detail & Related papers (2024-11-16T13:29:50Z) - Toward Efficient Deep Blind RAW Image Restoration [56.41827271721955]
We design a new realistic degradation pipeline for training deep blind RAW restoration models.
Our pipeline considers realistic sensor noise, motion blur, camera shake, and other common degradations.
The models trained with our pipeline and data from multiple sensors, can successfully reduce noise and blur, and recover details in RAW images captured from different cameras.
arXiv Detail & Related papers (2024-09-26T18:34:37Z) - Retinex-RAWMamba: Bridging Demosaicing and Denoising for Low-Light RAW Image Enhancement [71.13353154514418]
Low-light image enhancement, particularly in cross-domain tasks such as mapping from the raw domain to the sRGB domain, remains a significant challenge.<n>We propose a novel Mamba-based method customized for low light RAW images, called RAWMamba, to effectively handle raw images with different CFAs.<n>By bridging demosaicing and denoising, better enhancement for low light RAW images is achieved.
arXiv Detail & Related papers (2024-09-11T06:12:03Z) - A Learnable Color Correction Matrix for RAW Reconstruction [19.394856071610604]
We introduce a learnable color correction matrix (CCM) to approximate the complex inverse image signal processor (ISP)
Experimental results demonstrate that simulated RAW (simRAW) images generated by our method provide performance improvements equivalent to those produced by more complex inverse ISP methods.
arXiv Detail & Related papers (2024-09-04T07:46:42Z) - BSRAW: Improving Blind RAW Image Super-Resolution [63.408484584265985]
We tackle blind image super-resolution in the RAW domain.
We design a realistic degradation pipeline tailored specifically for training models with raw sensor data.
Our BSRAW models trained with our pipeline can upscale real-scene RAW images and improve their quality.
arXiv Detail & Related papers (2023-12-24T14:17:28Z) - Image Demoireing in RAW and sRGB Domains [18.921026683632146]
We develop Skip-Connection-based Demoireing Module (SCDM) with Gated Feedback Module (GFM) and Frequency Selection Module (FSM)
We design a RGB Guided ISP (RGISP) to learn a device-dependent ISP, assisting the process of color recovery.
Our RRID outperforms state-of-the-art approaches, in terms of the performance in moire pattern removal and color cast correction by 0.62dB in PSNR and 0.003 in SSIM.
arXiv Detail & Related papers (2023-12-14T16:00:28Z) - Reversed Image Signal Processing and RAW Reconstruction. AIM 2022
Challenge Report [109.2135194765743]
This paper introduces the AIM 2022 Challenge on Reversed Image Signal Processing and RAW Reconstruction.
We aim to recover raw sensor images from the corresponding RGBs without metadata and, by doing this, "reverse" the ISP transformation.
arXiv Detail & Related papers (2022-10-20T10:43:53Z) - Model-Based Image Signal Processors via Learnable Dictionaries [6.766416093990318]
Digital cameras transform sensor RAW readings into RGB images by means of their Image Signal Processor (ISP)
Recent approaches have attempted to bridge this gap by estimating the RGB to RAW mapping.
We present a novel hybrid model-based and data-driven ISP that is both learnable and interpretable.
arXiv Detail & Related papers (2022-01-10T08:36:10Z) - Towards Low Light Enhancement with RAW Images [101.35754364753409]
We make the first benchmark effort to elaborate on the superiority of using RAW images in the low light enhancement.
We develop a new evaluation framework, Factorized Enhancement Model (FEM), which decomposes the properties of RAW images into measurable factors.
A RAW-guiding Exposure Enhancement Network (REENet) is developed, which makes trade-offs between the advantages and inaccessibility of RAW images in real applications.
arXiv Detail & Related papers (2021-12-28T07:27:51Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.