Efficient Visual Computing with Camera RAW Snapshots
- URL: http://arxiv.org/abs/2212.07778v2
- Date: Thu, 25 Jan 2024 16:47:32 GMT
- Title: Efficient Visual Computing with Camera RAW Snapshots
- Authors: Zhihao Li, Ming Lu, Xu Zhang, Xin Feng, M. Salman Asif, and Zhan Ma
- Abstract summary: Conventional cameras capture image irradiance on a sensor and convert it to RGB images using an image signal processor (ISP)
One can argue that since RAW images contain all the captured information, the conversion of RAW to RGB using an ISP is not necessary for visual computing.
We propose a novel $rho$-Vision framework to perform high-level semantic understanding and low-level compression using RAW images.
- Score: 41.9863557302409
- License: http://creativecommons.org/licenses/by-sa/4.0/
- Abstract: Conventional cameras capture image irradiance on a sensor and convert it to
RGB images using an image signal processor (ISP). The images can then be used
for photography or visual computing tasks in a variety of applications, such as
public safety surveillance and autonomous driving. One can argue that since RAW
images contain all the captured information, the conversion of RAW to RGB using
an ISP is not necessary for visual computing. In this paper, we propose a novel
$\rho$-Vision framework to perform high-level semantic understanding and
low-level compression using RAW images without the ISP subsystem used for
decades. Considering the scarcity of available RAW image datasets, we first
develop an unpaired CycleR2R network based on unsupervised CycleGAN to train
modular unrolled ISP and inverse ISP (invISP) models using unpaired RAW and RGB
images. We can then flexibly generate simulated RAW images (simRAW) using any
existing RGB image dataset and finetune different models originally trained for
the RGB domain to process real-world camera RAW images. We demonstrate object
detection and image compression capabilities in RAW-domain using RAW-domain
YOLOv3 and RAW image compressor (RIC) on snapshots from various cameras.
Quantitative results reveal that RAW-domain task inference provides better
detection accuracy and compression compared to RGB-domain processing.
Furthermore, the proposed \r{ho}-Vision generalizes across various camera
sensors and different task-specific models. Additional advantages of the
proposed $\rho$-Vision that eliminates the ISP are the potential reductions in
computations and processing times.
Related papers
- Towards RAW Object Detection in Diverse Conditions [65.30190654593842]
We introduce the AODRaw dataset, which offers 7,785 high-resolution real RAW images with 135,601 annotated instances spanning 62 categories.
We find that sRGB pre-training constrains the potential of RAW object detection due to the domain gap between sRGB and RAW.
We distill the knowledge from an off-the-shelf model pre-trained on the sRGB domain to assist RAW pre-training.
arXiv Detail & Related papers (2024-11-24T01:23:04Z) - A Learnable Color Correction Matrix for RAW Reconstruction [19.394856071610604]
We introduce a learnable color correction matrix (CCM) to approximate the complex inverse image signal processor (ISP)
Experimental results demonstrate that simulated RAW (simRAW) images generated by our method provide performance improvements equivalent to those produced by more complex inverse ISP methods.
arXiv Detail & Related papers (2024-09-04T07:46:42Z) - Simple Image Signal Processing using Global Context Guidance [56.41827271721955]
Deep learning-based ISPs aim to transform RAW images into DSLR-like RGB images using deep neural networks.
We propose a novel module that can be integrated into any neural ISP to capture the global context information from the full RAW images.
Our model achieves state-of-the-art results on different benchmarks using diverse and real smartphone images.
arXiv Detail & Related papers (2024-04-17T17:11:47Z) - BSRAW: Improving Blind RAW Image Super-Resolution [63.408484584265985]
We tackle blind image super-resolution in the RAW domain.
We design a realistic degradation pipeline tailored specifically for training models with raw sensor data.
Our BSRAW models trained with our pipeline can upscale real-scene RAW images and improve their quality.
arXiv Detail & Related papers (2023-12-24T14:17:28Z) - DRAW: Defending Camera-shooted RAW against Image Manipulation [24.203631473348462]
We propose a novel scheme of defending images against manipulation by protecting their sources, i.e., camera-shooted RAWs.
Specifically, we design a lightweight Multi-frequency Partial Fusion Network (MPF-Net) friendly to devices with limited computing resources by frequency learning and partial feature fusion.
The protection capability can not only be transferred into the rendered RGB images regardless of the applied ISP pipeline, but also is resilient to post-processing operations such as blurring or compression.
arXiv Detail & Related papers (2023-07-31T05:57:41Z) - Self-Supervised Reversed Image Signal Processing via Reference-Guided
Dynamic Parameter Selection [1.1602089225841632]
We propose a self-supervised reversed ISP method that does not require metadata and paired images.
The proposed method converts a RGB image into a RAW-like image taken in the same environment with the same sensor as a reference RAW image.
We show that the proposed method is able to learn various reversed ISPs with comparable accuracy to other state-of-the-art supervised methods.
arXiv Detail & Related papers (2023-03-24T11:12:05Z) - Reversed Image Signal Processing and RAW Reconstruction. AIM 2022
Challenge Report [109.2135194765743]
This paper introduces the AIM 2022 Challenge on Reversed Image Signal Processing and RAW Reconstruction.
We aim to recover raw sensor images from the corresponding RGBs without metadata and, by doing this, "reverse" the ISP transformation.
arXiv Detail & Related papers (2022-10-20T10:43:53Z) - Learning RAW-to-sRGB Mappings with Inaccurately Aligned Supervision [76.41657124981549]
This paper presents a joint learning model for image alignment and RAW-to-sRGB mapping.
Experiments show that our method performs favorably against state-of-the-arts on ZRR and SR-RAW datasets.
arXiv Detail & Related papers (2021-08-18T12:41:36Z) - Invertible Image Signal Processing [42.109752151834456]
Invertible Image Signal Processing (InvISP) pipeline enables rendering visually appealing sRGB images.
We can reconstruct realistic RAW data instead of synthesizing RAW data from sRGB images without any memory overhead.
arXiv Detail & Related papers (2021-03-28T06:30:15Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.