ISP Distillation
- URL: http://arxiv.org/abs/2101.10203v3
- Date: Thu, 4 May 2023 14:27:49 GMT
- Title: ISP Distillation
- Authors: Eli Schwartz, Alex Bronstein, Raja Giryes
- Abstract summary: High-level machine vision models, such as object recognition or semantic segmentation, assume images are transformed into some canonical image space by the camera.
The camera ISP is optimized for producing visually pleasing images for human observers and not for machines.
We show that our performance on RAW images for object classification and semantic segmentation is significantly better than models trained on labeled RAW images.
- Score: 38.19032198060534
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: Nowadays, many of the images captured are `observed' by machines only and not
by humans, e.g., in autonomous systems. High-level machine vision models, such
as object recognition or semantic segmentation, assume images are transformed
into some canonical image space by the camera \ans{Image Signal Processor
(ISP)}. However, the camera ISP is optimized for producing visually pleasing
images for human observers and not for machines. Therefore, one may spare the
ISP compute time and apply vision models directly to RAW images. Yet, it has
been shown that training such models directly on RAW images results in a
performance drop. To mitigate this drop, we use a RAW and RGB image pairs
dataset, which can be easily acquired with no human labeling. We then train a
model that is applied directly to the RAW data by using knowledge distillation
such that the model predictions for RAW images will be aligned with the
predictions of an off-the-shelf pre-trained model for processed RGB images. Our
experiments show that our performance on RAW images for object classification
and semantic segmentation is significantly better than models trained on
labeled RAW images. It also reasonably matches the predictions of a pre-trained
model on processed RGB images, while saving the ISP compute overhead.
Related papers
- Towards RAW Object Detection in Diverse Conditions [65.30190654593842]
We introduce the AODRaw dataset, which offers 7,785 high-resolution real RAW images with 135,601 annotated instances spanning 62 categories.
We find that sRGB pre-training constrains the potential of RAW object detection due to the domain gap between sRGB and RAW.
We distill the knowledge from an off-the-shelf model pre-trained on the sRGB domain to assist RAW pre-training.
arXiv Detail & Related papers (2024-11-24T01:23:04Z) - A Learnable Color Correction Matrix for RAW Reconstruction [19.394856071610604]
We introduce a learnable color correction matrix (CCM) to approximate the complex inverse image signal processor (ISP)
Experimental results demonstrate that simulated RAW (simRAW) images generated by our method provide performance improvements equivalent to those produced by more complex inverse ISP methods.
arXiv Detail & Related papers (2024-09-04T07:46:42Z) - RAW-Adapter: Adapting Pre-trained Visual Model to Camera RAW Images [51.68432586065828]
We introduce RAW-Adapter, a novel approach aimed at adapting sRGB pre-trained models to camera RAW data.
Raw-Adapter comprises input-level adapters that employ learnable ISP stages to adjust RAW inputs, as well as model-level adapters to build connections between ISP stages and subsequent high-level networks.
arXiv Detail & Related papers (2024-08-27T06:14:54Z) - Rawformer: Unpaired Raw-to-Raw Translation for Learnable Camera ISPs [53.68932498994655]
This paper introduces a novel method for unpaired learning of raw-to-raw translation across diverse cameras.
It accurately maps raw images captured by a certain camera to the target camera, facilitating the generalization of learnable ISPs to new unseen cameras.
Our method demonstrates superior performance on real camera datasets, achieving higher accuracy compared to previous state-of-the-art techniques.
arXiv Detail & Related papers (2024-04-16T16:17:48Z) - Self-Supervised Reversed Image Signal Processing via Reference-Guided
Dynamic Parameter Selection [1.1602089225841632]
We propose a self-supervised reversed ISP method that does not require metadata and paired images.
The proposed method converts a RGB image into a RAW-like image taken in the same environment with the same sensor as a reference RAW image.
We show that the proposed method is able to learn various reversed ISPs with comparable accuracy to other state-of-the-art supervised methods.
arXiv Detail & Related papers (2023-03-24T11:12:05Z) - Efficient Visual Computing with Camera RAW Snapshots [41.9863557302409]
Conventional cameras capture image irradiance on a sensor and convert it to RGB images using an image signal processor (ISP)
One can argue that since RAW images contain all the captured information, the conversion of RAW to RGB using an ISP is not necessary for visual computing.
We propose a novel $rho$-Vision framework to perform high-level semantic understanding and low-level compression using RAW images.
arXiv Detail & Related papers (2022-12-15T12:54:21Z) - Model-Based Image Signal Processors via Learnable Dictionaries [6.766416093990318]
Digital cameras transform sensor RAW readings into RGB images by means of their Image Signal Processor (ISP)
Recent approaches have attempted to bridge this gap by estimating the RGB to RAW mapping.
We present a novel hybrid model-based and data-driven ISP that is both learnable and interpretable.
arXiv Detail & Related papers (2022-01-10T08:36:10Z) - Towards Low Light Enhancement with RAW Images [101.35754364753409]
We make the first benchmark effort to elaborate on the superiority of using RAW images in the low light enhancement.
We develop a new evaluation framework, Factorized Enhancement Model (FEM), which decomposes the properties of RAW images into measurable factors.
A RAW-guiding Exposure Enhancement Network (REENet) is developed, which makes trade-offs between the advantages and inaccessibility of RAW images in real applications.
arXiv Detail & Related papers (2021-12-28T07:27:51Z) - Raw Image Deblurring [24.525466412146358]
We build a new dataset containing both RAW images and processed sRGB images and design a new model to utilize the unique characteristics of RAW images.
The proposed deblurring model, trained solely from RAW images, achieves the state-of-art performance and outweighs those trained on processed sRGB images.
arXiv Detail & Related papers (2020-12-08T08:03:09Z) - CycleISP: Real Image Restoration via Improved Data Synthesis [166.17296369600774]
We present a framework that models camera imaging pipeline in forward and reverse directions.
By training a new image denoising network on realistic synthetic data, we achieve the state-of-the-art performance on real camera benchmark datasets.
arXiv Detail & Related papers (2020-03-17T15:20:25Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.