Recursive Self-Improvement for Camera Image and Signal Processing
Pipeline
- URL: http://arxiv.org/abs/2111.07499v1
- Date: Mon, 15 Nov 2021 02:23:40 GMT
- Title: Recursive Self-Improvement for Camera Image and Signal Processing
Pipeline
- Authors: Chandrajit Bajaj and Yi Wang and Yunhao Yang and Yuhan Zheng
- Abstract summary: Current camera image and signal processing pipelines (ISPs) tend to apply a single filter that is uniformly applied to the entire image.
This despite the fact that most acquired camera images have spatially heterogeneous artifacts.
We present a deep reinforcement learning model that works in learned latent subspaces.
- Score: 6.318974730864278
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: Current camera image and signal processing pipelines (ISPs), including deep
trained versions, tend to apply a single filter that is uniformly applied to
the entire image. This despite the fact that most acquired camera images have
spatially heterogeneous artifacts. This spatial heterogeneity manifests itself
across the image space as varied Moire ringing, motion-blur, color-bleaching or
lens based projection distortions. Moreover, combinations of these image
artifacts can be present in small or large pixel neighborhoods, within an
acquired image. Here, we present a deep reinforcement learning model that works
in learned latent subspaces, recursively improves camera image quality through
a patch-based spatially adaptive artifact filtering and image enhancement. Our
RSE-RL model views the identification and correction of artifacts as a
recursive self-learning and self-improvement exercise and consists of two major
sub-modules: (i) The latent feature sub-space clustering/grouping obtained
through an equivariant variational auto-encoder enabling rapid identification
of the correspondence and discrepancy between noisy and clean image patches.
(ii) The adaptive learned transformation controlled by a trust-region soft
actor-critic agent that progressively filters and enhances the noisy patches
using its closest feature distance neighbors of clean patches. Artificial
artifacts that may be introduced in a patch-based ISP, are also removed through
a reward based de-blocking recovery and image enhancement. We demonstrate the
self-improvement feature of our model by recursively training and testing on
images, wherein the enhanced images resulting from each epoch provide a natural
data augmentation and robustness to the RSE-RL training-filtering pipeline.
Related papers
- DGNet: Dynamic Gradient-Guided Network for Water-Related Optics Image
Enhancement [77.0360085530701]
Underwater image enhancement (UIE) is a challenging task due to the complex degradation caused by underwater environments.
Previous methods often idealize the degradation process, and neglect the impact of medium noise and object motion on the distribution of image features.
Our approach utilizes predicted images to dynamically update pseudo-labels, adding a dynamic gradient to optimize the network's gradient space.
arXiv Detail & Related papers (2023-12-12T06:07:21Z) - Pixel-Inconsistency Modeling for Image Manipulation Localization [59.968362815126326]
Digital image forensics plays a crucial role in image authentication and manipulation localization.
This paper presents a generalized and robust manipulation localization model through the analysis of pixel inconsistency artifacts.
Experiments show that our method successfully extracts inherent pixel-inconsistency forgery fingerprints.
arXiv Detail & Related papers (2023-09-30T02:54:51Z) - In-Domain GAN Inversion for Faithful Reconstruction and Editability [132.68255553099834]
We propose in-domain GAN inversion, which consists of a domain-guided domain-regularized and a encoder to regularize the inverted code in the native latent space of the pre-trained GAN model.
We make comprehensive analyses on the effects of the encoder structure, the starting inversion point, as well as the inversion parameter space, and observe the trade-off between the reconstruction quality and the editing property.
arXiv Detail & Related papers (2023-09-25T08:42:06Z) - Pixel-Aware Stable Diffusion for Realistic Image Super-resolution and Personalized Stylization [23.723573179119228]
We propose a pixel-aware stable diffusion (PASD) network to achieve robust Real-ISR and personalized image stylization.
A pixel-aware cross attention module is introduced to enable diffusion models perceiving image local structures in pixel-wise level.
An adjustable noise schedule is introduced to further improve the image restoration results.
arXiv Detail & Related papers (2023-08-28T10:15:57Z) - Super-Resolution of License Plate Images Using Attention Modules and
Sub-Pixel Convolution Layers [3.8831062015253055]
We introduce a Single-Image Super-Resolution (SISR) approach to enhance the detection of structural and textural features in surveillance images.
Our approach incorporates sub-pixel convolution layers and a loss function that uses an Optical Character Recognition (OCR) model for feature extraction.
Our results show that our approach for reconstructing these low-resolution synthesized images outperforms existing ones in both quantitative and qualitative measures.
arXiv Detail & Related papers (2023-05-27T00:17:19Z) - Spatially-Adaptive Image Restoration using Distortion-Guided Networks [51.89245800461537]
We present a learning-based solution for restoring images suffering from spatially-varying degradations.
We propose SPAIR, a network design that harnesses distortion-localization information and dynamically adjusts to difficult regions in the image.
arXiv Detail & Related papers (2021-08-19T11:02:25Z) - Ensembling with Deep Generative Views [72.70801582346344]
generative models can synthesize "views" of artificial images that mimic real-world variations, such as changes in color or pose.
Here, we investigate whether such views can be applied to real images to benefit downstream analysis tasks such as image classification.
We use StyleGAN2 as the source of generative augmentations and investigate this setup on classification tasks involving facial attributes, cat faces, and cars.
arXiv Detail & Related papers (2021-04-29T17:58:35Z) - Deep Contrastive Patch-Based Subspace Learning for Camera Image Signal
Processing [5.678834480723395]
We present a specific patch-based, local subspace deep neural network that improves Camera ISP to be robust to heterogeneous artifacts.
We call our three-fold deep-trained model the Patch Subspace Learning Autoencoder (PSL-AE)
PSL-AE encodes patches extracted from noisy a nd clean image pairs, with different artifact types or distortion levels, by contrastive learning.
arXiv Detail & Related papers (2021-04-01T04:40:22Z) - SIR: Self-supervised Image Rectification via Seeing the Same Scene from
Multiple Different Lenses [82.56853587380168]
We propose a novel self-supervised image rectification (SIR) method based on an important insight that the rectified results of distorted images of the same scene from different lens should be the same.
We leverage a differentiable warping module to generate the rectified images and re-distorted images from the distortion parameters.
Our method achieves comparable or even better performance than the supervised baseline method and representative state-of-the-art methods.
arXiv Detail & Related papers (2020-11-30T08:23:25Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.