Exploit Camera Raw Data for Video Super-Resolution via Hidden Markov
Model Inference
- URL: http://arxiv.org/abs/2008.10710v2
- Date: Tue, 5 Jan 2021 04:03:55 GMT
- Title: Exploit Camera Raw Data for Video Super-Resolution via Hidden Markov
Model Inference
- Authors: Xiaohong Liu, Kangdi Shi, Zhe Wang, Jun Chen
- Abstract summary: We propose a new deep-learning Video Super-Resolution (VSR) method that can directly exploit camera sensor data.
The proposed method achieves superior VSR results compared to the state-of-the-art and can be adapted to any specific camera-ISP.
- Score: 17.82232046395501
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: To the best of our knowledge, the existing deep-learning-based Video
Super-Resolution (VSR) methods exclusively make use of videos produced by the
Image Signal Processor (ISP) of the camera system as inputs. Such methods are
1) inherently suboptimal due to information loss incurred by non-invertible
operations in ISP, and 2) inconsistent with the real imaging pipeline where VSR
in fact serves as a pre-processing unit of ISP. To address this issue, we
propose a new VSR method that can directly exploit camera sensor data,
accompanied by a carefully built Raw Video Dataset (RawVD) for training,
validation, and testing. This method consists of a Successive Deep Inference
(SDI) module and a reconstruction module, among others. The SDI module is
designed according to the architectural principle suggested by a canonical
decomposition result for Hidden Markov Model (HMM) inference; it estimates the
target high-resolution frame by repeatedly performing pairwise feature fusion
using deformable convolutions. The reconstruction module, built with
elaborately designed Attention-based Residual Dense Blocks (ARDBs), serves the
purpose of 1) refining the fused feature and 2) learning the color information
needed to generate a spatial-specific transformation for accurate color
correction. Extensive experiments demonstrate that owing to the informativeness
of the camera raw data, the effectiveness of the network architecture, and the
separation of super-resolution and color correction processes, the proposed
method achieves superior VSR results compared to the state-of-the-art and can
be adapted to any specific camera-ISP. Code and dataset are available at
https://github.com/proteus1991/RawVSR.
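As a rough illustration of two ideas in the abstract, the sketch below folds neighbouring-frame features into the target frame one pair at a time, then applies a per-pixel 3x3 color transformation. It is a minimal NumPy sketch, not the authors' implementation: `fuse_pair` is a hypothetical stand-in (a plain average) for the learned deformable-convolution fusion, the nearest-first visiting order is an assumption, and all function names are invented for illustration.

```python
import numpy as np


def fuse_pair(state, neighbor):
    """Hypothetical stand-in for learned pairwise fusion.

    The abstract says fusion uses deformable convolutions; a plain
    average is used here only to keep the sketch self-contained.
    """
    return 0.5 * (state + neighbor)


def successive_fusion(features, center):
    """Fold every neighbouring feature map into the center one, pair by pair.

    features: list of arrays with identical shape, one per frame.
    center:   index of the target frame.
    The nearest-first visiting order is an assumption of this sketch.
    """
    fused = features[center]
    neighbors = sorted(
        (i for i in range(len(features)) if i != center),
        key=lambda i: abs(i - center),
    )
    for i in neighbors:
        fused = fuse_pair(fused, features[i])
    return fused


def apply_color_transform(image, transform):
    """Apply a spatially varying 3x3 color matrix to every pixel.

    image:     (H, W, 3) array.
    transform: (H, W, 3, 3) array, one matrix per pixel.
    """
    return np.einsum("hwij,hwj->hwi", transform, image)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    frames = [rng.random((4, 4, 3)) for _ in range(5)]
    fused = successive_fusion(frames, center=2)
    # Identity matrices leave colors unchanged; a trained network would
    # predict a different matrix at each pixel.
    identity = np.broadcast_to(np.eye(3), (4, 4, 3, 3))
    corrected = apply_color_transform(fused, identity)
    print(corrected.shape)
```

Keeping fusion and color correction as separate functions mirrors the separation of super-resolution and color correction that the abstract credits for part of the method's performance.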
Related papers
- A Learnable Color Correction Matrix for RAW Reconstruction [19.394856071610604]
We introduce a learnable color correction matrix (CCM) to approximate the complex inverse image signal processor (ISP).
Experimental results demonstrate that simulated RAW (simRAW) images generated by our method provide performance improvements equivalent to those produced by more complex inverse ISP methods.
arXiv Detail & Related papers (2024-09-04T07:46:42Z)
- Reference-based Image and Video Super-Resolution via C2-Matching [100.0808130445653]
We propose C2-Matching, which performs explicit, robust matching across transformations and resolutions.
C2-Matching significantly outperforms the state of the art on the standard CUFED5 benchmark.
We also extend C2-Matching to the reference-based video super-resolution task, where an image taken in a similar scene serves as the HR reference image.
arXiv Detail & Related papers (2022-12-19T16:15:02Z)
- Learning Detail-Structure Alternative Optimization for Blind Super-Resolution [69.11604249813304]
We propose an effective and kernel-free network, namely DSSR, which enables recurrent detail-structure alternative optimization without blur kernel prior incorporation for blind SR.
In our DSSR, a detail-structure modulation module (DSMM) is built to exploit the interaction and collaboration of image details and structures.
Our method achieves state-of-the-art results compared with existing methods.
arXiv Detail & Related papers (2022-12-03T14:44:17Z)
- Self-Supervised Deep Blind Video Super-Resolution [46.410705294831374]
We propose a self-supervised learning method to solve the blind video SR problem.
We generate auxiliary paired data from original LR videos according to the image formation of video SR.
Experiments show that our method performs favorably against state-of-the-art ones on benchmarks and real-world videos.
arXiv Detail & Related papers (2022-01-19T05:18:44Z)
- RBSRICNN: Raw Burst Super-Resolution through Iterative Convolutional Neural Network [23.451063587138393]
We propose a Raw Burst Super-Resolution Iterative Convolutional Neural Network (RBSRICNN).
The proposed network produces the final output by an iterative refinement of the intermediate SR estimates.
We demonstrate the effectiveness of our proposed approach in quantitative and qualitative experiments.
arXiv Detail & Related papers (2021-10-25T19:01:28Z)
- Enhanced Hyperspectral Image Super-Resolution via RGB Fusion and TV-TV Minimization [9.584717030078245]
Hyperspectral (HS) images contain detailed spectral information that has proven crucial in applications like remote sensing, surveillance, and astronomy.
Because of hardware limitations of HS cameras, the captured images have low spatial resolution.
To improve them, the low-resolution hyperspectral images are fused with conventional high-resolution RGB images via a technique known as fusion-based HS image super-resolution.
arXiv Detail & Related papers (2021-06-13T18:52:47Z)
- LAPAR: Linearly-Assembled Pixel-Adaptive Regression Network for Single Image Super-Resolution and Beyond [75.37541439447314]
Single image super-resolution (SISR) deals with a fundamental problem of upsampling a low-resolution (LR) image to its high-resolution (HR) version.
This paper proposes a linearly-assembled pixel-adaptive regression network (LAPAR) to strike a sweet spot of deep model complexity and resulting SISR quality.
arXiv Detail & Related papers (2021-05-21T15:47:18Z)
- Exploiting Raw Images for Real-Scene Super-Resolution [105.18021110372133]
We study the problem of real-scene single image super-resolution to bridge the gap between synthetic data and real captured images.
We propose a method to generate more realistic training data by mimicking the imaging process of digital cameras.
We also develop a two-branch convolutional neural network to exploit the radiance information originally recorded in raw images.
arXiv Detail & Related papers (2021-02-02T16:10:15Z)
- Deep Burst Super-Resolution [165.90445859851448]
We propose a novel architecture for the burst super-resolution task.
Our network takes multiple noisy RAW images as input, and generates a denoised, super-resolved RGB image as output.
In order to enable training and evaluation on real-world data, we additionally introduce the BurstSR dataset.
arXiv Detail & Related papers (2021-01-26T18:57:21Z)
- MuCAN: Multi-Correspondence Aggregation Network for Video Super-Resolution [63.02785017714131]
Video super-resolution (VSR) aims to utilize multiple low-resolution frames to generate a high-resolution prediction for each frame.
Inter- and intra-frames are the key sources for exploiting temporal and spatial information.
We build an effective multi-correspondence aggregation network (MuCAN) for VSR.
arXiv Detail & Related papers (2020-07-23T05:41:27Z)