Human Guided Ground-truth Generation for Realistic Image
Super-resolution
- URL: http://arxiv.org/abs/2303.13069v1
- Date: Thu, 23 Mar 2023 06:53:14 GMT
- Title: Human Guided Ground-truth Generation for Realistic Image
Super-resolution
- Authors: Du Chen, Jie Liang, Xindong Zhang, Ming Liu, Hui Zeng, Lei Zhang
- Abstract summary: How to generate the ground-truth (GT) image is a critical issue for training realistic image super-resolution (Real-ISR) models.
Existing methods mostly take a set of high-resolution (HR) images as GTs and apply various degradations to simulate their low-resolution (LR) counterparts.
We propose a human guided GT generation scheme.
- Score: 27.74022069080442
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: How to generate the ground-truth (GT) image is a critical issue for training
realistic image super-resolution (Real-ISR) models. Existing methods mostly
take a set of high-resolution (HR) images as GTs and apply various degradations
to simulate their low-resolution (LR) counterparts. Though great progress has
been achieved, such an LR-HR pair generation scheme has several limitations.
First, the perceptual quality of HR images may not be high enough, limiting the
quality of Real-ISR outputs. Second, existing schemes do not consider much
human perception in GT generation, and the trained models tend to produce
over-smoothed results or unpleasant artifacts. With the above considerations,
we propose a human guided GT generation scheme. We first elaborately train
multiple image enhancement models to improve the perceptual quality of HR
images, and enable one LR image having multiple HR counterparts. Human subjects
are then involved to annotate the high quality regions among the enhanced HR
images as GTs, and label the regions with unpleasant artifacts as negative
samples. A human guided GT image dataset with both positive and negative
samples is then constructed, and a loss function is proposed to train the
Real-ISR models. Experiments show that the Real-ISR models trained on our
dataset can produce perceptually more realistic results with less artifacts.
Dataset and codes can be found at https://github.com/ChrisDud0257/HGGT
Related papers
- Efficient Test-Time Adaptation for Super-Resolution with Second-Order
Degradation and Reconstruction [62.955327005837475]
Image super-resolution (SR) aims to learn a mapping from low-resolution (LR) to high-resolution (HR) using paired HR-LR training images.
We present an efficient test-time adaptation framework for SR, named SRTTA, which is able to quickly adapt SR models to test domains with different/unknown degradation types.
arXiv Detail & Related papers (2023-10-29T13:58:57Z) - Learning Many-to-Many Mapping for Unpaired Real-World Image
Super-resolution and Downscaling [60.80788144261183]
We propose an image downscaling and SR model dubbed as SDFlow, which simultaneously learns a bidirectional many-to-many mapping between real-world LR and HR images unsupervisedly.
Experimental results on real-world image SR datasets indicate that SDFlow can generate diverse realistic LR and SR images both quantitatively and qualitatively.
arXiv Detail & Related papers (2023-10-08T01:48:34Z) - Learning Multiple Probabilistic Degradation Generators for Unsupervised
Real World Image Super Resolution [5.987801889633082]
Unsupervised real world super resolution aims at restoring high-resolution (HR) images given low-resolution (LR) inputs when paired data is unavailable.
One of the most common approaches is synthesizing noisy LR images using GANs and utilizing a synthetic dataset to train the model in a supervised manner.
We propose a probabilistic degradation generator to approximate the distribution of LR images given a HR image.
arXiv Detail & Related papers (2022-01-26T04:49:11Z) - Real-World Super-Resolution of Face-Images from Surveillance Cameras [25.258587196435464]
We propose a novel framework for generation of realistic LR/HR training pairs.
Our framework estimates realistic blur kernels, noise distributions, and JPEG compression artifacts to generate LR images with similar image characteristics as the ones in the source domain.
For better perceptual quality we use a Generative Adrial Network (GAN) based SR model where we have exchanged the commonly used VGG-loss [24] with LPIPS-loss [52]
arXiv Detail & Related papers (2021-02-05T11:38:30Z) - Joint Generative Learning and Super-Resolution For Real-World
Camera-Screen Degradation [6.14297871633911]
In real-world single image super-resolution (SISR) task, the low-resolution image suffers more complicated degradations.
In this paper, we focus on the camera-screen degradation and build a real-world dataset (Cam-ScreenSR)
We propose a joint two-stage model. Firstly, the downsampling degradation GAN(DD-GAN) is trained to model the degradation and produces more various of LR images.
Then the dual residual channel attention network (DuRCAN) learns to recover the SR image.
arXiv Detail & Related papers (2020-08-01T07:10:13Z) - Closed-loop Matters: Dual Regression Networks for Single Image
Super-Resolution [73.86924594746884]
Deep neural networks have exhibited promising performance in image super-resolution.
These networks learn a nonlinear mapping function from low-resolution (LR) images to high-resolution (HR) images.
We propose a dual regression scheme by introducing an additional constraint on LR data to reduce the space of the possible functions.
arXiv Detail & Related papers (2020-03-16T04:23:42Z) - PULSE: Self-Supervised Photo Upsampling via Latent Space Exploration of
Generative Models [77.32079593577821]
PULSE (Photo Upsampling via Latent Space Exploration) generates high-resolution, realistic images at resolutions previously unseen in the literature.
Our method outperforms state-of-the-art methods in perceptual quality at higher resolutions and scale factors than previously possible.
arXiv Detail & Related papers (2020-03-08T16:44:31Z) - Learning to Zoom-in via Learning to Zoom-out: Real-world
Super-resolution by Generating and Adapting Degradation [91.40265983636839]
We propose a framework to learn SR from an arbitrary set of unpaired LR and HR images.
We minimize the discrepancy between the generated data and real data while learning a degradation adaptive SR network.
The proposed unpaired method achieves state-of-the-art SR results on real-world images, even in the datasets that favor the paired-learning methods more.
arXiv Detail & Related papers (2020-01-08T05:17:02Z) - Characteristic Regularisation for Super-Resolving Face Images [81.84939112201377]
Existing facial image super-resolution (SR) methods focus mostly on improving artificially down-sampled low-resolution (LR) imagery.
Previous unsupervised domain adaptation (UDA) methods address this issue by training a model using unpaired genuine LR and HR data.
This renders the model overstretched with two tasks: consistifying the visual characteristics and enhancing the image resolution.
We formulate a method that joins the advantages of conventional SR and UDA models.
arXiv Detail & Related papers (2019-12-30T16:27:24Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.