OSRT: Omnidirectional Image Super-Resolution with Distortion-aware
Transformer
- URL: http://arxiv.org/abs/2302.03453v2
- Date: Thu, 9 Feb 2023 10:45:41 GMT
- Title: OSRT: Omnidirectional Image Super-Resolution with Distortion-aware
Transformer
- Authors: Fanghua Yu, Xintao Wang, Mingdeng Cao, Gen Li, Ying Shan, Chao Dong
- Abstract summary: Previous methods attempt to solve this issue by image super-resolution (SR) on equirectangular projection (ERP) images.
We propose Fisheye downsampling, which mimics the real-world imaging process and synthesizes more realistic low-resolution samples.
We also propose a convenient data augmentation strategy, which synthesizes pseudo ERP images from plain images.
- Score: 28.53390467642499
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Omnidirectional images (ODIs) have obtained lots of research interest for
immersive experiences. Although ODIs require extremely high resolution to
capture details of the entire scene, the resolutions of most ODIs are
insufficient. Previous methods attempt to solve this issue by image
super-resolution (SR) on equirectangular projection (ERP) images. However, they
omit geometric properties of ERP in the degradation process, and their models
can hardly generalize to real ERP images. In this paper, we propose Fisheye
downsampling, which mimics the real-world imaging process and synthesizes more
realistic low-resolution samples. Then we design a distortion-aware Transformer
(OSRT) to modulate ERP distortions continuously and self-adaptively. Without a
cumbersome process, OSRT outperforms previous methods by about 0.2dB on PSNR.
Moreover, we propose a convenient data augmentation strategy, which synthesizes
pseudo ERP images from plain images. This simple strategy can alleviate the
over-fitting problem of large networks and significantly boost the performance
of ODISR. Extensive experiments have demonstrated the state-of-the-art
performance of our OSRT. Codes and models will be available at
https://github.com/Fanghua-Yu/OSRT.
Related papers
- Geometric Distortion Guided Transformer for Omnidirectional Image Super-Resolution [26.819508321968108]
We introduce a novel Geometric Distortion Guided Transformer for Omnidirectional image Super-Resolution (GDGT-OSR)
Specifically, a distortion modulated rectangle-window self-attention mechanism, integrated with deformable self-attention, is proposed to better perceive the distortion.
We show that the new GDGT-OSR outperforms methods in existing literature.
arXiv Detail & Related papers (2024-06-16T09:38:33Z) - Hierarchical Similarity Learning for Aliasing Suppression Image
Super-Resolution [64.15915577164894]
A hierarchical image super-resolution network (HSRNet) is proposed to suppress the influence of aliasing.
HSRNet achieves better quantitative and visual performance than other works, and remits the aliasing more effectively.
arXiv Detail & Related papers (2022-06-07T14:55:32Z) - Lightweight Bimodal Network for Single-Image Super-Resolution via
Symmetric CNN and Recursive Transformer [27.51790638626891]
Single-image super-resolution (SISR) has achieved significant breakthroughs with the development of deep learning.
To solve this issue, we propose a Lightweight Bimodal Network (LBNet) for SISR.
Specifically, an effective Symmetric CNN is designed for local feature extraction and coarse image reconstruction.
arXiv Detail & Related papers (2022-04-28T04:43:22Z) - Efficient and Degradation-Adaptive Network for Real-World Image
Super-Resolution [28.00231586840797]
Real-world image super-resolution (Real-ISR) is a challenging task due to the unknown complex degradation of real-world images.
Recent research on Real-ISR has achieved significant progress by modeling the image degradation space.
We propose an efficient degradation-adaptive super-resolution (DASR) network, whose parameters are adaptively specified by estimating the degradation of each input image.
arXiv Detail & Related papers (2022-03-27T05:59:13Z) - Toward Real-world Image Super-resolution via Hardware-based Adaptive
Degradation Models [3.9037347042028254]
Most single image super-resolution (SR) methods are developed on synthetic low-resolution (LR) and high-resolution (HR) image pairs.
We propose a novel supervised method to simulate an unknown degradation process with the inclusion of prior hardware knowledge.
Experiments on the real-world datasets validate that our degradation model can estimate LR images more accurately than the predetermined degradation operation.
arXiv Detail & Related papers (2021-10-20T19:53:48Z) - LAPAR: Linearly-Assembled Pixel-Adaptive Regression Network for Single
Image Super-Resolution and Beyond [75.37541439447314]
Single image super-resolution (SISR) deals with a fundamental problem of upsampling a low-resolution (LR) image to its high-resolution (HR) version.
This paper proposes a linearly-assembled pixel-adaptive regression network (LAPAR) to strike a sweet spot of deep model complexity and resulting SISR quality.
arXiv Detail & Related papers (2021-05-21T15:47:18Z) - SRWarp: Generalized Image Super-Resolution under Arbitrary
Transformation [65.88321755969677]
Deep CNNs have achieved significant successes in image processing and its applications, including single image super-resolution.
Recent approaches extend the scope to real-valued upsampling factors.
We propose the SRWarp framework to further generalize the SR tasks toward an arbitrary image transformation.
arXiv Detail & Related papers (2021-04-21T02:50:41Z) - Best-Buddy GANs for Highly Detailed Image Super-Resolution [71.13466303340192]
We consider the single image super-resolution (SISR) problem, where a high-resolution (HR) image is generated based on a low-resolution (LR) input.
Most methods along this line rely on a predefined single-LR-single-HR mapping, which is not flexible enough for the SISR task.
We propose best-buddy GANs (Beby-GAN) for rich-detail SISR. Relaxing the immutable one-to-one constraint, we allow the estimated patches to dynamically seek the best supervision.
arXiv Detail & Related papers (2021-03-29T02:58:27Z) - Designing a Practical Degradation Model for Deep Blind Image
Super-Resolution [134.9023380383406]
Single image super-resolution (SISR) methods would not perform well if the assumed degradation model deviates from those in real images.
This paper proposes to design a more complex but practical degradation model that consists of randomly shuffled blur, downsampling and noise degradations.
arXiv Detail & Related papers (2021-03-25T17:40:53Z) - Deep Generative Adversarial Residual Convolutional Networks for
Real-World Super-Resolution [31.934084942626257]
We propose a deep Super-Resolution Residual Convolutional Generative Adversarial Network (SRResCGAN)
It follows the real-world degradation settings by adversarial training the model with pixel-wise supervision in the HR domain from its generated LR counterpart.
The proposed network exploits the residual learning by minimizing the energy-based objective function with powerful image regularization and convex optimization techniques.
arXiv Detail & Related papers (2020-05-03T00:12:38Z) - PULSE: Self-Supervised Photo Upsampling via Latent Space Exploration of
Generative Models [77.32079593577821]
PULSE (Photo Upsampling via Latent Space Exploration) generates high-resolution, realistic images at resolutions previously unseen in the literature.
Our method outperforms state-of-the-art methods in perceptual quality at higher resolutions and scale factors than previously possible.
arXiv Detail & Related papers (2020-03-08T16:44:31Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.