Underwater Image Super-Resolution using Generative Adversarial
Network-based Model
- URL: http://arxiv.org/abs/2211.03550v4
- Date: Sat, 23 Sep 2023 16:14:20 GMT
- Title: Underwater Image Super-Resolution using Generative Adversarial
Network-based Model
- Authors: Alireza Aghelan, Modjtaba Rouhani
- Abstract summary: Single image super-resolution (SISR) models are able to enhance the resolution and visual quality of underwater images.
In this paper, we fine-tune the pre-trained Real-ESRGAN model for underwater image super-resolution.
- Score: 3.127436744845925
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Single image super-resolution (SISR) models are able to enhance the
resolution and visual quality of underwater images and contribute to a better
understanding of underwater environments. The integration of these models in
Autonomous Underwater Vehicles (AUVs) can improve their performance in
vision-based tasks. Real-Enhanced Super-Resolution Generative Adversarial
Network (Real-ESRGAN) is an efficient model that has shown remarkable
performance among SISR models. In this paper, we fine-tune the pre-trained
Real-ESRGAN model for underwater image super-resolution. To fine-tune and
evaluate the performance of the model, we use the USR-248 dataset. The
fine-tuned model produces more realistic images with better visual quality
compared to the Real-ESRGAN model.
Related papers
- IG-CFAT: An Improved GAN-Based Framework for Effectively Exploiting Transformers in Real-World Image Super-Resolution [2.009766774844269]
This paper extends the CFAT model to an improved GAN-based model called IG-CFAT.
IG-CFAT incorporates a semantic-aware discriminator to reconstruct fine details more accurately.
Our methodology adds wavelet loss to conventional loss functions of GAN-based super-resolution models to recover high-frequency details more efficiently.
arXiv Detail & Related papers (2024-06-19T20:21:26Z) - DGNet: Dynamic Gradient-Guided Network for Water-Related Optics Image
Enhancement [77.0360085530701]
Underwater image enhancement (UIE) is a challenging task due to the complex degradation caused by underwater environments.
Previous methods often idealize the degradation process, and neglect the impact of medium noise and object motion on the distribution of image features.
Our approach utilizes predicted images to dynamically update pseudo-labels, adding a dynamic gradient to optimize the network's gradient space.
arXiv Detail & Related papers (2023-12-12T06:07:21Z) - A comparative analysis of SRGAN models [0.0]
We evaluate the performance of SRGAN (Super Resolution Generative Adversarial Network) models, ESRGAN, Real-ESRGAN and EDSR, on a benchmark dataset of real-world images.
Some models seem to significantly increase the resolution of the input images while preserving their visual quality, this is assessed using Tesseract OCR engine.
arXiv Detail & Related papers (2023-07-18T17:35:45Z) - ACDMSR: Accelerated Conditional Diffusion Models for Single Image
Super-Resolution [84.73658185158222]
We propose a diffusion model-based super-resolution method called ACDMSR.
Our method adapts the standard diffusion model to perform super-resolution through a deterministic iterative denoising process.
Our approach generates more visually realistic counterparts for low-resolution images, emphasizing its effectiveness in practical scenarios.
arXiv Detail & Related papers (2023-07-03T06:49:04Z) - PUGAN: Physical Model-Guided Underwater Image Enhancement Using GAN with
Dual-Discriminators [120.06891448820447]
How to obtain clear and visually pleasant images has become a common concern of people.
The task of underwater image enhancement (UIE) has also emerged as the times require.
In this paper, we propose a physical model-guided GAN model for UIE, referred to as PUGAN.
Our PUGAN outperforms state-of-the-art methods in both qualitative and quantitative metrics.
arXiv Detail & Related papers (2023-06-15T07:41:12Z) - Fine-tuned Generative Adversarial Network-based Model for Medical Image Super-Resolution [2.647302105102753]
Real-Enhanced Super-Resolution Generative Adversarial Network (Real-ESRGAN) is a practical model for recovering HR images from real-world LR images.
We employ the high-order degradation model of the Real-ESRGAN which better simulates real-world image degradations.
The proposed model achieves superior perceptual quality compared to the Real-ESRGAN model, effectively preserving fine details and generating images with more realistic textures.
arXiv Detail & Related papers (2022-11-01T16:48:04Z) - Advancing Plain Vision Transformer Towards Remote Sensing Foundation
Model [97.9548609175831]
We resort to plain vision transformers with about 100 million parameters and make the first attempt to propose large vision models customized for remote sensing tasks.
Specifically, to handle the large image size and objects of various orientations in RS images, we propose a new rotated varied-size window attention.
Experiments on detection tasks demonstrate the superiority of our model over all state-of-the-art models, achieving 81.16% mAP on the DOTA-V1.0 dataset.
arXiv Detail & Related papers (2022-08-08T09:08:40Z) - DiVAE: Photorealistic Images Synthesis with Denoising Diffusion Decoder [73.1010640692609]
We propose a VQ-VAE architecture model with a diffusion decoder (DiVAE) to work as the reconstructing component in image synthesis.
Our model achieves state-of-the-art results and generates more photorealistic images specifically.
arXiv Detail & Related papers (2022-06-01T10:39:12Z) - Image Super-Resolution With Deep Variational Autoencoders [10.62560651449376]
We introduce VDVAE-SR, a new model that aims to exploit the most recent deep VAE methodologies to improve upon image super-resolution.
We show that the proposed model is competitive with other state-of-the-art methods.
arXiv Detail & Related papers (2022-03-17T17:05:14Z) - A Generic Approach for Enhancing GANs by Regularized Latent Optimization [79.00740660219256]
We introduce a generic framework called em generative-model inference that is capable of enhancing pre-trained GANs effectively and seamlessly.
Our basic idea is to efficiently infer the optimal latent distribution for the given requirements using Wasserstein gradient flow techniques.
arXiv Detail & Related papers (2021-12-07T05:22:50Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.