1st Solution Places for CVPR 2023 UG$^{\textbf{2}}$+ Challenge Track
2.1-Text Recognition through Atmospheric Turbulence
- URL: http://arxiv.org/abs/2306.08963v1
- Date: Thu, 15 Jun 2023 08:56:51 GMT
- Title: 1st Solution Places for CVPR 2023 UG$^{\textbf{2}}$+ Challenge Track
2.1-Text Recognition through Atmospheric Turbulence
- Authors: Shengqi Xu, Xueyao Xiao, Shuning Cao, Yi Chang, Luxin Yan
- Abstract summary: We present the solution developed by our team VIELab-HUST for text recognition through atmospheric turbulence in Track 2.1 of the CVPR 2023 UG$2$+ challenge.
Our framework can handle both hot-air text dataset and turbulence text dataset provided in the final testing phase and achieved 1st place in text recognition accuracy.
- Score: 13.393698451466689
- License: http://creativecommons.org/licenses/by-sa/4.0/
- Abstract: In this technical report, we present the solution developed by our team
VIELab-HUST for text recognition through atmospheric turbulence in Track 2.1 of
the CVPR 2023 UG$^{2}$+ challenge. Our solution involves an efficient
multi-stage framework that restores a high-quality image from distorted frames.
Specifically, a frame selection algorithm based on sharpness is first utilized
to select the sharpest set of distorted frames. Next, each frame in the
selected frames is aligned to suppress geometric distortion through
optical-flow-based image registration. Then, a region-based image fusion method
with DT-CWT is utilized to mitigate the blur caused by the turbulence. Finally,
a learning-based deartifacts method is applied to remove the artifacts in the
fused image, generating a high-quality outuput. Our framework can handle both
hot-air text dataset and turbulence text dataset provided in the final testing
phase and achieved 1st place in text recognition accuracy. Our code will be
available at https://github.com/xsqhust/Turbulence_Removal.
Related papers
- Blind Image Deblurring with FFT-ReLU Sparsity Prior [1.179778723980276]
Blind image deblurring is the process of recovering a sharp image from a blurred one without prior knowledge about the blur kernel.
We introduce a method that leverages a prior which targets the blur kernel to achieve effective deblurring across a wide range of image types.
arXiv Detail & Related papers (2024-06-12T15:51:39Z) - ENTED: Enhanced Neural Texture Extraction and Distribution for
Reference-based Blind Face Restoration [51.205673783866146]
We present ENTED, a new framework for blind face restoration that aims to restore high-quality and realistic portrait images.
We utilize a texture extraction and distribution framework to transfer high-quality texture features between the degraded input and reference image.
The StyleGAN-like architecture in our framework requires high-quality latent codes to generate realistic images.
arXiv Detail & Related papers (2024-01-13T04:54:59Z) - TextDiff: Mask-Guided Residual Diffusion Models for Scene Text Image
Super-Resolution [18.73348268987249]
TextDiff is a diffusion-based framework tailored for scene text image super-resolution.
It achieves state-of-the-art (SOTA) performance on public benchmark datasets.
Our proposed MRD module is plug-and-play that effectively sharpens the text edges produced by SOTA methods.
arXiv Detail & Related papers (2023-08-13T11:02:16Z) - 1st Solution Places for CVPR 2023 UG$^2$+ Challenge Track 2.2-Coded
Target Restoration through Atmospheric Turbulence [12.484269899245515]
This report introduces the solution of our team VIELab-HUST for coded target restoration through atmospheric turbulence in CVPR 2023 UG$2$+ Track 2.2.
We propose an efficient multi-stage framework to restore a high quality image from distorted frames.
Our framework is capable of handling different kinds of coded target dataset provided in the final testing phase, and ranked 1st on the final leaderboard.
arXiv Detail & Related papers (2023-06-15T09:06:48Z) - CoordFill: Efficient High-Resolution Image Inpainting via Parameterized
Coordinate Querying [52.91778151771145]
In this paper, we try to break the limitations for the first time thanks to the recent development of continuous implicit representation.
Experiments show that the proposed method achieves real-time performance on the 2048$times$2048 images using a single GTX 2080 Ti GPU.
arXiv Detail & Related papers (2023-03-15T11:13:51Z) - Parallax-Tolerant Unsupervised Deep Image Stitching [57.76737888499145]
We propose UDIS++, a parallax-tolerant unsupervised deep image stitching technique.
First, we propose a robust and flexible warp to model the image registration from global homography to local thin-plate spline motion.
To further eliminate the parallax artifacts, we propose to composite the stitched image seamlessly by unsupervised learning for seam-driven composition masks.
arXiv Detail & Related papers (2023-02-16T10:40:55Z) - Let's Enhance: A Deep Learning Approach to Extreme Deblurring of Text
Images [3.441021278275805]
This work presents a novel deep-learning-based pipeline for the inverse problem of image deblurring.
Our results build on our winning submission to the recent Helsinki Deblurring Challenge 2021, whose goal was to explore the limits of state-of-the-art deblurring algorithms.
arXiv Detail & Related papers (2022-11-18T09:06:56Z) - Any-resolution Training for High-resolution Image Synthesis [55.19874755679901]
Generative models operate at fixed resolution, even though natural images come in a variety of sizes.
We argue that every pixel matters and create datasets with variable-size images, collected at their native resolutions.
We introduce continuous-scale training, a process that samples patches at random scales to train a new generator with variable output resolutions.
arXiv Detail & Related papers (2022-04-14T17:59:31Z) - Spatial-Separated Curve Rendering Network for Efficient and
High-Resolution Image Harmonization [59.19214040221055]
We propose a novel spatial-separated curve rendering network (S$2$CRNet) for efficient and high-resolution image harmonization.
The proposed method reduces more than 90% parameters compared with previous methods.
Our method can work smoothly on higher resolution images in real-time which is more than 10$times$ faster than the existing methods.
arXiv Detail & Related papers (2021-09-13T07:20:16Z) - InfinityGAN: Towards Infinite-Resolution Image Synthesis [92.40782797030977]
We present InfinityGAN, a method to generate arbitrary-resolution images.
We show how it trains and infers patch-by-patch seamlessly with low computational resources.
arXiv Detail & Related papers (2021-04-08T17:59:30Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.