Deep Residual Fourier Transformation for Single Image Deblurring
- URL: http://arxiv.org/abs/2111.11745v1
- Date: Tue, 23 Nov 2021 09:40:40 GMT
- Title: Deep Residual Fourier Transformation for Single Image Deblurring
- Authors: Xintian Mao, Yiming Liu, Wei Shen, Qingli Li, Yan Wang
- Abstract summary: Reconstructing a sharp image from its blurry counterpart requires changes regarding both low- and high-frequency information.
We present a Residual Fast Fourier Transform with Convolution Block (Res FFT-Conv Block) capable of capturing both long-term and short-term interactions.
- Score: 12.674752421170547
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: It has been a common practice to adopt the ResBlock, which learns the
difference between blurry and sharp image pairs, in end-to-end image deblurring
architectures. Reconstructing a sharp image from its blurry counterpart
requires changes regarding both low- and high-frequency information. Although
conventional ResBlock may have good abilities in capturing the high-frequency
components of images, it tends to overlook the low-frequency information.
Moreover, ResBlock usually fails to felicitously model the long-distance
information which is non-trivial in reconstructing a sharp image from its
blurry counterpart. In this paper, we present a Residual Fast Fourier Transform
with Convolution Block (Res FFT-Conv Block), capable of capturing both
long-term and short-term interactions, while integrating both low- and
high-frequency residual information. Res FFT-Conv Block is a conceptually
simple yet computationally efficient, and plug-and-play block, leading to
remarkable performance gains in different architectures. With Res FFT-Conv
Block, we further propose a Deep Residual Fourier Transformation (DeepRFT)
framework, based upon MIMO-UNet, achieving state-of-the-art image deblurring
performance on GoPro, HIDE, RealBlur and DPDD datasets. Experiments show our
DeepRFT can boost image deblurring performance significantly (e.g., with 1.09
dB improvement in PSNR on GoPro dataset compared with MIMO-UNet), and DeepRFT+
even reaches 33.23 dB in PSNR on GoPro dataset.
Related papers
- LIPT: Latency-aware Image Processing Transformer [17.802838753201385]
We present a latency-aware image processing transformer, termed LIPT.
We devise the low-latency proportion LIPT block that substitutes memory-intensive operators with the combination of self-attention and convolutions to achieve practical speedup.
arXiv Detail & Related papers (2024-04-09T07:25:30Z) - Distance Weighted Trans Network for Image Completion [52.318730994423106]
We propose a new architecture that relies on Distance-based Weighted Transformer (DWT) to better understand the relationships between an image's components.
CNNs are used to augment the local texture information of coarse priors.
DWT blocks are used to recover certain coarse textures and coherent visual structures.
arXiv Detail & Related papers (2023-10-11T12:46:11Z) - Towards Real-World Burst Image Super-Resolution: Benchmark and Method [93.73429028287038]
In this paper, we establish a large-scale real-world burst super-resolution dataset, i.e., RealBSR, to explore the faithful reconstruction of image details from multiple frames.
We also introduce a Federated Burst Affinity network (FBAnet) to investigate non-trivial pixel-wise displacement among images under real-world image degradation.
arXiv Detail & Related papers (2023-09-09T14:11:37Z) - RBSR: Efficient and Flexible Recurrent Network for Burst
Super-Resolution [57.98314517861539]
Burst super-resolution (BurstSR) aims at reconstructing a high-resolution (HR) image from a sequence of low-resolution (LR) and noisy images.
In this paper, we suggest fusing cues frame-by-frame with an efficient and flexible recurrent network.
arXiv Detail & Related papers (2023-06-30T12:14:13Z) - Asymmetric Learned Image Compression with Multi-Scale Residual Block,
Importance Map, and Post-Quantization Filtering [15.056672221375104]
Deep learning-based image compression has achieved better ratedistortion (R-D) performance than the latest traditional method, H.266/VVC.
Many leading learned schemes cannot maintain a good trade-off between performance and complexity.
We propose an effcient and effective image coding framework, which achieves similar R-D performance with lower complexity than the state of the art.
arXiv Detail & Related papers (2022-06-21T09:34:29Z) - UHD Image Deblurring via Multi-scale Cubic-Mixer [12.402054374952485]
transformer-based algorithms are making a splash in the domain of image deblurring.
These algorithms depend on the self-attention mechanism with CNN stem to model long range dependencies between tokens.
arXiv Detail & Related papers (2022-06-08T05:04:43Z) - Wavelet-Based Network For High Dynamic Range Imaging [64.66969585951207]
Existing methods, such as optical flow based and end-to-end deep learning based solutions, are error-prone either in detail restoration or ghosting artifacts removal.
In this work, we propose a novel frequency-guided end-to-end deep neural network (FNet) to conduct HDR fusion in the frequency domain, and Wavelet Transform (DWT) is used to decompose inputs into different frequency bands.
The low-frequency signals are used to avoid specific ghosting artifacts, while the high-frequency signals are used for preserving details.
arXiv Detail & Related papers (2021-08-03T12:26:33Z) - Asymmetric CNN for image super-resolution [102.96131810686231]
Deep convolutional neural networks (CNNs) have been widely applied for low-level vision over the past five years.
We propose an asymmetric CNN (ACNet) comprising an asymmetric block (AB), a mem?ory enhancement block (MEB) and a high-frequency feature enhancement block (HFFEB) for image super-resolution.
Our ACNet can effectively address single image super-resolution (SISR), blind SISR and blind SISR of blind noise problems.
arXiv Detail & Related papers (2021-03-25T07:10:46Z) - Deep Burst Super-Resolution [165.90445859851448]
We propose a novel architecture for the burst super-resolution task.
Our network takes multiple noisy RAW images as input, and generates a denoised, super-resolved RGB image as output.
In order to enable training and evaluation on real-world data, we additionally introduce the BurstSR dataset.
arXiv Detail & Related papers (2021-01-26T18:57:21Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.