Interpretable Detail-Fidelity Attention Network for Single Image
Super-Resolution
- URL: http://arxiv.org/abs/2009.13134v1
- Date: Mon, 28 Sep 2020 08:31:23 GMT
- Title: Interpretable Detail-Fidelity Attention Network for Single Image
Super-Resolution
- Authors: Yuanfei Huang, Jie Li, Xinbo Gao, Yanting Hu, Wen Lu
- Abstract summary: We propose a purposeful and interpretable detail-fidelity attention network to progressively process smoothes and details in divide-and-conquer manner.
Particularly, we propose a Hessian filtering for interpretable feature representation which is high-profile for detail inference.
Experiments demonstrate that the proposed methods achieve superior performances over the state-of-the-art methods.
- Score: 89.1947690981471
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Benefiting from the strong capabilities of deep CNNs for feature
representation and nonlinear mapping, deep-learning-based methods have achieved
excellent performance in single image super-resolution. However, most existing
SR methods depend on the high capacity of networks which is initially designed
for visual recognition, and rarely consider the initial intention of
super-resolution for detail fidelity. Aiming at pursuing this intention, there
are two challenging issues to be solved: (1) learning appropriate operators
which is adaptive to the diverse characteristics of smoothes and details; (2)
improving the ability of model to preserve the low-frequency smoothes and
reconstruct the high-frequency details. To solve them, we propose a purposeful
and interpretable detail-fidelity attention network to progressively process
these smoothes and details in divide-and-conquer manner, which is a novel and
specific prospect of image super-resolution for the purpose on improving the
detail fidelity, instead of blindly designing or employing the deep CNNs
architectures for merely feature representation in local receptive fields.
Particularly, we propose a Hessian filtering for interpretable feature
representation which is high-profile for detail inference, a dilated
encoder-decoder and a distribution alignment cell to improve the inferred
Hessian features in morphological manner and statistical manner respectively.
Extensive experiments demonstrate that the proposed methods achieve superior
performances over the state-of-the-art methods quantitatively and
qualitatively. Code is available at https://github.com/YuanfeiHuang/DeFiAN.
Related papers
- High-Precision Dichotomous Image Segmentation via Probing Diffusion Capacity [69.32473738284374]
We propose DiffDIS, a diffusion-driven segmentation model that taps into the potential of the pre-trained U-Net within diffusion models.
By leveraging the robust generalization capabilities and rich, versatile image representation prior to the SD models, we significantly reduce the inference time while preserving high-fidelity, detailed generation.
Experiments on the DIS5K dataset demonstrate the superiority of DiffDIS, achieving state-of-the-art results through a streamlined inference process.
arXiv Detail & Related papers (2024-10-14T02:49:23Z) - Learning Accurate and Enriched Features for Stereo Image Super-Resolution [0.0]
Stereo image super-resolution (stereoSR) aims to enhance the quality of super-resolution results by incorporating complementary information from an alternative view.
We propose a mixed-scale selective fusion network (MSSFNet) to preserve precise spatial details and incorporate abundant contextual information.
MSSFNet achieves significant improvements over state-of-the-art approaches on both quantitative and qualitative evaluations.
arXiv Detail & Related papers (2024-06-23T03:34:17Z) - An Advanced Features Extraction Module for Remote Sensing Image Super-Resolution [0.5461938536945723]
We propose an advanced feature extraction module called Channel and Spatial Attention Feature Extraction (CSA-FE)
Our proposed method helps the model focus on the specific channels and spatial locations containing high-frequency information so that the model can focus on relevant features and suppress irrelevant ones.
Our model achieved superior performance compared to various existing models.
arXiv Detail & Related papers (2024-05-07T18:15:51Z) - Learning Enriched Features for Fast Image Restoration and Enhancement [166.17296369600774]
This paper presents a holistic goal of maintaining spatially-precise high-resolution representations through the entire network.
We learn an enriched set of features that combines contextual information from multiple scales, while simultaneously preserving the high-resolution spatial details.
Our approach achieves state-of-the-art results for a variety of image processing tasks, including defocus deblurring, image denoising, super-resolution, and image enhancement.
arXiv Detail & Related papers (2022-04-19T17:59:45Z) - Bayesian Attention Belief Networks [59.183311769616466]
Attention-based neural networks have achieved state-of-the-art results on a wide range of tasks.
This paper introduces Bayesian attention belief networks, which construct a decoder network by modeling unnormalized attention weights.
We show that our method outperforms deterministic attention and state-of-the-art attention in accuracy, uncertainty estimation, generalization across domains, and adversarial attacks.
arXiv Detail & Related papers (2021-06-09T17:46:22Z) - Resolution Adaptive Networks for Efficient Inference [53.04907454606711]
We propose a novel Resolution Adaptive Network (RANet), which is inspired by the intuition that low-resolution representations are sufficient for classifying "easy" inputs.
In RANet, the input images are first routed to a lightweight sub-network that efficiently extracts low-resolution representations.
High-resolution paths in the network maintain the capability to recognize the "hard" samples.
arXiv Detail & Related papers (2020-03-16T16:54:36Z) - Learning Enriched Features for Real Image Restoration and Enhancement [166.17296369600774]
convolutional neural networks (CNNs) have achieved dramatic improvements over conventional approaches for image restoration task.
We present a novel architecture with the collective goals of maintaining spatially-precise high-resolution representations through the entire network.
Our approach learns an enriched set of features that combines contextual information from multiple scales, while simultaneously preserving the high-resolution spatial details.
arXiv Detail & Related papers (2020-03-15T11:04:30Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.