From Coarse to Fine: Hierarchical Pixel Integration for Lightweight
Image Super-Resolution
- URL: http://arxiv.org/abs/2211.16776v1
- Date: Wed, 30 Nov 2022 06:32:34 GMT
- Title: From Coarse to Fine: Hierarchical Pixel Integration for Lightweight
Image Super-Resolution
- Authors: Jie Liu, Chao Chen, Jie Tang, Gangshan Wu
- Abstract summary: Transformer-based models have achieved competitive performances in image super-resolution (SR)
We propose a new attention block whose insights are from the interpretation of Local Map (LAM) for SR networks.
In the fine area, we use an Intra-Patch Self-Attention Attribution (IPSA) module to model long-range pixel dependencies in a local patch.
- Score: 41.0555613285837
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Image super-resolution (SR) serves as a fundamental tool for the processing
and transmission of multimedia data. Recently, Transformer-based models have
achieved competitive performances in image SR. They divide images into
fixed-size patches and apply self-attention on these patches to model
long-range dependencies among pixels. However, this architecture design is
originated for high-level vision tasks, which lacks design guideline from SR
knowledge. In this paper, we aim to design a new attention block whose insights
are from the interpretation of Local Attribution Map (LAM) for SR networks.
Specifically, LAM presents a hierarchical importance map where the most
important pixels are located in a fine area of a patch and some less important
pixels are spread in a coarse area of the whole image. To access pixels in the
coarse area, instead of using a very large patch size, we propose a lightweight
Global Pixel Access (GPA) module that applies cross-attention with the most
similar patch in an image. In the fine area, we use an Intra-Patch
Self-Attention (IPSA) module to model long-range pixel dependencies in a local
patch, and then a $3\times3$ convolution is applied to process the finest
details. In addition, a Cascaded Patch Division (CPD) strategy is proposed to
enhance perceptual quality of recovered images. Extensive experiments suggest
that our method outperforms state-of-the-art lightweight SR methods by a large
margin. Code is available at https://github.com/passerer/HPINet.
Related papers
- Accelerating Image Super-Resolution Networks with Pixel-Level Classification [29.010136088811137]
Pixel-level for Single Image SuperResolution is a novel method designed to distribute computational resources adaptively at the pixel level.
Our method allows for performance and computational cost balance during inference without re-training.
arXiv Detail & Related papers (2024-07-31T08:53:10Z) - Parameter-Inverted Image Pyramid Networks [49.35689698870247]
We propose a novel network architecture known as the Inverted Image Pyramid Networks (PIIP)
Our core idea is to use models with different parameter sizes to process different resolution levels of the image pyramid.
PIIP achieves superior performance in tasks such as object detection, segmentation, and image classification.
arXiv Detail & Related papers (2024-06-06T17:59:10Z) - Parallel Cross Strip Attention Network for Single Image Dehazing [15.246376325081973]
Single image dehazing aims to restore hazy images and produce clear, high-quality visuals.
Traditional convolutional models struggle with long-range dependencies due to limited receptive field size.
We introduce a novel dehazing network based on Parallel Stripe Cross Attention (PCSA) with a multi-scale strategy.
arXiv Detail & Related papers (2024-05-09T14:50:07Z) - Pixel-Inconsistency Modeling for Image Manipulation Localization [59.968362815126326]
Digital image forensics plays a crucial role in image authentication and manipulation localization.
This paper presents a generalized and robust manipulation localization model through the analysis of pixel inconsistency artifacts.
Experiments show that our method successfully extracts inherent pixel-inconsistency forgery fingerprints.
arXiv Detail & Related papers (2023-09-30T02:54:51Z) - Super-Resolution of License Plate Images Using Attention Modules and
Sub-Pixel Convolution Layers [3.8831062015253055]
We introduce a Single-Image Super-Resolution (SISR) approach to enhance the detection of structural and textural features in surveillance images.
Our approach incorporates sub-pixel convolution layers and a loss function that uses an Optical Character Recognition (OCR) model for feature extraction.
Our results show that our approach for reconstructing these low-resolution synthesized images outperforms existing ones in both quantitative and qualitative measures.
arXiv Detail & Related papers (2023-05-27T00:17:19Z) - Probabilistic Deep Metric Learning for Hyperspectral Image
Classification [91.5747859691553]
This paper proposes a probabilistic deep metric learning framework for hyperspectral image classification.
It aims to predict the category of each pixel for an image captured by hyperspectral sensors.
Our framework can be readily applied to existing hyperspectral image classification methods.
arXiv Detail & Related papers (2022-11-15T17:57:12Z) - HIPA: Hierarchical Patch Transformer for Single Image Super Resolution [62.7081074931892]
This paper presents HIPA, a novel Transformer architecture that progressively recovers the high resolution image using a hierarchical patch partition.
We build a cascaded model that processes an input image in multiple stages, where we start with tokens with small patch sizes and gradually merge to the full resolution.
Such a hierarchical patch mechanism not only explicitly enables feature aggregation at multiple resolutions but also adaptively learns patch-aware features for different image regions.
arXiv Detail & Related papers (2022-03-19T05:09:34Z) - Patch-Based Stochastic Attention for Image Editing [4.8201607588546]
We propose an efficient attention layer based on the algorithm PatchMatch, which is used for determining approximate nearest neighbors.
We demonstrate the usefulness of PSAL on several image editing tasks, such as image inpainting, guided image colorization, and single-image super-resolution.
arXiv Detail & Related papers (2022-02-07T13:42:00Z) - Generating Superpixels for High-resolution Images with Decoupled Patch
Calibration [82.21559299694555]
Patch Networks (PCNet) is designed to efficiently and accurately implement high-resolution superpixel segmentation.
DPC takes a local patch from the high-resolution images and dynamically generates a binary mask to impose the network to focus on region boundaries.
In particular, DPC takes a local patch from the high-resolution images and dynamically generates a binary mask to impose the network to focus on region boundaries.
arXiv Detail & Related papers (2021-08-19T10:33:05Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.