Multi-Attention Based Ultra Lightweight Image Super-Resolution
- URL: http://arxiv.org/abs/2008.12912v2
- Date: Mon, 21 Sep 2020 06:07:14 GMT
- Title: Multi-Attention Based Ultra Lightweight Image Super-Resolution
- Authors: Abdul Muqeet, Jiwon Hwang, Subin Yang, Jung Heum Kang, Yongwoo Kim,
Sung-Ho Bae
- Abstract summary: We propose a Multi-Attentive Feature Fusion Super-Resolution Network (MAFFSRN).
MAFFSRN consists of proposed feature fusion groups (FFGs) that serve as a feature extraction block.
We participated in AIM 2020 efficient SR challenge with our MAFFSRN model and won 1st, 3rd, and 4th places in memory usage, floating-point operations (FLOPs) and number of parameters, respectively.
- Score: 9.819866781885446
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Lightweight image super-resolution (SR) networks have the utmost significance
for real-world applications. There are several deep learning based SR methods
with remarkable performance, but their memory and computational cost are
hindrances in practical usage. To tackle this problem, we propose a
Multi-Attentive Feature Fusion Super-Resolution Network (MAFFSRN). MAFFSRN
consists of proposed feature fusion groups (FFGs) that serve as a feature
extraction block. Each FFG contains a stack of proposed multi-attention blocks
(MAB) that are combined in a novel feature fusion structure. Further, the MAB
with a cost-efficient attention mechanism (CEA) helps us to refine and extract
the features using multiple attention mechanisms. The comprehensive experiments
show the superiority of our model over the existing state-of-the-art. We
participated in AIM 2020 efficient SR challenge with our MAFFSRN model and won
1st, 3rd, and 4th places in memory usage, floating-point operations (FLOPs) and
number of parameters, respectively.
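The challenge ranking above rests on parameter count and FLOPs. As a rough illustration of how such figures are usually tallied for one standard convolution layer, here is a minimal sketch. The layer sizes are hypothetical and are not taken from MAFFSRN, and the function names are invented for this example. Conventions also vary: some reports count one multiply-accumulate (MAC) as one FLOP, others as two, so the sketch returns MACs and leaves that choice to the reader.

```python
def conv2d_params(in_ch, out_ch, kernel, bias=True):
    """Learnable parameters in a standard (non-grouped) 2D convolution."""
    weights = in_ch * out_ch * kernel * kernel
    return weights + (out_ch if bias else 0)


def conv2d_macs(in_ch, out_ch, kernel, out_h, out_w):
    """Multiply-accumulates for one forward pass, bias additions ignored."""
    # Each output pixel of each output channel needs in_ch * kernel^2 MACs.
    return in_ch * out_ch * kernel * kernel * out_h * out_w


# Hypothetical layer: 3x3 convolution, 32 -> 32 channels,
# producing a 180 x 320 feature map (values chosen only for illustration).
params = conv2d_params(32, 32, 3)
macs = conv2d_macs(32, 32, 3, out_h=180, out_w=320)
print(f"parameters: {params:,}")
print(f"MACs:       {macs:,}")
```

Summing these two quantities over every layer gives the model-level totals. Because the MAC count scales with the output resolution, efficiency comparisons are only meaningful at a fixed input size.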
Related papers
- MAT: Multi-Range Attention Transformer for Efficient Image Super-Resolution [14.265237560766268]
A flexible integration of attention across diverse spatial extents can yield significant performance enhancements.
We introduce Multi-Range Attention Transformer (MAT) tailored for Super Resolution (SR) tasks.
MAT adeptly captures dependencies across various spatial ranges, improving the diversity and efficacy of its feature representations.
arXiv Detail & Related papers (2024-11-26T08:30:31Z)
- A Lightweight Attention-based Deep Network via Multi-Scale Feature Fusion for Multi-View Facial Expression Recognition [2.9581436761331017]
We introduce a lightweight attentional network incorporating multi-scale feature fusion (LANMSFF) to tackle these issues.
We present two novel components, namely mass attention (MassAtt) and point wise feature selection (PWFS) blocks.
Our proposed approach achieved results comparable to state-of-the-art methods in terms of parameter counts and robustness to pose variation.
arXiv Detail & Related papers (2024-03-21T11:40:51Z)
- Accurate and lightweight dehazing via multi-receptive-field non-local network and novel contrastive regularization [9.90146712189936]
This paper presents a multi-receptive-field non-local network (MRFNLN) for image dehazing.
It is designed as a multi-stream feature attention block (MSFAB) and a cross non-local block (CNLB).
It outperforms recent state-of-the-art dehazing methods with fewer than 1.5 million parameters.
arXiv Detail & Related papers (2023-09-28T14:59:16Z)
- Searching a Compact Architecture for Robust Multi-Exposure Image Fusion [55.37210629454589]
Two major stumbling blocks, pixel misalignment and inefficient inference, hinder the development of multi-exposure image fusion.
This study introduces an architecture search-based paradigm incorporating self-alignment and detail repletion modules for robust multi-exposure image fusion.
The proposed method outperforms various competitive schemes, achieving a noteworthy 3.19% improvement in PSNR for general scenarios and an impressive 23.5% enhancement in misaligned scenarios.
arXiv Detail & Related papers (2023-05-20T17:01:52Z)
- Spatially-Adaptive Feature Modulation for Efficient Image Super-Resolution [90.16462805389943]
We develop a spatially-adaptive feature modulation (SAFM) mechanism upon a vision transformer (ViT)-like block.
The proposed method is $3\times$ smaller than state-of-the-art efficient SR methods.
arXiv Detail & Related papers (2023-02-27T14:19:31Z)
- CDDFuse: Correlation-Driven Dual-Branch Feature Decomposition for Multi-Modality Image Fusion [138.40422469153145]
We propose a novel Correlation-Driven feature Decomposition Fusion (CDDFuse) network.
We show that CDDFuse achieves promising results in multiple fusion tasks, including infrared-visible image fusion and medical image fusion.
arXiv Detail & Related papers (2022-11-26T02:40:28Z)
- Feature Distillation Interaction Weighting Network for Lightweight Image Super-Resolution [25.50790871331823]
We propose a lightweight yet efficient Feature Distillation Interaction Weighted Network (FDIWN).
FDIWN strikes a better balance between model performance and efficiency than other models.
arXiv Detail & Related papers (2021-12-16T06:20:35Z)
- EPMF: Efficient Perception-aware Multi-sensor Fusion for 3D Semantic Segmentation [62.210091681352914]
We study multi-sensor fusion for 3D semantic segmentation, which matters for many applications such as autonomous driving and robotics.
In this work, we investigate a collaborative fusion scheme called perception-aware multi-sensor fusion (PMF).
We propose a two-stream network to extract features from the two modalities separately. The extracted features are fused by effective residual-based fusion modules.
arXiv Detail & Related papers (2021-06-21T10:47:26Z)
- Lightweight Image Super-Resolution with Multi-scale Feature Interaction Network [15.846394239848959]
We present a lightweight multi-scale feature interaction network (MSFIN).
For lightweight SISR, MSFIN expands the receptive field and adequately exploits the informative features of the low-resolution observed images.
Our proposed MSFIN achieves performance comparable to state-of-the-art methods with a more lightweight model.
arXiv Detail & Related papers (2021-03-24T07:25:21Z)
- Fully Quantized Image Super-Resolution Networks [81.75002888152159]
We propose a Fully Quantized image Super-Resolution framework (FQSR) to jointly optimize efficiency and accuracy.
We apply our quantization scheme on multiple mainstream super-resolution architectures, including SRResNet, SRGAN and EDSR.
Our FQSR, using low-bit quantization, achieves performance on par with its full-precision counterparts on five benchmark datasets.
arXiv Detail & Related papers (2020-11-29T03:53:49Z)
- Lightweight Single-Image Super-Resolution Network with Attentive Auxiliary Feature Learning [73.75457731689858]
We develop a computation-efficient yet accurate network based on the proposed attentive auxiliary features (A$^2$F) for SISR.
Experimental results on large-scale datasets demonstrate the effectiveness of the proposed model against state-of-the-art (SOTA) SR methods.
arXiv Detail & Related papers (2020-11-13T06:01:46Z)
This list is automatically generated from the titles and abstracts of the papers in this site.