Sequential Hierarchical Learning with Distribution Transformation for
Image Super-Resolution
- URL: http://arxiv.org/abs/2007.09552v4
- Date: Wed, 3 May 2023 11:35:41 GMT
- Title: Sequential Hierarchical Learning with Distribution Transformation for
Image Super-Resolution
- Authors: Yuqing Liu and Xinfeng Zhang and Shanshe Wang and Siwei Ma and Wen Gao
- Abstract summary: We build a sequential hierarchical learning super-resolution network (SHSR) for effective image SR.
We consider the inter-scale correlations of features, and devise a sequential multi-scale block (SMB) to progressively explore the hierarchical information.
Experimental results show SHSR achieves superior quantitative performance and visual quality to state-of-the-art methods.
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Multi-scale design has been considered in recent image super-resolution (SR)
works to explore the hierarchical feature information. Existing multi-scale
networks aim to build elaborate blocks or progressive architectures for
restoration. In general, larger scale features concentrate more on structural
and high-level information, while smaller scale features contain plentiful
details and textured information. From this point of view, information from
larger scale features can be derived from smaller ones. Based on this
observation, in this paper, we build a sequential hierarchical learning
super-resolution network (SHSR) for effective image SR. Specifically, we consider
the inter-scale correlations of features, and devise a sequential multi-scale
block (SMB) to progressively explore the hierarchical information. SMB is
designed in a recursive way based on the linearity of convolution with
restricted parameters. Besides the sequential hierarchical learning, we also
investigate the correlations among the feature maps and devise a distribution
transformation block (DTB). Different from attention-based methods, DTB regards
the transformation in a normalization manner, and jointly considers the spatial
and channel-wise correlations with scaling and bias factors. Experimental results
show SHSR achieves superior quantitative performance and visual quality to
state-of-the-art methods with nearly 34\% fewer parameters and 50\% fewer MACs
when the scaling factor is $\times4$. To boost the performance without further
training, the extension model SHSR$^+$ with self-ensemble achieves performance
competitive with larger networks with nearly 92\% fewer parameters and 42\%
fewer MACs at scaling factor $\times4$.
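The abstract says SMB is designed recursively, based on the linearity of convolution with restricted parameters. The NumPy sketch below illustrates only that underlying property, not the authors' SMB: two 3x3 convolutions applied in sequence give the same result as one 5x5 convolution, so a larger-scale response can be derived from a smaller-scale one. The helper name `conv2d_full`, the single-channel setup, and the random inputs are assumptions made for illustration. A real block would presumably also involve multiple channels and nonlinearities.

```python
import numpy as np


def conv2d_full(x, k):
    """Full 2-D convolution of x with kernel k, computed by shift-and-add.

    Hypothetical helper for illustration; output size is (H+kh-1, W+kw-1).
    """
    H, W = x.shape
    kh, kw = k.shape
    out = np.zeros((H + kh - 1, W + kw - 1))
    for i in range(kh):
        for j in range(kw):
            # Each kernel tap contributes a shifted, scaled copy of the input.
            out[i:i + H, j:j + W] += k[i, j] * x
    return out


rng = np.random.default_rng(0)
x = rng.standard_normal((8, 8))    # toy single-channel feature map
k1 = rng.standard_normal((3, 3))   # first small-scale kernel
k2 = rng.standard_normal((3, 3))   # second small-scale kernel

# Sequential path: reuse the 3x3 response to obtain the larger-scale response.
small_scale = conv2d_full(x, k1)
sequential = conv2d_full(small_scale, k2)

# Equivalent single kernel: convolving the two 3x3 kernels yields a 5x5 kernel.
k_equiv = conv2d_full(k1, k2)
single = conv2d_full(x, k_equiv)

# Two 3x3 kernels hold 18 weights, a free 5x5 kernel holds 25.
params_sequential = k1.size + k2.size
params_single = k_equiv.size

print(k_equiv.shape, np.allclose(sequential, single))
print(params_sequential, params_single)
```

In this toy setting the sequential path covers a 5x5 receptive field with 18 weights instead of 25, which is one plausible reading of "restricted parameters". The composed kernel is, however, limited to those 5x5 kernels that factor into two 3x3 kernels.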
Related papers
- Multi-scale Unified Network for Image Classification [33.560003528712414]
CNNs face notable challenges in performance and computational efficiency when dealing with real-world, multi-scale image inputs.
We propose Multi-scale Unified Network (MUSN) consisting of multi-scales, a unified network, and scale-invariant constraint.
MUSN yields an accuracy increase up to 44.53% and diminishes FLOPs by 7.01-16.13% in multi-scale scenarios.
arXiv Detail & Related papers (2024-03-27T06:40:26Z) - Hi-ResNet: Edge Detail Enhancement for High-Resolution Remote Sensing Segmentation [10.919956120261539]
High-resolution remote sensing (HRS) semantic segmentation extracts key objects from high-resolution coverage areas.
Objects of the same category within HRS images show significant differences in scale and shape across diverse geographical environments.
We propose a High-resolution remote sensing network (Hi-ResNet) with efficient network structure designs.
arXiv Detail & Related papers (2023-05-22T03:58:25Z) - Multi-level Second-order Few-shot Learning [111.0648869396828]
We propose a Multi-level Second-order (MlSo) few-shot learning network for supervised or unsupervised few-shot image classification and few-shot action recognition.
We leverage so-called power-normalized second-order base learner streams combined with features that express multiple levels of visual abstraction.
We demonstrate respectable results on standard datasets such as Omniglot, mini-ImageNet, tiered-ImageNet, Open MIC, fine-grained datasets such as CUB Birds, Stanford Dogs and Cars, and action recognition datasets such as HMDB51, UCF101, and mini-MIT.
arXiv Detail & Related papers (2022-01-15T19:49:00Z) - Learning to Aggregate Multi-Scale Context for Instance Segmentation in
Remote Sensing Images [28.560068780733342]
A novel context aggregation network (CATNet) is proposed to improve the feature extraction process.
The proposed model exploits three lightweight plug-and-play modules, namely dense feature pyramid network (DenseFPN), spatial context pyramid (SCP), and hierarchical region of interest extractor (HRoIE).
arXiv Detail & Related papers (2021-11-22T08:55:25Z) - High-resolution Depth Maps Imaging via Attention-based Hierarchical
Multi-modal Fusion [84.24973877109181]
We propose a novel attention-based hierarchical multi-modal fusion network for guided DSR.
We show that our approach outperforms state-of-the-art methods in terms of reconstruction accuracy, running speed and memory efficiency.
arXiv Detail & Related papers (2021-04-04T03:28:33Z) - Multi-Stage Progressive Image Restoration [167.6852235432918]
We propose a novel synergistic design that can optimally balance these competing goals.
Our main proposal is a multi-stage architecture, that progressively learns restoration functions for the degraded inputs.
The resulting tightly interlinked multi-stage architecture, named as MPRNet, delivers strong performance gains on ten datasets.
arXiv Detail & Related papers (2021-02-04T18:57:07Z) - Joint Self-Attention and Scale-Aggregation for Self-Calibrated Deraining
Network [13.628218953897946]
In this paper, we propose an effective algorithm, called JDNet, to solve the single image deraining problem.
By designing the Scale-Aggregation and Self-Attention modules with Self-Calibrated convolution skillfully, the proposed model has better deraining results.
arXiv Detail & Related papers (2020-08-06T17:04:34Z) - Learning Enriched Features for Real Image Restoration and Enhancement [166.17296369600774]
Convolutional neural networks (CNNs) have achieved dramatic improvements over conventional approaches for image restoration tasks.
We present a novel architecture with the collective goals of maintaining spatially-precise high-resolution representations through the entire network.
Our approach learns an enriched set of features that combines contextual information from multiple scales, while simultaneously preserving the high-resolution spatial details.
arXiv Detail & Related papers (2020-03-15T11:04:30Z) - Crowd Counting via Hierarchical Scale Recalibration Network [61.09833400167511]
We propose a novel Hierarchical Scale Recalibration Network (HSRNet) to tackle the task of crowd counting.
HSRNet models rich contextual dependencies and recalibrates multiple scale-associated information.
Our approach can ignore various noises selectively and focus on appropriate crowd scales automatically.
arXiv Detail & Related papers (2020-03-07T10:06:47Z) - Multi-Level Feature Fusion Mechanism for Single Image Super-Resolution [0.0]
Convolutional neural networks (CNNs) have been widely used in Single Image Super Resolution (SISR).
Most CNN-based SISR methods do not make full use of hierarchical features and the learning ability of the network.
A novel Multi-Level Feature Fusion network (MLRN) is proposed, which can make full use of global intermediate features.
arXiv Detail & Related papers (2020-02-14T10:47:40Z)