LSwinSR: UAV Imagery Super-Resolution based on Linear Swin Transformer
- URL: http://arxiv.org/abs/2303.10232v1
- Date: Fri, 17 Mar 2023 20:14:10 GMT
- Title: LSwinSR: UAV Imagery Super-Resolution based on Linear Swin Transformer
- Authors: Rui Li and Xiaowei Zhao
- Abstract summary: Super-resolution technology is especially beneficial for Unmanned Aerial Vehicles (UAV)
In this paper, for the super-resolution of UAV images, a novel network based on the state-of-the-art Swin Transformer is proposed with better efficiency and competitive accuracy.
- Score: 7.3817359680010615
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Super-resolution, which aims to reconstruct high-resolution images from
low-resolution images, has drawn considerable attention and has been
intensively studied in computer vision and remote sensing communities. The
super-resolution technology is especially beneficial for Unmanned Aerial
Vehicles (UAV), as the amount and resolution of images captured by UAV are
highly limited by physical constraints such as flight altitude and load
capacity. In the wake of the successful application of deep learning methods in
the super-resolution task, in recent years, a series of super-resolution
algorithms have been developed. In this paper, for the super-resolution of UAV
images, a novel network based on the state-of-the-art Swin Transformer is
proposed with better efficiency and competitive accuracy. Meanwhile, as one of
the essential applications of the UAV is land cover and land use monitoring,
simple image quality assessments such as the Peak-Signal-to-Noise Ratio (PSNR)
and the Structural Similarity Index Measure (SSIM) are not enough to
comprehensively measure the performance of an algorithm. Therefore, we further
investigate the effectiveness of super-resolution methods using the accuracy of
semantic segmentation. The code will be available at
https://github.com/lironui/LSwinSR.
Related papers
- Recurrent Multi-scale Transformer for High-Resolution Salient Object
Detection [68.65338791283298]
Salient Object Detection (SOD) aims to identify and segment the most conspicuous objects in an image or video.
Traditional SOD methods are largely limited to low-resolution images, making them difficult to adapt to the development of High-Resolution SOD.
In this work, we first propose a new HRS10K dataset, which contains 10,500 high-quality annotated images at 2K-8K resolution.
arXiv Detail & Related papers (2023-08-07T17:49:04Z) - Cutting-Edge Techniques for Depth Map Super-Resolution [0.0]
To overcome hardware limitations in commercially available depth sensors, depth map super-resolution (DMSR) is a practical and valuable computer vision task.
Joint image filtering for DMSR has been applied using spatially-invariant and spatially-variant convolutional neural network (CNN) approaches.
In this project, we propose a novel joint image filtering DMSR algorithm using a Swin transformer architecture.
arXiv Detail & Related papers (2023-06-27T06:57:08Z) - A Vision Transformer Approach for Efficient Near-Field Irregular SAR
Super-Resolution [0.0]
We introduce a mobile-friend vision transformer (ViT) architecture to address position estimation error and perform SAR image super-resolution (SR) under irregular sampling geometries.
The proposed algorithm, Mobile-SRViT, is the first to employ a ViT approach for SAR image enhancement and is validated in simulation and via empirical studies.
arXiv Detail & Related papers (2023-05-03T12:25:01Z) - Exploring Resolution and Degradation Clues as Self-supervised Signal for
Low Quality Object Detection [77.3530907443279]
We propose a novel self-supervised framework to detect objects in degraded low resolution images.
Our methods has achieved superior performance compared with existing methods when facing variant degradation situations.
arXiv Detail & Related papers (2022-08-05T09:36:13Z) - Hierarchical Similarity Learning for Aliasing Suppression Image
Super-Resolution [64.15915577164894]
A hierarchical image super-resolution network (HSRNet) is proposed to suppress the influence of aliasing.
HSRNet achieves better quantitative and visual performance than other works, and remits the aliasing more effectively.
arXiv Detail & Related papers (2022-06-07T14:55:32Z) - Efficient Real-Time Image Recognition Using Collaborative Swarm of UAVs
and Convolutional Networks [9.449650062296824]
We present a strategy aiming at distributing inference requests to a swarm of resource-constrained UAVs that classifies captured images on-board.
We formulate the model as an optimization problem that minimizes the latency between acquiring images and making the final decisions.
We introduce an online solution, namely DistInference, to find the layers placement strategy that gives the best latency among the available UAVs.
arXiv Detail & Related papers (2021-07-09T19:47:02Z) - EmergencyNet: Efficient Aerial Image Classification for Drone-Based
Emergency Monitoring Using Atrous Convolutional Feature Fusion [8.634988828030245]
This article focuses on the efficient aerial image classification from on-board a UAV for emergency response/monitoring applications.
A dedicated Aerial Image Database for Emergency Response applications is introduced and a comparative analysis of existing approaches is performed.
A lightweight convolutional neural network architecture is proposed, referred to as EmergencyNet, based on atrous convolutions to process multiresolution features.
arXiv Detail & Related papers (2021-04-28T20:24:10Z) - Asymmetric CNN for image super-resolution [102.96131810686231]
Deep convolutional neural networks (CNNs) have been widely applied for low-level vision over the past five years.
We propose an asymmetric CNN (ACNet) comprising an asymmetric block (AB), a mem?ory enhancement block (MEB) and a high-frequency feature enhancement block (HFFEB) for image super-resolution.
Our ACNet can effectively address single image super-resolution (SISR), blind SISR and blind SISR of blind noise problems.
arXiv Detail & Related papers (2021-03-25T07:10:46Z) - Joint-SRVDNet: Joint Super Resolution and Vehicle Detection Network [17.57284924547865]
We propose a Joint Super-Resolution and Vehicle DetectionNetwork (Joint-SRVDNet) to generate discriminative, high-resolution images of vehicles.
Aerial images are up-scaled by a factor of 4x using a Multi-scaleGenerative Adversarial Network (MsGAN), which has multiple intermediate outputs with increasingresolutions.
The network jointlylearns hierarchical and discriminative features of targets and produces optimal super-resolution results.
arXiv Detail & Related papers (2020-05-03T04:28:44Z) - PULSE: Self-Supervised Photo Upsampling via Latent Space Exploration of
Generative Models [77.32079593577821]
PULSE (Photo Upsampling via Latent Space Exploration) generates high-resolution, realistic images at resolutions previously unseen in the literature.
Our method outperforms state-of-the-art methods in perceptual quality at higher resolutions and scale factors than previously possible.
arXiv Detail & Related papers (2020-03-08T16:44:31Z) - Gated Fusion Network for Degraded Image Super Resolution [78.67168802945069]
We propose a dual-branch convolutional neural network to extract base features and recovered features separately.
By decomposing the feature extraction step into two task-independent streams, the dual-branch model can facilitate the training process.
arXiv Detail & Related papers (2020-03-02T13:28:32Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.