Remote Sensing Image Classification using Transfer Learning and
Attention Based Deep Neural Network
- URL: http://arxiv.org/abs/2206.13392v1
- Date: Mon, 20 Jun 2022 10:05:38 GMT
- Title: Remote Sensing Image Classification using Transfer Learning and
Attention Based Deep Neural Network
- Authors: Lam Pham, Khoa Tran, Dat Ngo, Jasmin Lampert, Alexander Schindler
- Abstract summary: We propose a deep learning based framework for RSISC, which makes use of the transfer learning technique and multihead attention scheme.
The proposed deep learning framework is evaluated on the benchmark NWPU-RESISC45 dataset and achieves the best classification accuracy of 94.7%.
- Score: 59.86658316440461
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The task of remote sensing image scene classification (RSISC), which aims at
classifying remote sensing images into groups of semantic categories based on
their contents, has taken the important role in a wide range of applications
such as urban planning, natural hazards detection, environment
monitoring,vegetation mapping, or geospatial object detection. During the past
years, research community focusing on RSISC task has shown significant effort
to publish diverse datasets as well as propose different approaches to deal
with the RSISC challenges. Recently, almost proposed RSISC systems base on deep
learning models which prove powerful and outperform traditional approaches
using image processing and machine learning. In this paper, we also leverage
the power of deep learning technology, evaluate a variety of deep neural
network architectures, indicate main factors affecting the performance of a
RSISC system. Given the comprehensive analysis, we propose a deep learning
based framework for RSISC, which makes use of the transfer learning technique
and multihead attention scheme. The proposed deep learning framework is
evaluated on the benchmark NWPU-RESISC45 dataset and achieves the best
classification accuracy of 94.7% which shows competitive to the
state-of-the-art systems and potential for real-life applications.
Related papers
- SAFE: a SAR Feature Extractor based on self-supervised learning and masked Siamese ViTs [5.961207817077044]
We propose a novel self-supervised learning framework based on masked Siamese Vision Transformers to create a General SAR Feature Extractor coined SAFE.
Our method leverages contrastive learning principles to train a model on unlabeled SAR data, extracting robust and generalizable features.
We introduce tailored data augmentation techniques specific to SAR imagery, such as sub-aperture decomposition and despeckling.
Our network competes with or surpasses other state-of-the-art methods in few-shot classification and segmentation tasks, even without being trained on the sensors used for the evaluation.
arXiv Detail & Related papers (2024-06-30T23:11:20Z) - Dehazing Remote Sensing and UAV Imagery: A Review of Deep Learning, Prior-based, and Hybrid Approaches [4.516330345599765]
High-quality images are crucial in remote sensing and UAV applications.
atmospheric haze can severely degrade image quality, making image dehazing a critical research area.
arXiv Detail & Related papers (2024-05-13T07:35:24Z) - ADASR: An Adversarial Auto-Augmentation Framework for Hyperspectral and
Multispectral Data Fusion [54.668445421149364]
Deep learning-based hyperspectral image (HSI) super-resolution aims to generate high spatial resolution HSI (HR-HSI) by fusing hyperspectral image (HSI) and multispectral image (MSI) with deep neural networks (DNNs)
In this letter, we propose a novel adversarial automatic data augmentation framework ADASR that automatically optimize and augments HSI-MSI sample pairs to enrich data diversity for HSI-MSI fusion.
arXiv Detail & Related papers (2023-10-11T07:30:37Z) - Multi-spectral Class Center Network for Face Manipulation Detection and Localization [52.569170436393165]
We propose a novel Multi-Spectral Class Center Network (MSCCNet) for face manipulation detection and localization.
Based on the features of different frequency bands, the MSCC module collects multi-spectral class centers and computes pixel-to-class relations.
Applying multi-spectral class-level representations suppresses the semantic information of the visual concepts which is insensitive to manipulated regions of forgery images.
arXiv Detail & Related papers (2023-05-18T08:09:20Z) - Impact of a DCT-driven Loss in Attention-based Knowledge-Distillation
for Scene Recognition [64.29650787243443]
We propose and analyse the use of a 2D frequency transform of the activation maps before transferring them.
This strategy enhances knowledge transferability in tasks such as scene recognition.
We publicly release the training and evaluation framework used along this paper at http://www.vpu.eps.uam.es/publications/DCTBasedKDForSceneRecognition.
arXiv Detail & Related papers (2022-05-04T11:05:18Z) - RSBNet: One-Shot Neural Architecture Search for A Backbone Network in
Remote Sensing Image Recognition [43.95699860302204]
We propose a new design paradigm for the backbone architecture in RSI recognition tasks, including scene classification, land-cover classification and object detection.
A novel one-shot architecture search framework based on weight-sharing strategy and evolutionary algorithm is proposed, called RSBNet.
Extensive experiments have been conducted on five benchmark datasets for different recognition tasks, the results show the effectiveness of the proposed search paradigm.
arXiv Detail & Related papers (2021-12-07T02:44:16Z) - Multi-Content Complementation Network for Salient Object Detection in
Optical Remote Sensing Images [108.79667788962425]
salient object detection in optical remote sensing images (RSI-SOD) remains to be a challenging emerging topic.
We propose a novel Multi-Content Complementation Network (MCCNet) to explore the complementarity of multiple content for RSI-SOD.
In MCCM, we consider multiple types of features that are critical to RSI-SOD, including foreground features, edge features, background features, and global image-level features.
arXiv Detail & Related papers (2021-12-02T04:46:40Z) - Learnable Multi-level Frequency Decomposition and Hierarchical Attention
Mechanism for Generalized Face Presentation Attack Detection [7.324459578044212]
Face presentation attack detection (PAD) is attracting a lot of attention and playing a key role in securing face recognition systems.
We propose a dual-stream convolution neural networks (CNNs) framework to deal with unseen scenarios.
We successfully prove the design of our proposed PAD solution in a step-wise ablation study.
arXiv Detail & Related papers (2021-09-16T13:06:43Z) - Robust Self-Ensembling Network for Hyperspectral Image Classification [38.84831094095329]
We propose a robust self-ensembling network (RSEN) to address this problem.
The proposed RSEN consists of twoworks including a base network and an ensemble network.
We show that the proposed algorithm can yield competitive performance compared with the state-of-the-art methods.
arXiv Detail & Related papers (2021-04-08T13:33:14Z) - Factors of Influence for Transfer Learning across Diverse Appearance
Domains and Task Types [50.1843146606122]
A simple form of transfer learning is common in current state-of-the-art computer vision models.
Previous systematic studies of transfer learning have been limited and the circumstances in which it is expected to work are not fully understood.
In this paper we carry out an extensive experimental exploration of transfer learning across vastly different image domains.
arXiv Detail & Related papers (2021-03-24T16:24:20Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.