Spatial--spectral FFPNet: Attention-Based Pyramid Network for
Segmentation and Classification of Remote Sensing Images
- URL: http://arxiv.org/abs/2008.08775v1
- Date: Thu, 20 Aug 2020 04:55:34 GMT
- Title: Spatial--spectral FFPNet: Attention-Based Pyramid Network for
Segmentation and Classification of Remote Sensing Images
- Authors: Qingsong Xu, Xin Yuan, Chaojun Ouyang, Yue Zeng
- Abstract summary: In this study, we develop an attention-based pyramid network for segmentation and classification of remote sensing datasets.
Experiments conducted on ISPRS Vaihingen and ISPRS Potsdam high-resolution datasets demonstrate the competitive segmentation accuracy achieved by the proposed heavy-weight spatial FFPNet.
- Score: 12.320585790097415
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: We consider the problem of segmentation and classification of high-resolution
and hyperspectral remote sensing images. Unlike conventional natural (RGB)
images, the inherent large scale and complex structures of remote sensing
images pose major challenges such as spatial object distribution diversity and
spectral information extraction when existing models are directly applied for
image classification. In this study, we develop an attention-based pyramid
network for segmentation and classification of remote sensing datasets.
Attention mechanisms are used to develop the following modules: i) a novel and
robust attention-based multi-scale fusion method effectively fuses useful
spatial or spectral information at different and same scales; ii) a region
pyramid attention mechanism using region-based attention addresses the target
geometric size diversity in large-scale remote sensing images; and iii
cross-scale attention} in our adaptive atrous spatial pyramid pooling network
adapts to varied contents in a feature-embedded space. Different forms of
feature fusion pyramid frameworks are established by combining these
attention-based modules. First, a novel segmentation framework, called the
heavy-weight spatial feature fusion pyramid network (FFPNet), is proposed to
address the spatial problem of high-resolution remote sensing images. Second,
an end-to-end spatial--spectral FFPNet is presented for classifying
hyperspectral images. Experiments conducted on ISPRS Vaihingen and ISPRS
Potsdam high-resolution datasets demonstrate the competitive segmentation
accuracy achieved by the proposed heavy-weight spatial FFPNet. Furthermore,
experiments on the Indian Pines and the University of Pavia hyperspectral
datasets indicate that the proposed spatial--spectral FFPNet outperforms the
current state-of-the-art methods in hyperspectral image classification.
Related papers
- Unsupervised Hyperspectral and Multispectral Image Blind Fusion Based on Deep Tucker Decomposition Network with Spatial-Spectral Manifold Learning [15.86617273658407]
We propose an unsupervised blind fusion method for hyperspectral and multispectral images based on Tucker decomposition and spatial spectral manifold learning (DTDNML)
We show that this method enhances the accuracy and efficiency of hyperspectral and multispectral fusion on different remote sensing datasets.
arXiv Detail & Related papers (2024-09-15T08:58:26Z) - MDFL: Multi-domain Diffusion-driven Feature Learning [19.298491870280213]
We present a multi-domain diffusion-driven feature learning network (MDFL)
MDFL redefines the effective information domain that the model really focuses on.
We demonstrate that MDFL significantly improves the feature extraction performance of high-dimensional data.
arXiv Detail & Related papers (2023-11-16T02:55:21Z) - Object Detection in Hyperspectral Image via Unified Spectral-Spatial
Feature Aggregation [55.9217962930169]
We present S2ADet, an object detector that harnesses the rich spectral and spatial complementary information inherent in hyperspectral images.
S2ADet surpasses existing state-of-the-art methods, achieving robust and reliable results.
arXiv Detail & Related papers (2023-06-14T09:01:50Z) - SAWU-Net: Spatial Attention Weighted Unmixing Network for Hyperspectral
Images [91.20864037082863]
We propose a spatial attention weighted unmixing network, dubbed as SAWU-Net, which learns a spatial attention network and a weighted unmixing network in an end-to-end manner.
In particular, we design a spatial attention module, which consists of a pixel attention block and a window attention block to efficiently model pixel-based spectral information and patch-based spatial information.
Experimental results on real and synthetic datasets demonstrate the better accuracy and superiority of SAWU-Net.
arXiv Detail & Related papers (2023-04-22T05:22:50Z) - DCN-T: Dual Context Network with Transformer for Hyperspectral Image
Classification [109.09061514799413]
Hyperspectral image (HSI) classification is challenging due to spatial variability caused by complex imaging conditions.
We propose a tri-spectral image generation pipeline that transforms HSI into high-quality tri-spectral images.
Our proposed method outperforms state-of-the-art methods for HSI classification.
arXiv Detail & Related papers (2023-04-19T18:32:52Z) - Deep Posterior Distribution-based Embedding for Hyperspectral Image
Super-resolution [75.24345439401166]
This paper focuses on how to embed the high-dimensional spatial-spectral information of hyperspectral (HS) images efficiently and effectively.
We formulate HS embedding as an approximation of the posterior distribution of a set of carefully-defined HS embedding events.
Then, we incorporate the proposed feature embedding scheme into a source-consistent super-resolution framework that is physically-interpretable.
Experiments over three common benchmark datasets demonstrate that PDE-Net achieves superior performance over state-of-the-art methods.
arXiv Detail & Related papers (2022-05-30T06:59:01Z) - Decoupled-and-Coupled Networks: Self-Supervised Hyperspectral Image
Super-Resolution with Subpixel Fusion [67.35540259040806]
We propose a subpixel-level HS super-resolution framework by devising a novel decoupled-and-coupled network, called DCNet.
As the name suggests, DC-Net first decouples the input into common (or cross-sensor) and sensor-specific components.
We append a self-supervised learning module behind the CSU net by guaranteeing the material consistency to enhance the detailed appearances of the restored HS product.
arXiv Detail & Related papers (2022-05-07T23:40:36Z) - Learning to Aggregate Multi-Scale Context for Instance Segmentation in
Remote Sensing Images [28.560068780733342]
A novel context aggregation network (CATNet) is proposed to improve the feature extraction process.
The proposed model exploits three lightweight plug-and-play modules, namely dense feature pyramid network (DenseFPN), spatial context pyramid ( SCP), and hierarchical region of interest extractor (HRoIE)
arXiv Detail & Related papers (2021-11-22T08:55:25Z) - RRNet: Relational Reasoning Network with Parallel Multi-scale Attention
for Salient Object Detection in Optical Remote Sensing Images [82.1679766706423]
Salient object detection (SOD) for optical remote sensing images (RSIs) aims at locating and extracting visually distinctive objects/regions from the optical RSIs.
We propose a relational reasoning network with parallel multi-scale attention for SOD in optical RSIs.
Our proposed RRNet outperforms the existing state-of-the-art SOD competitors both qualitatively and quantitatively.
arXiv Detail & Related papers (2021-10-27T07:18:32Z) - A Feature Fusion-Net Using Deep Spatial Context Encoder and
Nonstationary Joint Statistical Model for High Resolution SAR Image
Classification [10.152675581771113]
A novel end-to-end supervised classification method is proposed for HR SAR images.
To extract more effective spatial features, a new deep spatial context encoder network (DSCEN) is proposed.
To enhance the diversity of statistics, the nonstationary joint statistical model (NS-JSM) is adopted to form the global statistical features.
arXiv Detail & Related papers (2021-05-11T06:20:14Z) - Feature Pyramid Network with Multi-Head Attention for Se-mantic
Segmentation of Fine-Resolution Remotely Sensed Im-ages [4.869987958751064]
We introduce the Feature Pyramid Net-work (FPN) to bridge the gap between the low-level and high-level features.
We propose the Feature Pyramid Network with Multi-Head Attention (FPN-MHA) for semantic segmentation of fine-resolution remotely sensed images.
arXiv Detail & Related papers (2021-02-16T07:54:19Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.