DCN-T: Dual Context Network with Transformer for Hyperspectral Image
Classification
- URL: http://arxiv.org/abs/2304.09915v1
- Date: Wed, 19 Apr 2023 18:32:52 GMT
- Title: DCN-T: Dual Context Network with Transformer for Hyperspectral Image
Classification
- Authors: Di Wang, Jing Zhang, Bo Du, Liangpei Zhang and Dacheng Tao
- Abstract summary: Hyperspectral image (HSI) classification is challenging due to spatial variability caused by complex imaging conditions.
We propose a tri-spectral image generation pipeline that transforms HSI into high-quality tri-spectral images.
Our proposed method outperforms state-of-the-art methods for HSI classification.
- Score: 109.09061514799413
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Hyperspectral image (HSI) classification is challenging due to spatial
variability caused by complex imaging conditions. Prior methods suffer from
limited representation ability, as they train specially designed networks from
scratch on limited annotated data. We propose a tri-spectral image generation
pipeline that transforms HSI into high-quality tri-spectral images, enabling
the use of off-the-shelf ImageNet pretrained backbone networks for feature
extraction. Motivated by the observation that there are many homogeneous areas
with distinguished semantic and geometric properties in HSIs, which can be used
to extract useful contexts, we propose an end-to-end segmentation network named
DCN-T. It adopts transformers to effectively encode regional adaptation and
global aggregation spatial contexts within and between the homogeneous areas
discovered by similarity-based clustering. To fully exploit the rich spectrums
of the HSI, we adopt an ensemble approach where all segmentation results of the
tri-spectral images are integrated into the final prediction through a voting
scheme. Extensive experiments on three public benchmarks show that our proposed
method outperforms state-of-the-art methods for HSI classification.
Related papers
- Superpixel Graph Contrastive Clustering with Semantic-Invariant
Augmentations for Hyperspectral Images [64.72242126879503]
Hyperspectral images (HSI) clustering is an important but challenging task.
We first use 3-D and 2-D hybrid convolutional neural networks to extract the high-order spatial and spectral features of HSI.
We then design a superpixel graph contrastive clustering model to learn discriminative superpixel representations.
arXiv Detail & Related papers (2024-03-04T07:40:55Z) - Multiview Subspace Clustering of Hyperspectral Images based on Graph
Convolutional Networks [12.275530282665578]
This study proposes a multiview subspace clustering of hy-perspectral images (HSI) based on graph convolutional networks.
The model was evaluated on three popular HSI datasets, including Indian Pines, Pavia University, and Houston.
It achieved overall accuracies of 92.38%, 93.43%, and 83.82%, respectively, and significantly outperformed the state-of-the-art clustering methods.
arXiv Detail & Related papers (2024-03-03T10:19:18Z) - Rotated Multi-Scale Interaction Network for Referring Remote Sensing Image Segmentation [63.15257949821558]
Referring Remote Sensing Image (RRSIS) is a new challenge that combines computer vision and natural language processing.
Traditional Referring Image (RIS) approaches have been impeded by the complex spatial scales and orientations found in aerial imagery.
We introduce the Rotated Multi-Scale Interaction Network (RMSIN), an innovative approach designed for the unique demands of RRSIS.
arXiv Detail & Related papers (2023-12-19T08:14:14Z) - Deep Posterior Distribution-based Embedding for Hyperspectral Image
Super-resolution [75.24345439401166]
This paper focuses on how to embed the high-dimensional spatial-spectral information of hyperspectral (HS) images efficiently and effectively.
We formulate HS embedding as an approximation of the posterior distribution of a set of carefully-defined HS embedding events.
Then, we incorporate the proposed feature embedding scheme into a source-consistent super-resolution framework that is physically-interpretable.
Experiments over three common benchmark datasets demonstrate that PDE-Net achieves superior performance over state-of-the-art methods.
arXiv Detail & Related papers (2022-05-30T06:59:01Z) - Two-Stream Graph Convolutional Network for Intra-oral Scanner Image
Segmentation [133.02190910009384]
We propose a two-stream graph convolutional network (i.e., TSGCN) to handle inter-view confusion between different raw attributes.
Our TSGCN significantly outperforms state-of-the-art methods in 3D tooth (surface) segmentation.
arXiv Detail & Related papers (2022-04-19T10:41:09Z) - Hyperspectral Image Segmentation based on Graph Processing over
Multilayer Networks [51.15952040322895]
One important task of hyperspectral image (HSI) processing is the extraction of spectral-spatial features.
We propose several approaches to HSI segmentation based on M-GSP feature extraction.
Our experimental results demonstrate the strength of M-GSP in HSI processing and spectral-spatial information extraction.
arXiv Detail & Related papers (2021-11-29T23:28:18Z) - RSI-Net: Two-Stream Deep Neural Network Integrating GCN and Atrous CNN
for Semantic Segmentation of High-resolution Remote Sensing Images [3.468780866037609]
Two-stream deep neural network for semantic segmentation of remote sensing images (RSI-Net) is proposed in this paper.
Experiments are implemented on the Vaihingen, Potsdam and Gaofen RSI datasets.
Results demonstrate the superior performance of RSI-Net in terms of overall accuracy, F1 score and kappa coefficient when compared with six state-of-the-art RSI semantic segmentation methods.
arXiv Detail & Related papers (2021-09-19T15:57:20Z) - A Feature Fusion-Net Using Deep Spatial Context Encoder and
Nonstationary Joint Statistical Model for High Resolution SAR Image
Classification [10.152675581771113]
A novel end-to-end supervised classification method is proposed for HR SAR images.
To extract more effective spatial features, a new deep spatial context encoder network (DSCEN) is proposed.
To enhance the diversity of statistics, the nonstationary joint statistical model (NS-JSM) is adopted to form the global statistical features.
arXiv Detail & Related papers (2021-05-11T06:20:14Z) - Spatial--spectral FFPNet: Attention-Based Pyramid Network for
Segmentation and Classification of Remote Sensing Images [12.320585790097415]
In this study, we develop an attention-based pyramid network for segmentation and classification of remote sensing datasets.
Experiments conducted on ISPRS Vaihingen and ISPRS Potsdam high-resolution datasets demonstrate the competitive segmentation accuracy achieved by the proposed heavy-weight spatial FFPNet.
arXiv Detail & Related papers (2020-08-20T04:55:34Z) - Hyperspectral Image Classification with Spatial Consistence Using Fully
Convolutional Spatial Propagation Network [9.583523548244683]
Deep convolutional neural networks (CNNs) have shown impressive ability to represent hyperspectral images (HSIs)
We propose a novel end-to-end, pixels-to-pixels fully convolutional spatial propagation network (FCSPN) for HSI classification.
FCSPN consists of a 3D fully convolution network (3D-FCN) and a convolutional spatial propagation network (CSPN)
arXiv Detail & Related papers (2020-08-04T09:05:52Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.