SIRAN: Sinkhorn Distance Regularized Adversarial Network for DEM
Super-resolution using Discriminative Spatial Self-attention
- URL: http://arxiv.org/abs/2311.16490v1
- Date: Mon, 27 Nov 2023 12:03:22 GMT
- Title: SIRAN: Sinkhorn Distance Regularized Adversarial Network for DEM
Super-resolution using Discriminative Spatial Self-attention
- Authors: Subhajit Paul, Ashutosh Gupta
- Abstract summary: Digital Elevation Model (DEM) is an essential aspect in the remote sensing domain to analyze and explore different applications related to surface elevation information.
In this study, we intend to address the generation of high-resolution DEMs using high-resolution multi-spectral (MX) satellite imagery.
We present an objective function related to optimizing Sinkhorn distance with traditional GAN to improve the stability of adversarial learning.
- Score: 5.178465447325005
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Digital Elevation Model (DEM) is an essential aspect in the remote sensing
domain to analyze and explore different applications related to surface
elevation information. In this study, we intend to address the generation of
high-resolution DEMs using high-resolution multi-spectral (MX) satellite
imagery by incorporating adversarial learning. To promptly regulate this
process, we utilize the notion of polarized self-attention of discriminator
spatial maps as well as introduce a Densely connected Multi-Residual Block
(DMRB) module to assist in efficient gradient flow. Further, we present an
objective function related to optimizing Sinkhorn distance with traditional GAN
to improve the stability of adversarial learning. In this regard, we provide
both theoretical and empirical substantiation of better performance in terms of
vanishing gradient issues and numerical convergence. We demonstrate both
qualitative and quantitative outcomes with available state-of-the-art methods.
Based on our experiments on DEM datasets of Shuttle Radar Topographic Mission
(SRTM) and Cartosat-1, we show that the proposed model performs preferably
against other learning-based state-of-the-art methods. We also generate and
visualize several high-resolution DEMs covering terrains with diverse
signatures to show the performance of our model.
Related papers
- A Sinkhorn Regularized Adversarial Network for Image Guided DEM Super-resolution using Frequency Selective Hybrid Graph Transformer [4.383449961857098]
Digital Elevation Model (DEM) is an essential aspect in the remote sensing (RS) domain to analyze various applications related to surface elevations.
Here, we address the generation of high-resolution (HR) DEMs using HR multi-spectral (MX) satellite imagery as a guide.
We present a novel adversarial objective related to optimizing Sinkhorn distance with classical GAN.
arXiv Detail & Related papers (2024-09-21T16:59:08Z) - Inter-slice Super-resolution of Magnetic Resonance Images by Pre-training and Self-supervised Fine-tuning [49.197385954021456]
In clinical practice, 2D magnetic resonance (MR) sequences are widely adopted. While individual 2D slices can be stacked to form a 3D volume, the relatively large slice spacing can pose challenges for visualization and subsequent analysis tasks.
To reduce slice spacing, deep-learning-based super-resolution techniques are widely investigated.
Most current solutions require a substantial number of paired high-resolution and low-resolution images for supervised training, which are typically unavailable in real-world scenarios.
arXiv Detail & Related papers (2024-06-10T02:20:26Z) - Hybrid Convolutional and Attention Network for Hyperspectral Image Denoising [54.110544509099526]
Hyperspectral image (HSI) denoising is critical for the effective analysis and interpretation of hyperspectral data.
We propose a hybrid convolution and attention network (HCANet) to enhance HSI denoising.
Experimental results on mainstream HSI datasets demonstrate the rationality and effectiveness of the proposed HCANet.
arXiv Detail & Related papers (2024-03-15T07:18:43Z) - Rotated Multi-Scale Interaction Network for Referring Remote Sensing Image Segmentation [63.15257949821558]
Referring Remote Sensing Image (RRSIS) is a new challenge that combines computer vision and natural language processing.
Traditional Referring Image (RIS) approaches have been impeded by the complex spatial scales and orientations found in aerial imagery.
We introduce the Rotated Multi-Scale Interaction Network (RMSIN), an innovative approach designed for the unique demands of RRSIS.
arXiv Detail & Related papers (2023-12-19T08:14:14Z) - SatDM: Synthesizing Realistic Satellite Image with Semantic Layout
Conditioning using Diffusion Models [0.0]
Denoising Diffusion Probabilistic Models (DDPMs) have demonstrated significant promise in synthesizing realistic images from semantic layouts.
In this paper, a conditional DDPM model capable of taking a semantic map and generating high-quality, diverse, and correspondingly accurate satellite images is implemented.
The effectiveness of our proposed model is validated using a meticulously labeled dataset introduced within the context of this study.
arXiv Detail & Related papers (2023-09-28T19:39:13Z) - Exploiting Temporal Structures of Cyclostationary Signals for
Data-Driven Single-Channel Source Separation [98.95383921866096]
We study the problem of single-channel source separation (SCSS)
We focus on cyclostationary signals, which are particularly suitable in a variety of application domains.
We propose a deep learning approach using a U-Net architecture, which is competitive with the minimum MSE estimator.
arXiv Detail & Related papers (2022-08-22T14:04:56Z) - Aerial Images Meet Crowdsourced Trajectories: A New Approach to Robust
Road Extraction [110.61383502442598]
We introduce a novel neural network framework termed Cross-Modal Message Propagation Network (CMMPNet)
CMMPNet is composed of two deep Auto-Encoders for modality-specific representation learning and a tailor-designed Dual Enhancement Module for cross-modal representation refinement.
Experiments on three real-world benchmarks demonstrate the effectiveness of our CMMPNet for robust road extraction.
arXiv Detail & Related papers (2021-11-30T04:30:10Z) - High-resolution Depth Maps Imaging via Attention-based Hierarchical
Multi-modal Fusion [84.24973877109181]
We propose a novel attention-based hierarchical multi-modal fusion network for guided DSR.
We show that our approach outperforms state-of-the-art methods in terms of reconstruction accuracy, running speed and memory efficiency.
arXiv Detail & Related papers (2021-04-04T03:28:33Z) - AFN: Attentional Feedback Network based 3D Terrain Super-Resolution [5.349223987137843]
We propose a novel fully convolutional neural network-based super-resolution architecture to increase the resolution of low-resolution Digital Elevation Model (LRDEM)
We perform the super-resolution of LRDEM using an attention-based feedback mechanism named 'Attentional Feedback Network' (AFN), which selectively fuses the information from LRDEM and aerial image to enhance and infuse the high-frequency features and to produce the terrain realistically.
arXiv Detail & Related papers (2020-10-04T16:51:39Z) - D-SRGAN: DEM Super-Resolution with Generative Adversarial Networks [0.0]
LIDAR data has been used as the primary source of Digital Elevation Models (DEMs)
DEMs have been used in a variety of applications like road extraction, hydrological modeling, flood mapping, and surface analysis.
Deep learning techniques have become attractive to researchers for their performance in learning features from high-resolution datasets.
arXiv Detail & Related papers (2020-04-09T19:57:49Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.