Progressive Cross Attention Network for Flood Segmentation using Multispectral Satellite Imagery
- URL: http://arxiv.org/abs/2501.11923v1
- Date: Tue, 21 Jan 2025 06:55:31 GMT
- Title: Progressive Cross Attention Network for Flood Segmentation using Multispectral Satellite Imagery
- Authors: Vicky Feliren, Fithrothul Khikmah, Irfan Dwiki Bhaswara, Bahrul I. Nasution, Alex M. Lechner, Muhamad Risqi U. Saputra,
- Abstract summary: ProCANet is a deep learning model that progressively applies both self- and cross-attention mechanisms to multispectral features.
Our model demonstrated superior performance with the highest Intersection over Union (IoU) score of 0.815.
- Score: 0.0
- License:
- Abstract: In recent years, the integration of deep learning techniques with remote sensing technology has revolutionized the way natural hazards, such as floods, are monitored and managed. However, existing methods for flood segmentation using remote sensing data often overlook the utility of correlative features among multispectral satellite information. In this study, we introduce a progressive cross attention network (ProCANet), a deep learning model that progressively applies both self- and cross-attention mechanisms to multispectral features, generating optimal feature combinations for flood segmentation. The proposed model was compared with state-of-the-art approaches using Sen1Floods11 dataset and our bespoke flood data generated for the Citarum River basin, Indonesia. Our model demonstrated superior performance with the highest Intersection over Union (IoU) score of 0.815. Our results in this study, coupled with the ablation assessment comparing scenarios with and without attention across various modalities, opens a promising path for enhancing the accuracy of flood analysis using remote sensing technology.
Related papers
- Nearshore Underwater Target Detection Meets UAV-borne Hyperspectral Remote Sensing: A Novel Hybrid-level Contrastive Learning Framework and Benchmark Dataset [6.832892497880372]
UAV-borne hyperspectral remote sensing has emerged as a promising approach for underwater target detection (UTD)
It is hindered by spectral distortions in nearshore environments, which compromise the accuracy of traditional hyperspectral UTD methods.
We propose a novel framework that integrates contrastive learning with a self-paced learning paradigm for robust HUTD in nearshore regions.
arXiv Detail & Related papers (2025-02-20T12:25:30Z) - Evaluation of Deep Learning Semantic Segmentation for Land Cover Mapping on Multispectral, Hyperspectral and High Spatial Aerial Imagery [0.0]
In the rise of climate change, land cover mapping has become such an urgent need in environmental monitoring.
This research implemented a semantic segmentation method such as Unet, Linknet, FPN, and PSPnet for categorizing vegetation, water, and others.
The LinkNet model obtained high accuracy in IoU at 0.92 in all datasets, which is comparable with other mentioned techniques.
arXiv Detail & Related papers (2024-06-20T11:40:12Z) - SatSynth: Augmenting Image-Mask Pairs through Diffusion Models for Aerial Semantic Segmentation [69.42764583465508]
We explore the potential of generative image diffusion to address the scarcity of annotated data in earth observation tasks.
To the best of our knowledge, we are the first to generate both images and corresponding masks for satellite segmentation.
arXiv Detail & Related papers (2024-03-25T10:30:22Z) - SIRAN: Sinkhorn Distance Regularized Adversarial Network for DEM
Super-resolution using Discriminative Spatial Self-attention [5.178465447325005]
Digital Elevation Model (DEM) is an essential aspect in the remote sensing domain to analyze and explore different applications related to surface elevation information.
In this study, we intend to address the generation of high-resolution DEMs using high-resolution multi-spectral (MX) satellite imagery.
We present an objective function related to optimizing Sinkhorn distance with traditional GAN to improve the stability of adversarial learning.
arXiv Detail & Related papers (2023-11-27T12:03:22Z) - Scaling Data Generation in Vision-and-Language Navigation [116.95534559103788]
We propose an effective paradigm for generating large-scale data for learning.
We apply 1200+ photo-realistic environments from HM3D and Gibson datasets and synthesizes 4.9 million instruction trajectory pairs.
Thanks to our large-scale dataset, the performance of an existing agent can be pushed up (+11% absolute with regard to previous SoTA) to a significantly new best of 80% single-run success rate on the R2R test split by simple imitation learning.
arXiv Detail & Related papers (2023-07-28T16:03:28Z) - DAM-Net: Global Flood Detection from SAR Imagery Using Differential
Attention Metric-Based Vision Transformers [22.885444177106873]
Detection of flooded areas using high-resolution synthetic aperture radar (SAR) imagery is a critical task with applications in crisis and disaster management.
To address this issue, we propose a novel differential attention metric-based network (DAM-Net) in this study.
The DAM-Net comprises two key components: a weight-sharing Siamese backbone to obtain multi-scale change features of multi-temporal images and tokens containing high-level semantic information of water-body changes.
arXiv Detail & Related papers (2023-06-01T14:12:33Z) - Semantic Segmentation of Vegetation in Remote Sensing Imagery Using Deep
Learning [77.34726150561087]
We propose an approach for creating a multi-modal and large-temporal dataset comprised of publicly available Remote Sensing data.
We use Convolutional Neural Networks (CNN) models that are capable of separating different classes of vegetation.
arXiv Detail & Related papers (2022-09-28T18:51:59Z) - Towards Daily High-resolution Inundation Observations using Deep
Learning and EO [0.0]
Constantly remote sensing presents a cost-effective solution for synoptic flood monitoring.
Satellites do offer timely inundation information when they cover an ongoing flood event, but they are limited by their resolution in terms of their ability to monitor flood evolution at various scales.
Data from satellites, such as the Copernicus Sentinels, which have high spatial and low temporal resolution, with data from NASA SMAP and GPM missions could yield high-resolution flood inundation at a daily scale.
arXiv Detail & Related papers (2022-08-10T14:04:50Z) - MD-CSDNetwork: Multi-Domain Cross Stitched Network for Deepfake
Detection [80.83725644958633]
Current deepfake generation methods leave discriminative artifacts in the frequency spectrum of fake images and videos.
We present a novel approach, termed as MD-CSDNetwork, for combining the features in the spatial and frequency domains to mine a shared discriminative representation.
arXiv Detail & Related papers (2021-09-15T14:11:53Z) - A SAR speckle filter based on Residual Convolutional Neural Networks [68.8204255655161]
This work aims to present a novel method for filtering the speckle noise from Sentinel-1 data by applying Deep Learning (DL) algorithms, based on Convolutional Neural Networks (CNNs)
The obtained results, if compared with the state of the art, show a clear improvement in terms of Peak Signal-to-Noise Ratio (PSNR) and Structural Similarity Index (SSIM)
arXiv Detail & Related papers (2021-04-19T14:43:07Z) - Channelized Axial Attention for Semantic Segmentation [70.14921019774793]
We propose the Channelized Axial Attention (CAA) to seamlessly integratechannel attention and axial attention with reduced computationalcomplexity.
Our CAA not onlyrequires much less computation resources compared with otherdual attention models such as DANet, but also outperforms the state-of-the-art ResNet-101-based segmentation models on alltested datasets.
arXiv Detail & Related papers (2021-01-19T03:08:03Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.