DAM-Net: Global Flood Detection from SAR Imagery Using Differential
Attention Metric-Based Vision Transformers
- URL: http://arxiv.org/abs/2306.00704v1
- Date: Thu, 1 Jun 2023 14:12:33 GMT
- Title: DAM-Net: Global Flood Detection from SAR Imagery Using Differential
Attention Metric-Based Vision Transformers
- Authors: Tamer Saleh, Xingxing Weng, Shimaa Holail, Chen Hao and Gui-Song Xia
- Abstract summary: Detection of flooded areas using high-resolution synthetic aperture radar (SAR) imagery is a critical task with applications in crisis and disaster management.
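To reduce the overestimation of flood extent caused by the complex nature of SAR images, the paper proposes a differential attention metric-based network (DAM-Net).
DAM-Net comprises two key components: a weight-sharing Siamese backbone that obtains multi-scale change features of multi-temporal images and tokens containing high-level semantic information of water-body changes, and a temporal differential fusion (TDF) module that integrates the tokens and change features to generate flood maps with reduced speckle noise.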
- Score: 22.885444177106873
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: The detection of flooded areas using high-resolution synthetic aperture radar
(SAR) imagery is a critical task with applications in crisis and disaster
management, as well as environmental resource planning. However, the complex
nature of SAR images presents a challenge that often leads to an overestimation
of the flood extent. To address this issue, we propose a novel differential
attention metric-based network (DAM-Net) in this study. The DAM-Net comprises
two key components: a weight-sharing Siamese backbone to obtain multi-scale
change features of multi-temporal images and tokens containing high-level
semantic information of water-body changes, and a temporal differential fusion
(TDF) module that integrates semantic tokens and change features to generate
flood maps with reduced speckle noise. Specifically, the backbone is split into
multiple stages. In each stage, we design three modules, namely, temporal-wise
feature extraction (TWFE), cross-temporal change attention (CTCA), and
temporal-aware change enhancement (TACE), to effectively extract the change
features. In TACE of the last stage, we introduce a class token to record
high-level semantic information of water-body changes via the attention
mechanism. Another challenge faced by data-driven deep learning algorithms is
the limited availability of flood detection datasets. To overcome this, we have
created the S1GFloods open-source dataset, a global-scale high-resolution
Sentinel-1 SAR image pairs dataset covering 46 global flood events between 2015
and 2022. The experiments on the S1GFloods dataset using the proposed DAM-Net
showed top results compared to state-of-the-art methods in terms of overall
accuracy, F1-score, and IoU, which reached 97.8%, 96.5%, and 93.2%,
respectively. Our dataset and code will be available online at
https://github.com/Tamer-Saleh/S1GFlood-Detection.
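The three scores quoted in the abstract (overall accuracy, F1-score, and IoU) can be illustrated with a minimal sketch. It assumes binary masks where 1 marks flooded pixels and that the scores are computed for the flood class. The function name `flood_metrics` and the toy masks are hypothetical and are not taken from the authors' code.

```python
def flood_metrics(pred, truth):
    """Return (overall accuracy, F1, IoU) for two equal-length binary masks.

    Assumption: 1 = flooded pixel, 0 = background; masks are flat sequences.
    """
    if len(pred) != len(truth):
        raise ValueError("masks must have the same number of pixels")
    if len(pred) == 0:
        raise ValueError("masks must not be empty")

    tp = fp = fn = tn = 0
    for p, t in zip(pred, truth):
        if p == 1 and t == 1:
            tp += 1  # flood predicted, flood present
        elif p == 1 and t == 0:
            fp += 1  # flood predicted, no flood present (overestimation)
        elif p == 0 and t == 1:
            fn += 1  # flood missed
        else:
            tn += 1  # background correctly predicted

    overall_accuracy = (tp + tn) / (tp + fp + fn + tn)
    # Guard against masks that contain no flood pixels at all.
    f1 = 2 * tp / (2 * tp + fp + fn) if (2 * tp + fp + fn) else 0.0
    iou = tp / (tp + fp + fn) if (tp + fp + fn) else 0.0
    return overall_accuracy, f1, iou


# Toy example: 8 pixels, flattened.
pred = [1, 1, 0, 0, 1, 0, 1, 0]
truth = [1, 0, 0, 0, 1, 1, 1, 0]
oa, f1, iou = flood_metrics(pred, truth)
print(f"OA={oa:.3f} F1={f1:.3f} IoU={iou:.3f}")
```

For a single class, F1 and IoU are linked by F1 = 2·IoU / (1 + IoU). An IoU of 0.932 gives an F1 of about 0.965, which is consistent with the figures reported in the abstract.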
Related papers
- Deep Multimodal Fusion for Semantic Segmentation of Remote Sensing Earth Observation Data [0.08192907805418582]
This paper proposes a late fusion deep learning model (LF-DLM) for semantic segmentation.
One branch integrates detailed textures from aerial imagery captured by UNetFormer with a Multi-Axis Vision Transformer (MaxViT) backbone.
The other branch captures complex spatio-temporal dynamics from the Sentinel-2 satellite image time series using a U-Net with Temporal Attention Encoder (U-TAE).
arXiv Detail & Related papers (2024-10-01T07:50:37Z)
- Wavelet-based Bi-dimensional Aggregation Network for SAR Image Change Detection [53.842568573251214]
Experimental results on three SAR datasets demonstrate that our WBANet significantly outperforms contemporary state-of-the-art methods.
Our WBANet achieves a percentage of correct classification (PCC) of 98.33%, 96.65%, and 96.62% on the respective datasets.
arXiv Detail & Related papers (2024-07-18T04:36:10Z)
- SOOD++: Leveraging Unlabeled Data to Boost Oriented Object Detection [59.868772767818975]
We propose a simple yet effective Semi-supervised Oriented Object Detection method termed SOOD++.
Specifically, we observe that objects in aerial images usually exhibit arbitrary orientations, small scales, and aggregation.
Extensive experiments conducted on various multi-oriented object datasets under various labeled settings demonstrate the effectiveness of our method.
arXiv Detail & Related papers (2024-07-01T07:03:51Z)
- SARDet-100K: Towards Open-Source Benchmark and ToolKit for Large-Scale SAR Object Detection [79.23689506129733]
We establish a new benchmark dataset and an open-source method for large-scale SAR object detection.
Our dataset, SARDet-100K, is a result of intense surveying, collecting, and standardizing 10 existing SAR detection datasets.
To the best of our knowledge, SARDet-100K is the first COCO-level large-scale multi-class SAR object detection dataset ever created.
arXiv Detail & Related papers (2024-03-11T09:20:40Z)
- AMANet: Advancing SAR Ship Detection with Adaptive Multi-Hierarchical
Attention Network [0.5437298646956507]
A novel adaptive multi-hierarchical attention module (AMAM) is proposed to learn multi-scale features and adaptively aggregate salient features from various feature layers.
We first fuse information from adjacent feature layers to enhance the detection of smaller targets, thereby achieving multi-scale feature enhancement.
Thirdly, we present a novel adaptive multi-hierarchical attention network (AMANet) by embedding the AMAM between the backbone network and the feature pyramid network.
arXiv Detail & Related papers (2024-01-24T03:56:33Z)
- DiAD: A Diffusion-based Framework for Multi-class Anomaly Detection [55.48770333927732]
We propose a Diffusion-based Anomaly Detection (DiAD) framework for multi-class anomaly detection.
It consists of a pixel-space autoencoder, a latent-space Semantic-Guided (SG) network with a connection to the stable diffusion's denoising network, and a feature-space pre-trained feature extractor.
Experiments on MVTec-AD and VisA datasets demonstrate the effectiveness of our approach.
arXiv Detail & Related papers (2023-12-11T18:38:28Z)
- Mutual Information-driven Triple Interaction Network for Efficient Image
Dehazing [54.168567276280505]
We propose a novel Mutual Information-driven Triple interaction Network (MITNet) for image dehazing.
The first stage, named amplitude-guided haze removal, aims to recover the amplitude spectrum of the hazy images for haze removal.
The second stage, named phase-guided structure refinement, is devoted to learning the transformation and refinement of the phase spectrum.
arXiv Detail & Related papers (2023-08-14T08:23:58Z)
- Transforming Observations of Ocean Temperature with a Deep Convolutional
Residual Regressive Neural Network [0.0]
Sea surface temperature (SST) is an essential climate variable that can be measured via ground truth, remote sensing, or hybrid model methodologies.
Here, we celebrate SST surveillance progress via the application of a few relevant technological advances from the late 20th and early 21st century.
We develop our existing water cycle observation framework, Flux to Flow (F2F), to fuse AMSR-E and MODIS into a higher resolution product.
Our neural network architecture is constrained to a deep convolutional residual regressive neural network.
arXiv Detail & Related papers (2023-06-16T17:35:11Z)
- Attentive Dual Stream Siamese U-net for Flood Detection on
Multi-temporal Sentinel-1 Data [0.0]
We propose a flood detection network using bi-temporal SAR acquisitions.
The proposed segmentation network has an encoder-decoder architecture with two Siamese encoders for pre- and post-flood images.
The network outperformed the existing state-of-the-art (uni-temporal) flood detection method by 6% IoU.
arXiv Detail & Related papers (2022-04-20T10:56:39Z)
- Dense Attention Fluid Network for Salient Object Detection in Optical
Remote Sensing Images [193.77450545067967]
We propose an end-to-end Dense Attention Fluid Network (DAFNet) for salient object detection in optical remote sensing images (RSIs).
A Global Context-aware Attention (GCA) module is proposed to adaptively capture long-range semantic context relationships.
We construct a new and challenging optical RSI dataset for SOD that contains 2,000 images with pixel-wise saliency annotations.
arXiv Detail & Related papers (2020-11-26T06:14:10Z)
This list is automatically generated from the titles and abstracts of the papers in this site.