Landslide Detection and Segmentation Using Remote Sensing Images and
Deep Neural Network
- URL: http://arxiv.org/abs/2312.16717v1
- Date: Wed, 27 Dec 2023 20:56:55 GMT
- Title: Landslide Detection and Segmentation Using Remote Sensing Images and
Deep Neural Network
- Authors: Cam Le, Lam Pham, Jasmin Lampert, Matthias Schl\"ogl, Alexander
Schindler
- Abstract summary: Building upon findings from 2022 Landslide4Sense Competition, we propose a deep neural network based system for landslide detection and segmentation.
We use a U-Net trained with Cross Entropy loss as baseline model.
We then improve the U-Net baseline model by leveraging a wide range of deep learning techniques.
- Score: 42.59806784981723
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Knowledge about historic landslide event occurrence is important for
supporting disaster risk reduction strategies. Building upon findings from 2022
Landslide4Sense Competition, we propose a deep neural network based system for
landslide detection and segmentation from multisource remote sensing image
input. We use a U-Net trained with Cross Entropy loss as baseline model. We
then improve the U-Net baseline model by leveraging a wide range of deep
learning techniques. In particular, we conduct feature engineering by
generating new band data from the original bands, which helps to enhance the
quality of remote sensing image input. Regarding the network architecture, we
replace traditional convolutional layers in the U-Net baseline by a
residual-convolutional layer. We also propose an attention layer which
leverages the multi-head attention scheme. Additionally, we generate multiple
output masks with three different resolutions, which creates an ensemble of
three outputs in the inference process to enhance the performance. Finally, we
propose a combined loss function which leverages Focal loss and IoU loss to
train the network. Our experiments on the development set of the
Landslide4Sense challenge achieve an F1 score and an mIoU score of 84.07 and
76.07, respectively. Our best model setup outperforms the challenge baseline
and the proposed U-Net baseline, improving the F1 score/mIoU score by 6.8/7.4
and 10.5/8.8, respectively.
Related papers
- NeRF-DetS: Enhancing Multi-View 3D Object Detection with Sampling-adaptive Network of Continuous NeRF-based Representation [60.47114985993196]
NeRF-Det unifies the tasks of novel view arithmetic and 3D perception.
We introduce a novel 3D perception network structure, NeRF-DetS.
NeRF-DetS outperforms competitive NeRF-Det on the ScanNetV2 dataset.
arXiv Detail & Related papers (2024-04-22T06:59:03Z) - Diffusion-based Data Augmentation for Object Counting Problems [62.63346162144445]
We develop a pipeline that utilizes a diffusion model to generate extensive training data.
We are the first to generate images conditioned on a location dot map with a diffusion model.
Our proposed counting loss for the diffusion model effectively minimizes the discrepancies between the location dot map and the crowd images generated.
arXiv Detail & Related papers (2024-01-25T07:28:22Z) - Active search and coverage using point-cloud reinforcement learning [50.741409008225766]
This paper presents an end-to-end deep reinforcement learning solution for target search and coverage.
We show that deep hierarchical feature learning works for RL and that by using farthest point sampling (FPS) we can reduce the amount of points.
We also show that multi-head attention for point-clouds helps to learn the agent faster but converges to the same outcome.
arXiv Detail & Related papers (2023-12-18T18:16:30Z) - PLADE-Net: Towards Pixel-Level Accuracy for Self-Supervised Single-View
Depth Estimation with Neural Positional Encoding and Distilled Matting Loss [49.66736599668501]
We propose a self-supervised single-view pixel-level accurate depth estimation network, called PLADE-Net.
Our method shows unprecedented accuracy levels, exceeding 95% in terms of the $delta1$ metric on the KITTI dataset.
arXiv Detail & Related papers (2021-03-12T15:54:46Z) - Image Restoration by Deep Projected GSURE [115.57142046076164]
Ill-posed inverse problems appear in many image processing applications, such as deblurring and super-resolution.
We propose a new image restoration framework that is based on minimizing a loss function that includes a "projected-version" of the Generalized SteinUnbiased Risk Estimator (GSURE) and parameterization of the latent image by a CNN.
arXiv Detail & Related papers (2021-02-04T08:52:46Z) - Multi-scale Attention U-Net (MsAUNet): A Modified U-Net Architecture for
Scene Segmentation [1.713291434132985]
We propose a novel multi-scale attention network for scene segmentation by using contextual information from an image.
This network can map local features with their global counterparts with improved accuracy and emphasize on discriminative image regions.
We have evaluated our model on two standard datasets named PascalVOC2012 and ADE20k.
arXiv Detail & Related papers (2020-09-15T08:03:41Z) - Monocular Depth Estimation Using Multi Scale Neural Network And Feature
Fusion [0.0]
Our network uses two different blocks, first which uses different filter sizes for convolution and merges all the individual feature maps.
The second block uses dilated convolutions in place of fully connected layers thus reducing computations and increasing the receptive field.
We train and test our network on Make 3D dataset, NYU Depth V2 dataset and Kitti dataset using standard evaluation metrics for depth estimation comprised of RMSE loss and SILog loss.
arXiv Detail & Related papers (2020-09-11T18:08:52Z) - HRVGAN: High Resolution Video Generation using Spatio-Temporal GAN [0.0]
We present a novel network for high resolution video generation.
Our network uses ideas from Wasserstein GANs by enforcing k-Lipschitz constraint on the loss term and Conditional GANs using class labels for training and testing.
arXiv Detail & Related papers (2020-08-17T20:45:59Z) - DiResNet: Direction-aware Residual Network for Road Extraction in VHR
Remote Sensing Images [12.081877372552606]
We present a direction-aware residual network (DiResNet) that includes three main contributions.
The proposed method has advantages in both overall accuracy and F1-score.
arXiv Detail & Related papers (2020-05-14T19:33:21Z) - Automated Pavement Crack Segmentation Using U-Net-based Convolutional
Neural Network [10.48658033897047]
We propose a deep learning technique based on a convolutional neural network to perform segmentation tasks on pavement crack images.
Our approach requires minimal feature engineering compared to other machine learning techniques.
Our method achieves an F1 score of 96% on the CFD dataset and 73% on the Crack500 dataset, outperforming other algorithms tested on these datasets.
arXiv Detail & Related papers (2020-01-07T07:18:13Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.