Adversarial Loss for Semantic Segmentation of Aerial Imagery
- URL: http://arxiv.org/abs/2001.04269v2
- Date: Sat, 18 Jan 2020 10:21:37 GMT
- Title: Adversarial Loss for Semantic Segmentation of Aerial Imagery
- Authors: Clint Sebastian, Raffaele Imbriaco, Egor Bondarev, Peter H.N. de With
- Abstract summary: We propose a novel loss function that learns to understand both local and global contexts for semantic segmentation.
The newly proposed loss function deployed on the DeepLab v3+ network obtains state-of-the-art results on the Massachusetts buildings dataset.
- Score: 12.241693880896348
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Automatic building extraction from aerial imagery has several applications in
urban planning, disaster management, and change detection. In recent years,
several works have adopted deep convolutional neural networks (CNNs) for
building extraction, since they produce rich features that are invariant
against lighting conditions, shadows, etc. Although several advances have been
made, building extraction from aerial imagery still presents multiple
challenges. Most of the deep learning segmentation methods optimize the
per-pixel loss with respect to the ground truth without knowledge of the
context. This often leads to imperfect outputs that may lead to missing or
unrefined regions. In this work, we propose a novel loss function combining
both adversarial and cross-entropy losses that learn to understand both local
and global contexts for semantic segmentation. The newly proposed loss function
deployed on the DeepLab v3+ network obtains state-of-the-art results on the
Massachusetts buildings dataset. The loss function improves the structure and
refines the edges of buildings without requiring any of the commonly used
post-processing methods, such as Conditional Random Fields. We also perform
ablation studies to understand the impact of the adversarial loss. Finally, the
proposed method achieves a relaxed F1 score of 95.59% on the Massachusetts
buildings dataset compared to the previous best F1 of 94.88%.
Related papers
- Deep Loss Convexification for Learning Iterative Models [11.36644967267829]
Iterative methods such as iterative closest point (ICP) for point cloud registration often suffer from bad local optimality.
We propose learning to form a convex landscape around each ground truth.
arXiv Detail & Related papers (2024-11-16T01:13:04Z) - DiAD: A Diffusion-based Framework for Multi-class Anomaly Detection [55.48770333927732]
We propose a Difusion-based Anomaly Detection (DiAD) framework for multi-class anomaly detection.
It consists of a pixel-space autoencoder, a latent-space Semantic-Guided (SG) network with a connection to the stable diffusion's denoising network, and a feature-space pre-trained feature extractor.
Experiments on MVTec-AD and VisA datasets demonstrate the effectiveness of our approach.
arXiv Detail & Related papers (2023-12-11T18:38:28Z) - AugUndo: Scaling Up Augmentations for Monocular Depth Completion and Estimation [51.143540967290114]
We propose a method that unlocks a wide range of previously-infeasible geometric augmentations for unsupervised depth computation and estimation.
This is achieved by reversing, or undo''-ing, geometric transformations to the coordinates of the output depth, warping the depth map back to the original reference frame.
arXiv Detail & Related papers (2023-10-15T05:15:45Z) - Magic ELF: Image Deraining Meets Association Learning and Transformer [63.761812092934576]
This paper aims to unify CNN and Transformer to take advantage of their learning merits for image deraining.
A novel multi-input attention module (MAM) is proposed to associate rain removal and background recovery.
Our proposed method (dubbed as ELF) outperforms the state-of-the-art approach (MPRNet) by 0.25 dB on average.
arXiv Detail & Related papers (2022-07-21T12:50:54Z) - A Contrastive Distillation Approach for Incremental Semantic
Segmentation in Aerial Images [15.75291664088815]
A major issue concerning current deep neural architectures is known as catastrophic forgetting.
We propose a contrastive regularization, where any given input is compared with its augmented version.
We show the effectiveness of our solution on the Potsdam dataset, outperforming the incremental baseline in every test.
arXiv Detail & Related papers (2021-12-07T16:44:45Z) - Generic Perceptual Loss for Modeling Structured Output Dependencies [78.59700528239141]
We show that, what matters is the network structure instead of the trained weights.
We demonstrate that a randomly-weighted deep CNN can be used to model the structured dependencies of outputs.
arXiv Detail & Related papers (2021-03-18T23:56:07Z) - Road Segmentation for Remote Sensing Images using Adversarial Spatial
Pyramid Networks [28.32775611169636]
We introduce a new model to apply structured domain adaption for synthetic image generation and road segmentation.
A novel scale-wise architecture is introduced to learn from the multi-level feature maps and improve the semantics of the features.
Our model achieves state-of-the-art 78.86 IOU on the Massachusetts dataset with 14.89M parameters and 86.78B FLOPs, with 4x fewer FLOPs but higher accuracy (+3.47% IOU)
arXiv Detail & Related papers (2020-08-10T11:00:19Z) - RescueNet: Joint Building Segmentation and Damage Assessment from
Satellite Imagery [83.49145695899388]
RescueNet is a unified model that can simultaneously segment buildings and assess the damage levels to individual buildings and can be trained end-to-end.
RescueNet is tested on the large scale and diverse xBD dataset and achieves significantly better building segmentation and damage classification performance than previous methods.
arXiv Detail & Related papers (2020-04-15T19:52:09Z) - Contextual Pyramid Attention Network for Building Segmentation in Aerial
Imagery [12.241693880896348]
Building extraction from aerial images has several applications in problems such as urban planning, change detection, and disaster management.
We propose to improve building segmentation of different sizes by capturing long-range dependencies using contextual pyramid attention (CPA)
Our method improves 1.8 points over current state-of-the-art methods and 12.6 points higher than existing baselines without any post-processing.
arXiv Detail & Related papers (2020-04-15T11:36:26Z) - Image Fine-grained Inpainting [89.17316318927621]
We present a one-stage model that utilizes dense combinations of dilated convolutions to obtain larger and more effective receptive fields.
To better train this efficient generator, except for frequently-used VGG feature matching loss, we design a novel self-guided regression loss.
We also employ a discriminator with local and global branches to ensure local-global contents consistency.
arXiv Detail & Related papers (2020-02-07T03:45:25Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.