Semantic Segmentation of Medium-Resolution Satellite Imagery using
Conditional Generative Adversarial Networks
- URL: http://arxiv.org/abs/2012.03093v1
- Date: Sat, 5 Dec 2020 18:18:45 GMT
- Title: Semantic Segmentation of Medium-Resolution Satellite Imagery using
Conditional Generative Adversarial Networks
- Authors: Aditya Kulkarni, Tharun Mohandoss, Daniel Northrup, Ernest Mwebaze,
Hamed Alemohammad
- Abstract summary: We propose Conditional Generative Adversarial Networks (CGAN) based approach of image-to-image translation for high-resolution satellite imagery.
We find that the CGAN model outperforms the CNN model of similar complexity by a significant margin on an unseen imbalanced test dataset.
- Score: 3.4797121357690153
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Semantic segmentation of satellite imagery is a common approach to identify
patterns and detect changes around the planet. Most of the state-of-the-art
semantic segmentation models are trained in a fully supervised way using
Convolutional Neural Network (CNN). The generalization property of CNN is poor
for satellite imagery because the data can be very diverse in terms of
landscape types, image resolutions, and scarcity of labels for different
geographies and seasons. Hence, the performance of CNN doesn't translate well
to images from unseen regions or seasons. Inspired by Conditional Generative
Adversarial Networks (CGAN) based approach of image-to-image translation for
high-resolution satellite imagery, we propose a CGAN framework for land cover
classification using medium-resolution Sentinel-2 imagery. We find that the
CGAN model outperforms the CNN model of similar complexity by a significant
margin on an unseen imbalanced test dataset.
Related papers
- SatSynth: Augmenting Image-Mask Pairs through Diffusion Models for Aerial Semantic Segmentation [69.42764583465508]
We explore the potential of generative image diffusion to address the scarcity of annotated data in earth observation tasks.
To the best of our knowledge, we are the first to generate both images and corresponding masks for satellite segmentation.
arXiv Detail & Related papers (2024-03-25T10:30:22Z) - Fine-grained Recognition with Learnable Semantic Data Augmentation [68.48892326854494]
Fine-grained image recognition is a longstanding computer vision challenge.
We propose diversifying the training data at the feature-level to alleviate the discriminative region loss problem.
Our method significantly improves the generalization performance on several popular classification networks.
arXiv Detail & Related papers (2023-09-01T11:15:50Z) - DCN-T: Dual Context Network with Transformer for Hyperspectral Image
Classification [109.09061514799413]
Hyperspectral image (HSI) classification is challenging due to spatial variability caused by complex imaging conditions.
We propose a tri-spectral image generation pipeline that transforms HSI into high-quality tri-spectral images.
Our proposed method outperforms state-of-the-art methods for HSI classification.
arXiv Detail & Related papers (2023-04-19T18:32:52Z) - Decoupled Mixup for Generalized Visual Recognition [71.13734761715472]
We propose a novel "Decoupled-Mixup" method to train CNN models for visual recognition.
Our method decouples each image into discriminative and noise-prone regions, and then heterogeneously combines these regions to train CNN models.
Experiment results show the high generalization performance of our method on testing data that are composed of unseen contexts.
arXiv Detail & Related papers (2022-10-26T15:21:39Z) - Activation Regression for Continuous Domain Generalization with
Applications to Crop Classification [48.795866501365694]
Geographic variance in satellite imagery impacts the ability of machine learning models to generalise to new regions.
We model geographic generalisation in medium resolution Landsat-8 satellite imagery as a continuous domain adaptation problem.
We develop a dataset spatially distributed across the entire continental United States.
arXiv Detail & Related papers (2022-04-14T15:41:39Z) - Sci-Net: a Scale Invariant Model for Building Detection from Aerial
Images [0.0]
We propose a Scale-invariant neural network (Sci-Net) that is able to segment buildings present in aerial images at different spatial resolutions.
Specifically, we modified the U-Net architecture and fused it with dense Atrous Spatial Pyramid Pooling (ASPP) to extract fine-grained multi-scale representations.
arXiv Detail & Related papers (2021-11-12T16:45:20Z) - Segmentation of Roads in Satellite Images using specially modified U-Net
CNNs [0.0]
The aim of this paper is to build an image classifier for satellite images of urban scenes that identifies the portions of the images in which a road is located.
Unlike conventional computer vision algorithms, convolutional neural networks (CNNs) provide accurate and reliable results on this task.
arXiv Detail & Related papers (2021-09-29T19:08:32Z) - RSI-Net: Two-Stream Deep Neural Network Integrating GCN and Atrous CNN
for Semantic Segmentation of High-resolution Remote Sensing Images [3.468780866037609]
Two-stream deep neural network for semantic segmentation of remote sensing images (RSI-Net) is proposed in this paper.
Experiments are implemented on the Vaihingen, Potsdam and Gaofen RSI datasets.
Results demonstrate the superior performance of RSI-Net in terms of overall accuracy, F1 score and kappa coefficient when compared with six state-of-the-art RSI semantic segmentation methods.
arXiv Detail & Related papers (2021-09-19T15:57:20Z) - Scale Aware Adaptation for Land-Cover Classification in Remote Sensing
Imagery [4.793219747021116]
Land-cover classification using remote sensing imagery is an important Earth observation task.
The benchmark datasets available for training deep segmentation models in remote sensing imagery tend to be small.
We propose a scale aware adversarial learning framework to perform joint cross-location and cross-scale land-cover classification.
arXiv Detail & Related papers (2020-12-08T05:15:43Z) - Transferring and Regularizing Prediction for Semantic Segmentation [115.88957139226966]
In this paper, we exploit the intrinsic properties of semantic segmentation to alleviate such problem for model transfer.
We present a Regularizer of Prediction Transfer (RPT) that imposes the intrinsic properties as constraints to regularize model transfer in an unsupervised fashion.
Extensive experiments are conducted to verify the proposal of RPT on the transfer of models trained on GTA5 and SYNTHIA (synthetic data) to Cityscapes dataset (urban street scenes)
arXiv Detail & Related papers (2020-06-11T16:19:41Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.