Greenhouse Segmentation on High-Resolution Optical Satellite Imagery
using Deep Learning Techniques
- URL: http://arxiv.org/abs/2007.11222v1
- Date: Wed, 22 Jul 2020 06:12:57 GMT
- Title: Greenhouse Segmentation on High-Resolution Optical Satellite Imagery
using Deep Learning Techniques
- Authors: Orkhan Baghirli, Imran Ibrahimli, and Tarlan Mammadzada
- Abstract summary: This paper proposes a sound methodology for pixel-wise classification on images acquired by the Azersky (SPOT-7) optical satellite.
customized variations of U-Net-like architectures are employed to identify greenhouses.
Two models are proposed which uniquely incorporate dilated convolutions and skip connections.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Greenhouse segmentation has pivotal importance for climate-smart agricultural
land-use planning. Deep learning-based approaches provide state-of-the-art
performance in natural image segmentation. However, semantic segmentation on
high-resolution optical satellite imagery is a challenging task because of the
complex environment. In this paper, a sound methodology is proposed for
pixel-wise classification on images acquired by the Azersky (SPOT-7) optical
satellite. In particular, customized variations of U-Net-like architectures are
employed to identify greenhouses. Two models are proposed which uniquely
incorporate dilated convolutions and skip connections, and the results are
compared to that of the baseline U-Net model. The dataset used consists of
pan-sharpened orthorectified Azersky images (red, green, blue,and near infrared
channels) with 1.5-meter resolution and annotation masks, collected from 15
regions in Azerbaijan where the greenhouses are densely congested. The images
cover the cumulative area of 1008 $km^2$ and annotation masks contain 47559
polygons in total. The $F_1, Kappa, AUC$, and $IOU$ scores are used for
performance evaluation. It is observed that the use of the deconvolutional
layers alone throughout the expansive path does not yield satisfactory results;
therefore, they are either replaced or coupled with bilinear interpolation. All
models benefit from the hard example mining (HEM) strategy. It is also reported
that the best accuracy of $93.29\%$ ($F_1\,score$) is recorded when the
weighted binary cross-entropy loss is coupled with the dice loss. Experimental
results showed that both of the proposed models outperformed the baseline U-Net
architecture such that the best model proposed scored $4.48\%$ higher in
comparison to the baseline architecture.
Related papers
- A Simple and Generalist Approach for Panoptic Segmentation [57.94892855772925]
Generalist vision models aim for one and the same architecture for a variety of vision tasks.
While such shared architecture may seem attractive, generalist models tend to be outperformed by their bespoken counterparts.
We address this problem by introducing two key contributions, without compromising the desirable properties of generalist models.
arXiv Detail & Related papers (2024-08-29T13:02:12Z) - Evaluation of Deep Learning Semantic Segmentation for Land Cover Mapping on Multispectral, Hyperspectral and High Spatial Aerial Imagery [0.0]
In the rise of climate change, land cover mapping has become such an urgent need in environmental monitoring.
This research implemented a semantic segmentation method such as Unet, Linknet, FPN, and PSPnet for categorizing vegetation, water, and others.
The LinkNet model obtained high accuracy in IoU at 0.92 in all datasets, which is comparable with other mentioned techniques.
arXiv Detail & Related papers (2024-06-20T11:40:12Z) - Combining UPerNet and ConvNeXt for Contrails Identification to reduce
Global Warming [0.0]
This study focuses on aircraft contrail detection in global satellite images to improve contrail models and mitigate their impact on climate change.
An innovative data preprocessing technique for NOAA GOES-16 satellite images is developed, using temperature data from the infrared channel to create false-color images, enhancing model perception.
To tackle class imbalance, the training dataset exclusively includes images with positive contrail labels.
arXiv Detail & Related papers (2023-10-07T13:59:05Z) - Enhanced Sharp-GAN For Histopathology Image Synthesis [63.845552349914186]
Histopathology image synthesis aims to address the data shortage issue in training deep learning approaches for accurate cancer detection.
We propose a novel approach that enhances the quality of synthetic images by using nuclei topology and contour regularization.
The proposed approach outperforms Sharp-GAN in all four image quality metrics on two datasets.
arXiv Detail & Related papers (2023-01-24T17:54:01Z) - A Strategy Optimized Pix2pix Approach for SAR-to-Optical Image
Translation Task [0.9176056742068814]
This report summarizes the analysis and approach on the image-to-image translation task in the Multimodal Learning for Earth and Environment Challenge (MultiEarth 2022)
In terms of strategy optimization, cloud classification is utilized to filter optical images with dense cloud coverage to aid the supervised learning alike approach.
The results indicate great potential towards SAR-to-optical translation in remote sensing tasks, specifically for the support of long-term environmental monitoring and protection.
arXiv Detail & Related papers (2022-06-27T04:41:37Z) - Beyond Cross-view Image Retrieval: Highly Accurate Vehicle Localization
Using Satellite Image [91.29546868637911]
This paper addresses the problem of vehicle-mounted camera localization by matching a ground-level image with an overhead-view satellite map.
The key idea is to formulate the task as pose estimation and solve it by neural-net based optimization.
Experiments on standard autonomous vehicle localization datasets have confirmed the superiority of the proposed method.
arXiv Detail & Related papers (2022-04-10T19:16:58Z) - Sci-Net: a Scale Invariant Model for Building Detection from Aerial
Images [0.0]
We propose a Scale-invariant neural network (Sci-Net) that is able to segment buildings present in aerial images at different spatial resolutions.
Specifically, we modified the U-Net architecture and fused it with dense Atrous Spatial Pyramid Pooling (ASPP) to extract fine-grained multi-scale representations.
arXiv Detail & Related papers (2021-11-12T16:45:20Z) - Spatial-Separated Curve Rendering Network for Efficient and
High-Resolution Image Harmonization [59.19214040221055]
We propose a novel spatial-separated curve rendering network (S$2$CRNet) for efficient and high-resolution image harmonization.
The proposed method reduces more than 90% parameters compared with previous methods.
Our method can work smoothly on higher resolution images in real-time which is more than 10$times$ faster than the existing methods.
arXiv Detail & Related papers (2021-09-13T07:20:16Z) - Aggregated Contextual Transformations for High-Resolution Image
Inpainting [57.241749273816374]
We propose an enhanced GAN-based model, named Aggregated COntextual-Transformation GAN (AOT-GAN) for high-resolution image inpainting.
To enhance context reasoning, we construct the generator of AOT-GAN by stacking multiple layers of a proposed AOT block.
For improving texture synthesis, we enhance the discriminator of AOT-GAN by training it with a tailored mask-prediction task.
arXiv Detail & Related papers (2021-04-03T15:50:17Z) - Road Segmentation for Remote Sensing Images using Adversarial Spatial
Pyramid Networks [28.32775611169636]
We introduce a new model to apply structured domain adaption for synthetic image generation and road segmentation.
A novel scale-wise architecture is introduced to learn from the multi-level feature maps and improve the semantics of the features.
Our model achieves state-of-the-art 78.86 IOU on the Massachusetts dataset with 14.89M parameters and 86.78B FLOPs, with 4x fewer FLOPs but higher accuracy (+3.47% IOU)
arXiv Detail & Related papers (2020-08-10T11:00:19Z) - Image Fine-grained Inpainting [89.17316318927621]
We present a one-stage model that utilizes dense combinations of dilated convolutions to obtain larger and more effective receptive fields.
To better train this efficient generator, except for frequently-used VGG feature matching loss, we design a novel self-guided regression loss.
We also employ a discriminator with local and global branches to ensure local-global contents consistency.
arXiv Detail & Related papers (2020-02-07T03:45:25Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.