Geo-Tiles for Semantic Segmentation of Earth Observation Imagery
- URL: http://arxiv.org/abs/2306.00823v2
- Date: Wed, 7 Jun 2023 16:23:59 GMT
- Title: Geo-Tiles for Semantic Segmentation of Earth Observation Imagery
- Authors: Sebastian Bullinger and Florian Fervers and Christoph Bodensteiner and
Michael Arens
- Abstract summary: Existing methods and benchmark datasets rely on pixel-based tiling schemes or on geo-tiling schemes employed by web mapping applications.
We propose a new segmentation pipeline for earth observation imagery relying on a tiling scheme that creates geo-tiles based on heterogeneous data.
This approach exhibits several beneficial properties compared to pixel-based or common web mapping approaches.
We demonstrate how the proposed tiling system allows to improve the results of current state-of-the-art semantic segmentation models.
- Score: 7.49377967268953
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: To cope with the high requirements during the computation of semantic
segmentations of earth observation imagery, current state-of-the-art pipelines
divide the corresponding data into smaller images. Existing methods and
benchmark datasets oftentimes rely on pixel-based tiling schemes or on
geo-tiling schemes employed by web mapping applications. The selection of
subimages (comprising size, location and orientation) is crucial. It affects
the available context information of each pixel, defines the number of tiles
during training, and influences the degree of information degradation while
down- and up-sampling the tile contents to the size required by the
segmentation model. We propose a new segmentation pipeline for earth
observation imagery relying on a tiling scheme that creates geo-tiles based on
the geo-information of the raster data. This approach exhibits several
beneficial properties compared to pixel-based or common web mapping approaches.
The proposed tiling scheme shows flexible customization properties regarding
tile granularity, tile stride and image boundary alignment. This allows us to
perform a tile specific data augmentation during training and a substitution of
pixel predictions with limited context information using data of overlapping
tiles during inference. The generated tiles show a consistent spatial tile
extent w.r.t. heterogeneous sensors, varying recording distances and different
latitudes. We demonstrate how the proposed tiling system allows to improve the
results of current state-of-the-art semantic segmentation models. To foster
future research we make the source code publicly available.
Related papers
- EmerDiff: Emerging Pixel-level Semantic Knowledge in Diffusion Models [52.3015009878545]
We develop an image segmentor capable of generating fine-grained segmentation maps without any additional training.
Our framework identifies semantic correspondences between image pixels and spatial locations of low-dimensional feature maps.
In extensive experiments, the produced segmentation maps are demonstrated to be well delineated and capture detailed parts of the images.
arXiv Detail & Related papers (2024-01-22T07:34:06Z) - OsmLocator: locating overlapping scatter marks with a non-training
generative perspective [48.50108853199417]
Locating overlapping marks faces many difficulties such as no texture, less contextual information, hallow shape and tiny size.
Here, we formulate it as a optimization problem on clustering-based re-visualization from a non-training generative perspective.
We especially built a dataset named 2023 containing hundreds of scatter images with different markers and various levels of overlapping severity, and tested the proposed method and compared it to existing methods.
arXiv Detail & Related papers (2023-12-18T12:39:48Z) - CSP: Self-Supervised Contrastive Spatial Pre-Training for
Geospatial-Visual Representations [90.50864830038202]
We present Contrastive Spatial Pre-Training (CSP), a self-supervised learning framework for geo-tagged images.
We use a dual-encoder to separately encode the images and their corresponding geo-locations, and use contrastive objectives to learn effective location representations from images.
CSP significantly boosts the model performance with 10-34% relative improvement with various labeled training data sampling ratios.
arXiv Detail & Related papers (2023-05-01T23:11:18Z) - Cross-view Geo-localization via Learning Disentangled Geometric Layout
Correspondence [11.823147814005411]
Cross-view geo-localization aims to estimate the location of a query ground image by matching it to a reference geo-tagged aerial images database.
Recent works achieve outstanding progress on cross-view geo-localization benchmarks.
However, existing methods still suffer from poor performance on the cross-area benchmarks.
arXiv Detail & Related papers (2022-12-08T04:54:01Z) - Scalable Self-Supervised Representation Learning from Spatiotemporal
Motion Trajectories for Multimodal Computer Vision [0.0]
We propose a self-supervised, unlabeled method for learning representations of geographic locations from GPS trajectories.
We show that reachability embeddings are semantically meaningful representations and result in 4-23% gain in performance as measured using area under precision-recall curve (AUPRC) metric.
arXiv Detail & Related papers (2022-10-07T02:41:02Z) - Accurate 3-DoF Camera Geo-Localization via Ground-to-Satellite Image
Matching [102.39635336450262]
We address the problem of ground-to-satellite image geo-localization by matching a query image captured at the ground level against a large-scale database with geotagged satellite images.
Our new method is able to achieve the fine-grained location of a query image, up to pixel size precision of the satellite image.
arXiv Detail & Related papers (2022-03-26T20:10:38Z) - BI-GCN: Boundary-Aware Input-Dependent Graph Convolution Network for
Biomedical Image Segmentation [21.912509900254364]
We apply graph convolution into the segmentation task and propose an improved textitLaplacian.
Our method outperforms the state-of-the-art approaches on the segmentation of polyps in colonoscopy images and of the optic disc and optic cup in colour fundus images.
arXiv Detail & Related papers (2021-10-27T21:12:27Z) - Reachability Embeddings: Scalable Self-Supervised Representation
Learning from Markovian Trajectories for Geospatial Computer Vision [0.0]
We propose a self-supervised method for learning representations of geographic locations from unlabeled GPS trajectories.
A scalable and distributed algorithm is presented to compute image-like representations, called reachability summaries.
We show that reachability embeddings are semantically meaningful representations and result in 4-23% gain in performance.
arXiv Detail & Related papers (2021-10-24T20:10:22Z) - Segmentation of VHR EO Images using Unsupervised Learning [19.00071868539993]
We propose an unsupervised semantic segmentation method that can be trained using just a single unlabeled scene.
The proposed method exploits this property to sample smaller patches from the larger scene.
After unsupervised training on the target image/scene, the model automatically segregates the major classes present in the scene and produces the segmentation map.
arXiv Detail & Related papers (2021-07-09T11:42:48Z) - Geography-Aware Self-Supervised Learning [79.4009241781968]
We show that due to their different characteristics, a non-trivial gap persists between contrastive and supervised learning on standard benchmarks.
We propose novel training methods that exploit the spatially aligned structure of remote sensing data.
Our experiments show that our proposed method closes the gap between contrastive and supervised learning on image classification, object detection and semantic segmentation for remote sensing.
arXiv Detail & Related papers (2020-11-19T17:29:13Z) - Spatial Pyramid Based Graph Reasoning for Semantic Segmentation [67.47159595239798]
We apply graph convolution into the semantic segmentation task and propose an improved Laplacian.
The graph reasoning is directly performed in the original feature space organized as a spatial pyramid.
We achieve comparable performance with advantages in computational and memory overhead.
arXiv Detail & Related papers (2020-03-23T12:28:07Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.