Semi-Supervised Semantic Segmentation in Earth Observation: The
MiniFrance Suite, Dataset Analysis and Multi-task Network Study
- URL: http://arxiv.org/abs/2010.07830v1
- Date: Thu, 15 Oct 2020 15:36:58 GMT
- Title: Semi-Supervised Semantic Segmentation in Earth Observation: The
MiniFrance Suite, Dataset Analysis and Multi-task Network Study
- Authors: Javiera Castillo-Navarro, Bertrand Le Saux, Alexandre Boulch, Nicolas
Audebert and S\'ebastien Lef\`evre
- Abstract summary: We introduce a novel large-scale dataset for semi-supervised semantic segmentation in Earth Observation, the MiniFrance suite.
MiniFrance has several unprecedented properties: it is large-scale, containing over 2000 very high resolution aerial images, accounting for more than 200 billions samples (pixels)
We present tools for data representativeness analysis in terms of appearance similarity and a thorough study of MiniFrance data, demonstrating that it is suitable for learning and generalizes well in a semi-supervised setting.
- Score: 82.02173199363571
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The development of semi-supervised learning techniques is essential to
enhance the generalization capacities of machine learning algorithms. Indeed,
raw image data are abundant while labels are scarce, therefore it is crucial to
leverage unlabeled inputs to build better models. The availability of large
databases have been key for the development of learning algorithms with high
level performance.
Despite the major role of machine learning in Earth Observation to derive
products such as land cover maps, datasets in the field are still limited,
either because of modest surface coverage, lack of variety of scenes or
restricted classes to identify. We introduce a novel large-scale dataset for
semi-supervised semantic segmentation in Earth Observation, the MiniFrance
suite. MiniFrance has several unprecedented properties: it is large-scale,
containing over 2000 very high resolution aerial images, accounting for more
than 200 billions samples (pixels); it is varied, covering 16 conurbations in
France, with various climates, different landscapes, and urban as well as
countryside scenes; and it is challenging, considering land use classes with
high-level semantics. Nevertheless, the most distinctive quality of MiniFrance
is being the only dataset in the field especially designed for semi-supervised
learning: it contains labeled and unlabeled images in its training partition,
which reproduces a life-like scenario. Along with this dataset, we present
tools for data representativeness analysis in terms of appearance similarity
and a thorough study of MiniFrance data, demonstrating that it is suitable for
learning and generalizes well in a semi-supervised setting. Finally, we present
semi-supervised deep architectures based on multi-task learning and the first
experiments on MiniFrance.
Related papers
- SatSynth: Augmenting Image-Mask Pairs through Diffusion Models for Aerial Semantic Segmentation [69.42764583465508]
We explore the potential of generative image diffusion to address the scarcity of annotated data in earth observation tasks.
To the best of our knowledge, we are the first to generate both images and corresponding masks for satellite segmentation.
arXiv Detail & Related papers (2024-03-25T10:30:22Z) - Biological Valuation Map of Flanders: A Sentinel-2 Imagery Analysis [12.025312586542318]
We present a densely labeled ground truth map of Flanders paired with Sentinel-2 satellite imagery.
Our methodology includes a formalized dataset division and sampling method, utilizing the topographic map layout 'Kaartbladversnijdingen,' and a detailed semantic segmentation model training pipeline.
arXiv Detail & Related papers (2024-01-26T22:21:39Z) - Toulouse Hyperspectral Data Set: a benchmark data set to assess semi-supervised spectral representation learning and pixel-wise classification techniques [0.6384650391969042]
We release in this paper the Toulouse Hyperspectral Data Set to meet key issues in spectral representation learning and classification over large-scale hyperspectral images with very few labeled pixels.
We discuss and experiment self-supervised techniques for spectral representation learning, including the Masked Autoencoder, and establish a baseline for pixel-wise classification achieving 85% overall accuracy and 77% F1 score.
arXiv Detail & Related papers (2023-11-15T10:49:15Z) - FLAIR: a Country-Scale Land Cover Semantic Segmentation Dataset From
Multi-Source Optical Imagery [4.9687851703152806]
We introduce the French Land cover from Aerospace ImageRy (FLAIR), an extensive dataset from the French National Institute of Geographical and Forest Information (IGN)
FLAIR contains high-resolution aerial imagery with a ground sample distance of 20 cm and over 20 billion individually labeled pixels for precise land-cover classification.
The dataset also integrates temporal and spectral data from optical satellite time series.
arXiv Detail & Related papers (2023-10-20T07:55:12Z) - Learning crop type mapping from regional label proportions in
large-scale SAR and optical imagery [9.303156731091532]
This study proposes an online deep clustering method using crop label proportions as priors to learn a sample-level classifier.
We evaluate the method using two large datasets from two different agricultural regions in Brazil.
arXiv Detail & Related papers (2022-08-24T15:23:26Z) - Streaming Self-Training via Domain-Agnostic Unlabeled Images [62.57647373581592]
We present streaming self-training (SST) that aims to democratize the process of learning visual recognition models.
Key to SST are two crucial observations: (1) domain-agnostic unlabeled images enable us to learn better models with a few labeled examples without any additional knowledge or supervision; and (2) learning is a continuous process and can be done by constructing a schedule of learning updates.
arXiv Detail & Related papers (2021-04-07T17:58:39Z) - Campus3D: A Photogrammetry Point Cloud Benchmark for Hierarchical
Understanding of Outdoor Scene [76.4183572058063]
We present a richly-annotated 3D point cloud dataset for multiple outdoor scene understanding tasks.
The dataset has been point-wisely annotated with both hierarchical and instance-based labels.
We formulate a hierarchical learning problem for 3D point cloud segmentation and propose a measurement evaluating consistency across various hierarchies.
arXiv Detail & Related papers (2020-08-11T19:10:32Z) - Unravelling Small Sample Size Problems in the Deep Learning World [69.82853912238173]
We first present a review of deep learning algorithms for small sample size problems in which the algorithms are segregated according to the space in which they operate.
Secondly, we present Dynamic Attention Pooling approach which focuses on extracting global information from the most discriminative sub-part of the feature map.
arXiv Detail & Related papers (2020-08-08T13:35:49Z) - Naive-Student: Leveraging Semi-Supervised Learning in Video Sequences
for Urban Scene Segmentation [57.68890534164427]
In this work, we ask if we may leverage semi-supervised learning in unlabeled video sequences and extra images to improve the performance on urban scene segmentation.
We simply predict pseudo-labels for the unlabeled data and train subsequent models with both human-annotated and pseudo-labeled data.
Our Naive-Student model, trained with such simple yet effective iterative semi-supervised learning, attains state-of-the-art results at all three Cityscapes benchmarks.
arXiv Detail & Related papers (2020-05-20T18:00:05Z) - Very High Resolution Land Cover Mapping of Urban Areas at Global Scale
with Convolutional Neural Networks [0.0]
This paper describes a methodology to produce a 7-classes land cover map of urban areas from very high resolution images and limited noisy labeled data.
We created a training dataset on a few areas of interest aggregating databases, semi-automatic classification, and manual annotation to get a complete ground truth in each class.
The final product is a highly valuable land cover map computed from model predictions stitched together, binarized, and refined before vectorization.
arXiv Detail & Related papers (2020-05-12T10:03:20Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.