A Sentinel-2 multi-year, multi-country benchmark dataset for crop
classification and segmentation with deep learning
- URL: http://arxiv.org/abs/2204.00951v1
- Date: Sat, 2 Apr 2022 23:14:46 GMT
- Title: A Sentinel-2 multi-year, multi-country benchmark dataset for crop
classification and segmentation with deep learning
- Authors: Dimitrios Sykas, Maria Sdraka, Dimitrios Zografakis, Ioannis Papoutsis
- Abstract summary: Sen4AgriNet is a Sentinel-2 based time series multi country benchmark dataset for agricultural monitoring applications.
It is constructed to cover the period 2016-2020 for Catalonia and France, while it can be extended to include additional countries.
It contains 42.5 million parcels, which makes it significantly larger than other available archives.
- Score: 0.716879432974126
- License: http://creativecommons.org/licenses/by-sa/4.0/
- Abstract: In this work we introduce Sen4AgriNet, a Sentinel-2 based time series multi
country benchmark dataset, tailored for agricultural monitoring applications
with Machine and Deep Learning. Sen4AgriNet dataset is annotated from farmer
declarations collected via the Land Parcel Identification System (LPIS) for
harmonizing country wide labels. These declarations have only recently been
made available as open data, allowing for the first time the labeling of
satellite imagery from ground truth data. We proceed to propose and standardise
a new crop type taxonomy across Europe that address Common Agriculture Policy
(CAP) needs, based on the Food and Agriculture Organization (FAO) Indicative
Crop Classification scheme. Sen4AgriNet is the only multi-country, multi-year
dataset that includes all spectral information. It is constructed to cover the
period 2016-2020 for Catalonia and France, while it can be extended to include
additional countries. Currently, it contains 42.5 million parcels, which makes
it significantly larger than other available archives. We extract two
sub-datasets to highlight its value for diverse Deep Learning applications; the
Object Aggregated Dataset (OAD) and the Patches Assembled Dataset (PAD). OAD
capitalizes zonal statistics of each parcel, thus creating a powerful
label-to-features instance for classification algorithms. On the other hand,
PAD structure generalizes the classification problem to parcel extraction and
semantic segmentation and labeling. The PAD and OAD are examined under three
different scenarios to showcase and model the effects of spatial and temporal
variability across different years and different countries.
Related papers
- Fields of The World: A Machine Learning Benchmark Dataset For Global Agricultural Field Boundary Segmentation [12.039406240082515]
Fields of The World (FTW) is a novel benchmark dataset for agricultural field instance segmentation.
FTW is an order of magnitude larger than previous datasets with 70,462 samples.
We show that models trained on FTW have better zero-shot and fine-tuning performance in held-out countries.
arXiv Detail & Related papers (2024-09-24T17:20:58Z) - EuroCropsML: A Time Series Benchmark Dataset For Few-Shot Crop Type Classification [9.670182163018804]
EuroCropsML is an analysis-ready remote sensing machine learning dataset for time series crop type classification of agricultural parcels in Europe.
Based on the open-source EuroCrops collection, EuroCropsML is publicly available on Zenodo.
arXiv Detail & Related papers (2024-07-24T17:50:54Z) - SRFUND: A Multi-Granularity Hierarchical Structure Reconstruction Benchmark in Form Understanding [55.48936731641802]
We present the SRFUND, a hierarchically structured multi-task form understanding benchmark.
SRFUND provides refined annotations on top of the original FUNSD and XFUND datasets.
The dataset includes eight languages including English, Chinese, Japanese, German, French, Spanish, Italian, and Portuguese.
arXiv Detail & Related papers (2024-06-13T02:35:55Z) - Boosting Crop Classification by Hierarchically Fusing Satellite,
Rotational, and Contextual Data [0.0]
We propose a novel approach to fuse multimodal information into a model for improved accuracy and robustness across multiple years and countries.
To evaluate our approach, we release a new annotated dataset of 7.4 million agricultural parcels in France and Netherlands.
arXiv Detail & Related papers (2023-05-19T21:42:53Z) - Navya3DSeg -- Navya 3D Semantic Segmentation Dataset & split generation
for autonomous vehicles [63.20765930558542]
3D semantic data are useful for core perception tasks such as obstacle detection and ego-vehicle localization.
We propose a new dataset, Navya 3D (Navya3DSeg), with a diverse label space corresponding to a large scale production grade operational domain.
It contains 23 labeled sequences and 25 supplementary sequences without labels, designed to explore self-supervised and semi-supervised semantic segmentation benchmarks on point clouds.
arXiv Detail & Related papers (2023-02-16T13:41:19Z) - High-Resolution Satellite Imagery for Modeling the Impact of
Aridification on Crop Production [2.5402662954395097]
We introduce a first-of-its-kind dataset, SICKLE, having time-series images at different spatial resolutions from 3 different satellites.
The dataset comprises of 2,398 season-wise samples from 388 unique plots distributed across 4 districts of the Delta.
We benchmark the dataset on 3 separate tasks, namely crop type, phenology date (sowing, transplanting, harvesting) and yield prediction, and develop an end-to-end framework for predicting key crop parameters in a real-world setting.
arXiv Detail & Related papers (2022-09-25T14:54:50Z) - Tackling Long-Tailed Category Distribution Under Domain Shifts [50.21255304847395]
Existing approaches cannot handle the scenario where both issues exist.
We designed three novel core functional blocks including Distribution Calibrated Classification Loss, Visual-Semantic Mapping and Semantic-Similarity Guided Augmentation.
Two new datasets were proposed for this problem, named AWA2-LTS and ImageNet-LTS.
arXiv Detail & Related papers (2022-07-20T19:07:46Z) - AutoGeoLabel: Automated Label Generation for Geospatial Machine Learning [69.47585818994959]
We evaluate a big data processing pipeline to auto-generate labels for remote sensing data.
We utilize the big geo-data platform IBM PAIRS to dynamically generate such labels in dense urban areas.
arXiv Detail & Related papers (2022-01-31T20:02:22Z) - A hierarchical deep learning framework for the consistent classification
of land use objects in geospatial databases [8.703408520845645]
In this paper, a hierarchical deep learning framework is proposed to verify the land use information.
A new CNN-based method is proposed aiming to predict land use in multiple levels hierarchically and simultaneously.
Experiments show that the CNN relying on JO outperforms previous results, achieving an overall accuracy up to 92.5%.
arXiv Detail & Related papers (2021-04-14T17:16:35Z) - Simple multi-dataset detection [83.9604523643406]
We present a simple method for training a unified detector on multiple large-scale datasets.
We show how to automatically integrate dataset-specific outputs into a common semantic taxonomy.
Our approach does not require manual taxonomy reconciliation.
arXiv Detail & Related papers (2021-02-25T18:55:58Z) - Campus3D: A Photogrammetry Point Cloud Benchmark for Hierarchical
Understanding of Outdoor Scene [76.4183572058063]
We present a richly-annotated 3D point cloud dataset for multiple outdoor scene understanding tasks.
The dataset has been point-wisely annotated with both hierarchical and instance-based labels.
We formulate a hierarchical learning problem for 3D point cloud segmentation and propose a measurement evaluating consistency across various hierarchies.
arXiv Detail & Related papers (2020-08-11T19:10:32Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.