EuroCropsML: A Time Series Benchmark Dataset For Few-Shot Crop Type Classification
- URL: http://arxiv.org/abs/2407.17458v1
- Date: Wed, 24 Jul 2024 17:50:54 GMT
- Title: EuroCropsML: A Time Series Benchmark Dataset For Few-Shot Crop Type Classification
- Authors: Joana Reuss, Jan Macdonald, Simon Becker, Lorenz Richter, Marco Körner,
- Abstract summary: EuroCropsML is an analysis-ready remote sensing machine learning dataset for time series crop type classification of agricultural parcels in Europe.
Based on the open-source EuroCrops collection, EuroCropsML is publicly available on Zenodo.
- Score: 9.670182163018804
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: We introduce EuroCropsML, an analysis-ready remote sensing machine learning dataset for time series crop type classification of agricultural parcels in Europe. It is the first dataset designed to benchmark transnational few-shot crop type classification algorithms that supports advancements in algorithmic development and research comparability. It comprises 706 683 multi-class labeled data points across 176 classes, featuring annual time series of per-parcel median pixel values from Sentinel-2 L1C data for 2021, along with crop type labels and spatial coordinates. Based on the open-source EuroCrops collection, EuroCropsML is publicly available on Zenodo.
Related papers
- Fields of The World: A Machine Learning Benchmark Dataset For Global Agricultural Field Boundary Segmentation [12.039406240082515]
Fields of The World (FTW) is a novel benchmark dataset for agricultural field instance segmentation.
FTW is an order of magnitude larger than previous datasets with 70,462 samples.
We show that models trained on FTW have better zero-shot and fine-tuning performance in held-out countries.
arXiv Detail & Related papers (2024-09-24T17:20:58Z) - Boosting Crop Classification by Hierarchically Fusing Satellite,
Rotational, and Contextual Data [0.0]
We propose a novel approach to fuse multimodal information into a model for improved accuracy and robustness across multiple years and countries.
To evaluate our approach, we release a new annotated dataset of 7.4 million agricultural parcels in France and Netherlands.
arXiv Detail & Related papers (2023-05-19T21:42:53Z) - Alibaba-Translate China's Submission for WMT 2022 Quality Estimation
Shared Task [80.22825549235556]
We present our submission to the sentence-level MQM benchmark at Quality Estimation Shared Task, named UniTE.
Specifically, our systems employ the framework of UniTE, which combined three types of input formats during training with a pre-trained language model.
Results show that our models reach 1st overall ranking in the Multilingual and English-Russian settings, and 2nd overall ranking in English-German and Chinese-English settings.
arXiv Detail & Related papers (2022-10-18T08:55:27Z) - FRMT: A Benchmark for Few-Shot Region-Aware Machine Translation [64.9546787488337]
We present FRMT, a new dataset and evaluation benchmark for Few-shot Region-aware Machine Translation.
The dataset consists of professional translations from English into two regional variants each of Portuguese and Mandarin Chinese.
arXiv Detail & Related papers (2022-10-01T05:02:04Z) - Multimodal Crop Type Classification Fusing Multi-Spectral Satellite Time
Series with Farmers Crop Rotations and Local Crop Distribution [0.0]
We propose to tackle a land use and crop type classification task using three data types.
The Accuracy by 5.1 points in a 28-class setting (.948), and the micro-F1 by 9.6 points in a 10-class setting (.887) using only a set of crop of interests selected by an expert.
arXiv Detail & Related papers (2022-08-23T09:41:09Z) - A Sentinel-2 multi-year, multi-country benchmark dataset for crop
classification and segmentation with deep learning [0.716879432974126]
Sen4AgriNet is a Sentinel-2 based time series multi country benchmark dataset for agricultural monitoring applications.
It is constructed to cover the period 2016-2020 for Catalonia and France, while it can be extended to include additional countries.
It contains 42.5 million parcels, which makes it significantly larger than other available archives.
arXiv Detail & Related papers (2022-04-02T23:14:46Z) - Team Cogitat at NeurIPS 2021: Benchmarks for EEG Transfer Learning
Competition [55.34407717373643]
Building subject-independent deep learning models for EEG decoding faces the challenge of strong co-shift.
Our approach is to explicitly align feature distributions at various layers of the deep learning model.
The methodology won first place in the 2021 Benchmarks in EEG Transfer Learning competition, hosted at the NeurIPS conference.
arXiv Detail & Related papers (2022-02-01T11:11:08Z) - AutoGeoLabel: Automated Label Generation for Geospatial Machine Learning [69.47585818994959]
We evaluate a big data processing pipeline to auto-generate labels for remote sensing data.
We utilize the big geo-data platform IBM PAIRS to dynamically generate such labels in dense urban areas.
arXiv Detail & Related papers (2022-01-31T20:02:22Z) - EuroCrops: A Pan-European Dataset for Time Series Crop Type
Classification [0.0]
EuroCrops is a dataset based on self-declared field annotations for training and evaluating methods for crop type classification and mapping.
By this, we aim to enrich the research efforts and discussion for data-driven land cover classification via Earth observation and remote sensing.
arXiv Detail & Related papers (2021-06-14T15:21:50Z) - Explicit Alignment Objectives for Multilingual Bidirectional Encoders [111.65322283420805]
We present a new method for learning multilingual encoders, AMBER (Aligned Multilingual Bi-directional EncodeR)
AMBER is trained on additional parallel data using two explicit alignment objectives that align the multilingual representations at different granularities.
Experimental results show that AMBER obtains gains of up to 1.1 average F1 score on sequence tagging and up to 27.3 average accuracy on retrieval over the XLMR-large model.
arXiv Detail & Related papers (2020-10-15T18:34:13Z) - Semi-Supervised Semantic Segmentation in Earth Observation: The
MiniFrance Suite, Dataset Analysis and Multi-task Network Study [82.02173199363571]
We introduce a novel large-scale dataset for semi-supervised semantic segmentation in Earth Observation, the MiniFrance suite.
MiniFrance has several unprecedented properties: it is large-scale, containing over 2000 very high resolution aerial images, accounting for more than 200 billions samples (pixels)
We present tools for data representativeness analysis in terms of appearance similarity and a thorough study of MiniFrance data, demonstrating that it is suitable for learning and generalizes well in a semi-supervised setting.
arXiv Detail & Related papers (2020-10-15T15:36:58Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.