Country-Scale Cropland Mapping in Data-Scarce Settings Using Deep
Learning: A Case Study of Nigeria
- URL: http://arxiv.org/abs/2312.10872v1
- Date: Mon, 18 Dec 2023 01:23:22 GMT
- Title: Country-Scale Cropland Mapping in Data-Scarce Settings Using Deep
Learning: A Case Study of Nigeria
- Authors: Joaquin Gajardo, Michele Volpi, Daniel Onwude and Thijs Defraeye
- Abstract summary: We combine a global cropland dataset and a hand-labeled dataset to train machine learning models for generating a new cropland map for Nigeria in 2020 at 10 m resolution.
We provide the models with pixel-wise time series input data from remote sensing sources such as Sentinel-1 and 2, ERA5 climate data, and DEM data, in addition to binary labels indicating cropland presence.
We find that the existing WorldCover map performs the best with an F1-score of 0.825 and accuracy of 0.870 on the test set, followed by a single-headed LSTM model trained with our hand-labeled training
- Score: 0.6827423171182154
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Cropland maps are a core and critical component of remote-sensing-based
agricultural monitoring, providing dense and up-to-date information about
agricultural development. Machine learning is an effective tool for large-scale
agricultural mapping, but relies on geo-referenced ground-truth data for model
training and testing, which can be scarce or time-consuming to obtain. In this
study, we explore the usefulness of combining a global cropland dataset and a
hand-labeled dataset to train machine learning models for generating a new
cropland map for Nigeria in 2020 at 10 m resolution. We provide the models with
pixel-wise time series input data from remote sensing sources such as
Sentinel-1 and 2, ERA5 climate data, and DEM data, in addition to binary labels
indicating cropland presence. We manually labeled 1827 evenly distributed
pixels across Nigeria, splitting them into 50\% training, 25\% validation, and
25\% test sets used to fit the models and test our output map. We evaluate and
compare the performance of single- and multi-headed Long Short-Term Memory
(LSTM) neural network classifiers, a Random Forest classifier, and three
existing 10 m resolution global land cover maps (Google's Dynamic World, ESRI's
Land Cover, and ESA's WorldCover) on our proposed test set. Given the regional
variations in cropland appearance, we additionally experimented with excluding
or sub-setting the global crowd-sourced Geowiki cropland dataset, to
empirically assess the trade-off between data quantity and data quality in
terms of the similarity to the target data distribution of Nigeria. We find
that the existing WorldCover map performs the best with an F1-score of 0.825
and accuracy of 0.870 on the test set, followed by a single-headed LSTM model
trained with our hand-labeled training samples and the Geowiki data points in
Nigeria, with a F1-score of 0.814 and accuracy of 0.842.
Related papers
- Fields of The World: A Machine Learning Benchmark Dataset For Global Agricultural Field Boundary Segmentation [12.039406240082515]
Fields of The World (FTW) is a novel benchmark dataset for agricultural field instance segmentation.
FTW is an order of magnitude larger than previous datasets with 70,462 samples.
We show that models trained on FTW have better zero-shot and fine-tuning performance in held-out countries.
arXiv Detail & Related papers (2024-09-24T17:20:58Z) - HarvestNet: A Dataset for Detecting Smallholder Farming Activity Using
Harvest Piles and Remote Sensing [50.4506590177605]
HarvestNet is a dataset for mapping the presence of farms in the Ethiopian regions of Tigray and Amhara during 2020-2023.
We introduce a new approach based on the detection of harvest piles characteristic of many smallholder systems.
We conclude that remote sensing of harvest piles can contribute to more timely and accurate cropland assessments in food insecure regions.
arXiv Detail & Related papers (2023-08-23T11:03:28Z) - Exploring the Effectiveness of Dataset Synthesis: An application of
Apple Detection in Orchards [68.95806641664713]
We explore the usability of Stable Diffusion 2.1-base for generating synthetic datasets of apple trees for object detection.
We train a YOLOv5m object detection model to predict apples in a real-world apple detection dataset.
Results demonstrate that the model trained on generated data is slightly underperforming compared to a baseline model trained on real-world images.
arXiv Detail & Related papers (2023-06-20T09:46:01Z) - The Canadian Cropland Dataset: A New Land Cover Dataset for
Multitemporal Deep Learning Classification in Agriculture [0.8602553195689513]
temporal patch-based dataset of Canadian croplands enriched with labels retrieved from the Canadian Annual Crop Inventory.
The dataset contains 78,536 manually verified high-resolution spatial images from 10 crop classes collected over four crop production years.
As a benchmark, we provide models and source code that allow a user to predict the crop class using a single image (ResNet, DenseNet, EfficientNet) or a sequence of images (LRCN, 3D-CNN) from the same location.
arXiv Detail & Related papers (2023-05-31T18:40:15Z) - Navya3DSeg -- Navya 3D Semantic Segmentation Dataset & split generation
for autonomous vehicles [63.20765930558542]
3D semantic data are useful for core perception tasks such as obstacle detection and ego-vehicle localization.
We propose a new dataset, Navya 3D (Navya3DSeg), with a diverse label space corresponding to a large scale production grade operational domain.
It contains 23 labeled sequences and 25 supplementary sequences without labels, designed to explore self-supervised and semi-supervised semantic segmentation benchmarks on point clouds.
arXiv Detail & Related papers (2023-02-16T13:41:19Z) - End-to-end deep learning for directly estimating grape yield from
ground-based imagery [53.086864957064876]
This study demonstrates the application of proximal imaging combined with deep learning for yield estimation in vineyards.
Three model architectures were tested: object detection, CNN regression, and transformer models.
The study showed the applicability of proximal imaging and deep learning for prediction of grapevine yield on a large scale.
arXiv Detail & Related papers (2022-08-04T01:34:46Z) - Strict baselines for Covid-19 forecasting and ML perspective for USA and
Russia [105.54048699217668]
Covid-19 allows researchers to gather datasets accumulated over 2 years and to use them in predictive analysis.
We present the results of a consistent comparative study of different types of methods for predicting the dynamics of the spread of Covid-19 based on regional data for two countries: the United States and Russia.
arXiv Detail & Related papers (2022-07-15T18:21:36Z) - Using Machine Learning to generate an open-access cropland map from
satellite images time series in the Indian Himalayan Region [0.28675177318965034]
We develop an ML pipeline that relies on Sentinel-2 satellite images time series.
We generate a cropland map for three districts of Himachal Pradesh, spanning 14,600 km2, which improves the resolution and quality of existing public maps.
arXiv Detail & Related papers (2022-03-28T12:08:06Z) - Continental-Scale Building Detection from High Resolution Satellite
Imagery [5.56205296867374]
We study variations in architecture, loss functions, regularization, pre-training, self-training and post-processing that increase instance segmentation performance.
Experiments were carried out using a dataset of 100k satellite images across Africa containing 1.75M manually labelled building instances.
We report novel methods for improving performance of building detection with this type of model, including the use of mixup.
arXiv Detail & Related papers (2021-07-26T15:48:14Z) - Dataset Cartography: Mapping and Diagnosing Datasets with Training
Dynamics [118.75207687144817]
We introduce Data Maps, a model-based tool to characterize and diagnose datasets.
We leverage a largely ignored source of information: the behavior of the model on individual instances during training.
Our results indicate that a shift in focus from quantity to quality of data could lead to robust models and improved out-of-distribution generalization.
arXiv Detail & Related papers (2020-09-22T20:19:41Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.