A region-wide, multi-year set of crop field boundary labels for Africa
- URL: http://arxiv.org/abs/2412.18483v1
- Date: Tue, 24 Dec 2024 15:14:58 GMT
- Title: A region-wide, multi-year set of crop field boundary labels for Africa
- Authors: L. D. Estes, A. Wussah, M. Asipunu, M. Gathigi, P. Kovačič, J. Muhando, B. V. Yeboah, F. K. Addai, E. S. Akakpo, M. K. Allotey, P. Amkoya, E. Amponsem, K. D. Donkoh, N. Ha, E. Heltzel, C. Juma, R. Mdawida, A. Miroyo, J. Mucha, J. Mugami, F. Mwawaza, D. A. Nyarko, P. Oduor, K. N. Ohemeng, S. I. D. Segbefia, T. Tumbula, F. Wambua, G. H. Xeflide, S. Ye, F. Yeboah,
- Abstract summary: We delineated field boundaries in 33,746 Planet images captured between 2017 and 2023 across the continent.
Quality metrics showed that label quality was moderately high (0.75) for measures of total field extent, but low regarding the number of individual fields delineated.
This sample provides valuable insight into regional agricultural characteristics, highlighting variations in the median field size and density.
- Score: 0.0
- License:
- Abstract: African agriculture is undergoing rapid transformation. Annual maps of crop fields are key to understanding the nature of this transformation, but such maps are currently lacking and must be developed using advanced machine learning models trained on high resolution remote sensing imagery. To enable the development of such models, we delineated field boundaries in 33,746 Planet images captured between 2017 and 2023 across the continent using a custom labeling platform with built-in procedures for assessing and mitigating label error. We collected 42,403 labels, including 7,204 labels arising from tasks dedicated to assessing label quality (Class 1 labels), 32,167 from sites mapped once by a single labeller (Class 2) and 3,032 labels from sites where 3 or more labellers were tasked to map the same location (Class 4). Class 1 labels were used to calculate labeller-specific quality scores, while Class 1 and 4 sites mapped by at least 3 labellers were used to further evaluate label uncertainty using a Bayesian risk metric. Quality metrics showed that label quality was moderately high (0.75) for measures of total field extent, but low regarding the number of individual fields delineated (0.33), and the position of field edges (0.05). These values are expected when delineating small-scale fields in 3-5 m resolution imagery, which can be too coarse to reliably distinguish smaller fields, particularly in dense croplands, and therefore requires substantial labeller judgement. Nevertheless, previous work shows that such labels can train effective field mapping models. Furthermore, this large, probabilistic sample on its own provides valuable insight into regional agricultural characteristics, highlighting variations in the median field size and density. The imagery and vectorized labels along with quality information is available for download from two public repositories.
Related papers
- SAModified: A Foundation Model-Based Zero-Shot Approach for Refining Noisy Land-Use Land-Cover Maps [2.374912052693646]
Land-use and land cover (LULC) analysis is critical in remote sensing.
automating LULC map generation using machine learning is rendered challenging due to noisy labels.
We propose a zero-shot approach using the foundation model, Segment Anything Model (SAM)
We achieve a significant reduction in label noise and an improvement in the performance of the downstream segmentation model by $approx 5%$ when trained with denoised labels.
arXiv Detail & Related papers (2024-12-17T05:23:00Z) - Weak Labeling for Cropland Mapping in Africa [3.5759681393339697]
Cropland mapping can play a vital role in addressing environmental, agricultural, and food security challenges.
In Africa, practical applications are often hindered by the limited availability of high-resolution cropland maps.
We propose an approach that utilizes unsupervised object clustering to refine existing weak labels.
arXiv Detail & Related papers (2024-01-13T08:45:41Z) - Taking it further: leveraging pseudo labels for field delineation across
label-scarce smallholder regions [2.8120847363433654]
This study explores opportunities of using sparse field delineation pseudo labels for fine-tuning models across geographies and sensor characteristics.
We build on a FracTAL ResUNet trained for crop field delineation in India (median field size of 0.24 ha) and use this pre-trained model to generate pseudo labels in Mozambique.
We then used the human-annotated labels and the pseudo labels for model fine-tuning and compared predictions against human field annotations.
arXiv Detail & Related papers (2023-12-12T08:39:07Z) - HarvestNet: A Dataset for Detecting Smallholder Farming Activity Using
Harvest Piles and Remote Sensing [50.4506590177605]
HarvestNet is a dataset for mapping the presence of farms in the Ethiopian regions of Tigray and Amhara during 2020-2023.
We introduce a new approach based on the detection of harvest piles characteristic of many smallholder systems.
We conclude that remote sensing of harvest piles can contribute to more timely and accurate cropland assessments in food insecure regions.
arXiv Detail & Related papers (2023-08-23T11:03:28Z) - Imprecise Label Learning: A Unified Framework for Learning with Various Imprecise Label Configurations [91.67511167969934]
imprecise label learning (ILL) is a framework for the unification of learning with various imprecise label configurations.
We demonstrate that ILL can seamlessly adapt to partial label learning, semi-supervised learning, noisy label learning, and, more importantly, a mixture of these settings.
arXiv Detail & Related papers (2023-05-22T04:50:28Z) - Bridging the Gap between Model Explanations in Partially Annotated
Multi-label Classification [85.76130799062379]
We study how false negative labels affect the model's explanation.
We propose to boost the attribution scores of the model trained with partial labels to make its explanation resemble that of the model trained with full labels.
arXiv Detail & Related papers (2023-04-04T14:00:59Z) - Heterogeneous Semantic Transfer for Multi-label Recognition with Partial Labels [70.45813147115126]
Multi-label image recognition with partial labels (MLR-PL) may greatly reduce the cost of annotation and thus facilitate large-scale MLR.
We find that strong semantic correlations exist within each image and across different images.
These correlations can help transfer the knowledge possessed by the known labels to retrieve the unknown labels.
arXiv Detail & Related papers (2022-05-23T08:37:38Z) - Semantic-Aware Representation Blending for Multi-Label Image Recognition
with Partial Labels [86.17081952197788]
We propose to blend category-specific representation across different images to transfer information of known labels to complement unknown labels.
Experiments on the MS-COCO, Visual Genome, Pascal VOC 2007 datasets show that the proposed SARB framework obtains superior performance over current leading competitors.
arXiv Detail & Related papers (2022-03-04T07:56:16Z) - Semi-Supervised Learning with Taxonomic Labels [42.02670649470055]
We propose techniques to incorporate coarse taxonomic labels to train image classifiers in fine-grained domains.
On the Semi-iNat dataset consisting of 810 species across three Kingdoms, incorporating Phylum labels improves the Species level classification accuracy by 6%.
We propose a technique to select relevant data from a large collection of unlabeled images guided by the hierarchy which improves the robustness.
arXiv Detail & Related papers (2021-11-23T00:50:25Z) - Crop mapping from image time series: deep learning with multi-scale
label hierarchies [22.58506027920305]
We develop a crop classification method that exploits expert knowledge and significantly improves the mapping of rare crop types.
The three-level label hierarchy is encoded in a convolutional, recurrent neural network (convRNN)
We validate the proposed method on a new, large dataset that we make public.
arXiv Detail & Related papers (2021-02-17T15:27:49Z) - A Study on the Autoregressive and non-Autoregressive Multi-label
Learning [77.11075863067131]
We propose a self-attention based variational encoder-model to extract the label-label and label-feature dependencies jointly.
Our model can therefore be used to predict all labels in parallel while still including both label-label and label-feature dependencies.
arXiv Detail & Related papers (2020-12-03T05:41:44Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.