Feasibility of machine learning-based rice yield prediction in India at
the district level using climate reanalysis data
- URL: http://arxiv.org/abs/2403.07967v1
- Date: Tue, 12 Mar 2024 13:31:13 GMT
- Title: Feasibility of machine learning-based rice yield prediction in India at
the district level using climate reanalysis data
- Authors: Djavan De Clercq, Adam Mahdi
- Abstract summary: This study aims to investigate whether machine learning-based yield prediction models can capably predict Kharif season rice yields at the district level in India.
The methodology involved training 19 machine learning models on 20 years of climate, satellite, and rice yield data across 247 of Indian rice-producing districts.
Results showed rice yields can be predicted with a reasonable degree of accuracy, with out-of-sample R2, MAE, and MAPE performance of up to 0.82, 0.29, and 0.16 respectively.
- Score: 0.0
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: Yield forecasting, the science of predicting agricultural productivity before
the crop harvest occurs, helps a wide range of stakeholders make better
decisions around agricultural planning. This study aims to investigate whether
machine learning-based yield prediction models can capably predict Kharif
season rice yields at the district level in India several months before the
rice harvest takes place. The methodology involved training 19 machine learning
models such as CatBoost, LightGBM, Orthogonal Matching Pursuit, and Extremely
Randomized Trees on 20 years of climate, satellite, and rice yield data across
247 of Indian rice-producing districts. In addition to model-building, a
dynamic dashboard was built understand how the reliability of rice yield
predictions varies across districts. The results of the proof-of-concept
machine learning pipeline demonstrated that rice yields can be predicted with a
reasonable degree of accuracy, with out-of-sample R2, MAE, and MAPE performance
of up to 0.82, 0.29, and 0.16 respectively. These results outperformed test set
performance reported in related literature on rice yield modeling in other
contexts and countries. In addition, SHAP value analysis was conducted to infer
both the importance and directional impact of the climate and remote sensing
variables included in the model. Important features driving rice yields
included temperature, soil water volume, and leaf area index. In particular,
higher temperatures in August correlate with increased rice yields,
particularly when the leaf area index in August is also high. Building on the
results, a proof-of-concept dashboard was developed to allow users to easily
explore which districts may experience a rise or fall in yield relative to the
previous year.
Related papers
- Anticipatory Understanding of Resilient Agriculture to Climate [66.008020515555]
We present a framework to better identify food security hotspots using a combination of remote sensing, deep learning, crop yield modeling, and causal modeling of the food distribution system.
We focus our analysis on the wheat breadbasket of northern India, which supplies a large percentage of the world's population.
arXiv Detail & Related papers (2024-11-07T22:29:05Z) - Sparsity, Regularization and Causality in Agricultural Yield: The Case
of Paddy Rice in Peru [0.0]
This study integrates agricultural census data with remotely sensed time series to develop precise predictive models for paddy rice yield across various regions of Peru.
By utilizing sparse regression and Elastic-Net regularization techniques, the study identifies causal relationships between key remotely sensed variables.
arXiv Detail & Related papers (2024-09-25T19:16:54Z) - A Novel Fusion of Optical and Radar Satellite Data for Crop Phenology Estimation using Machine Learning and Cloud Computing [0.0]
In the era of big Earth observation data ubiquity, attempts have been made to accurately predict crop phenology based on Remote Sensing data.
Here, we estimate phenological developments for eight major crops and 13 phenological stages across Germany at 30m scale using a novel framework.
arXiv Detail & Related papers (2024-08-16T13:44:35Z) - Naïve Bayes and Random Forest for Crop Yield Prediction [0.0]
This study analyzes crop yield prediction in India from 1997 to 2020, focusing on various crops and key environmental factors.
It aims to predict agricultural yields by utilizing advanced machine learning techniques like Linear Regression, Decision Tree, KNN, Na"ive Bayes, K-Mean Clustering, and Random Forest.
arXiv Detail & Related papers (2024-04-23T16:55:45Z) - FengWu-GHR: Learning the Kilometer-scale Medium-range Global Weather
Forecasting [56.73502043159699]
This work presents FengWu-GHR, the first data-driven global weather forecasting model running at the 0.09$circ$ horizontal resolution.
It introduces a novel approach that opens the door for operating ML-based high-resolution forecasts by inheriting prior knowledge from a low-resolution model.
The hindcast of weather prediction in 2022 indicates that FengWu-GHR is superior to the IFS-HRES.
arXiv Detail & Related papers (2024-01-28T13:23:25Z) - Long-term drought prediction using deep neural networks based on geospatial weather data [75.38539438000072]
High-quality drought forecasting up to a year in advance is critical for agriculture planning and insurance.
We tackle drought data by introducing an end-to-end approach that adopts a systematic end-to-end approach.
Key findings are the exceptional performance of a Transformer model, EarthFormer, in making accurate short-term (up to six months) forecasts.
arXiv Detail & Related papers (2023-09-12T13:28:06Z) - HarvestNet: A Dataset for Detecting Smallholder Farming Activity Using
Harvest Piles and Remote Sensing [50.4506590177605]
HarvestNet is a dataset for mapping the presence of farms in the Ethiopian regions of Tigray and Amhara during 2020-2023.
We introduce a new approach based on the detection of harvest piles characteristic of many smallholder systems.
We conclude that remote sensing of harvest piles can contribute to more timely and accurate cropland assessments in food insecure regions.
arXiv Detail & Related papers (2023-08-23T11:03:28Z) - Extreme Gradient Boosting for Yield Estimation compared with Deep
Learning Approaches [0.0]
We propose a pipeline to process remote sensing images into feature-based representations that allow the employment of Extreme Gradient Boosting (XGBoost) for yield prediction.
A comparative evaluation of soybean yield prediction within the United States shows promising prediction accuracies compared to state-of-the-art yield prediction systems based on Deep Learning.
arXiv Detail & Related papers (2022-08-26T12:48:18Z) - Comparison of Machine Learning Methods for Predicting Winter Wheat Yield
in Germany [0.0]
This study analyzed the performance of different machine learning methods for winter wheat yield prediction.
To address the seasonality, weekly features were used that explicitly take soil moisture conditions and meteorological events into account.
arXiv Detail & Related papers (2021-05-04T04:40:53Z) - Estimating Crop Primary Productivity with Sentinel-2 and Landsat 8 using
Machine Learning Methods Trained with Radiative Transfer Simulations [58.17039841385472]
We take advantage of all parallel developments in mechanistic modeling and satellite data availability for advanced monitoring of crop productivity.
Our model successfully estimates gross primary productivity across a variety of C3 crop types and environmental conditions even though it does not use any local information from the corresponding sites.
This highlights its potential to map crop productivity from new satellite sensors at a global scale with the help of current Earth observation cloud computing platforms.
arXiv Detail & Related papers (2020-12-07T16:23:13Z) - Learning from Data to Optimize Control in Precision Farming [77.34726150561087]
Special issue presents the latest development in statistical inference, machine learning and optimum control for precision farming.
Satellite positioning and navigation followed by Internet-of-Things generate vast information that can be used to optimize farming processes in real-time.
arXiv Detail & Related papers (2020-07-07T12:44:17Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.