Knowledge-guided machine learning for county-level corn yield prediction under drought
- URL: http://arxiv.org/abs/2503.16328v2
- Date: Mon, 05 May 2025 21:01:27 GMT
- Title: Knowledge-guided machine learning for county-level corn yield prediction under drought
- Authors: Xiaoyu Wang, Yijia Xu, Jingyi Huang, Zhengwei Yang, Zhou Zhang,
- Abstract summary: Remote sensing (RS) technique, enabling the non-contact acquisition of extensive ground observations, is a valuable tool for crop yield predictions.<n>Traditional process-based models struggle to incorporate large volumes of RS data.<n>Machine learning (ML) models are often criticized as "black boxes" due to their limited interpretability.
- Score: 7.75600387348283
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Remote sensing (RS) technique, enabling the non-contact acquisition of extensive ground observations, is a valuable tool for crop yield predictions. Traditional process-based models struggle to incorporate large volumes of RS data, and most users lack understanding of crop growth mechanisms. In contrast, machine learning (ML) models are often criticized as "black boxes" due to their limited interpretability. To address these limitations, we utilized Knowledge-Guided Machine Learning (KGML), a framework that leverages the strengths of both process-based and ML models. Existing works have either overlooked the role of soil moisture in corn growth or did not embed this effect into their models. To bridge this gap, we developed the Knowledge-Guided Machine Learning with Soil Moisture (KGML-SM) framework, treating soil moisture as an intermediate variable in corn growth to emphasize its key role in plant development. Additionally, based on the prior knowledge that the model may overestimate under drought conditions, we designed a drought-aware loss function that penalized predicted yield in drought-affected areas. Our experiments showed that the KGML-SM model outperformed other traditional ML models. We explored the relationships between drought, soil moisture, and corn yield prediction by assessing the importance of different features within the model, and analyzing how soil moisture impacts predictions across different regions and time periods. Finally we provided interpretability for prediction errors to guide future model optimization.
Related papers
- Machine Learning Models for Soil Parameter Prediction Based on Satellite, Weather, Clay and Yield Data [1.546169961420396]
The AgroLens project endeavors to develop Machine Learning-based methodologies to predict soil nutrient levels without reliance on laboratory tests.
The approach begins with the development of a robust European model using the LUCAS Soil dataset and Sentinel-2 satellite imagery.
Advanced algorithms, including Random Forests, Extreme Gradient Boosting (XGBoost), and Fully Connected Neural Networks (FCNN), were implemented and finetuned for precise nutrient prediction.
arXiv Detail & Related papers (2025-03-28T09:44:32Z) - Loss Landscape Analysis for Reliable Quantized ML Models for Scientific Sensing [41.89148096989836]
We propose a method to perform empirical analysis of the loss landscape of machine learning (ML) models.<n>Our method allows assessing the robustness of ML models to such effects as a function of quantization precision and under different regularization techniques.
arXiv Detail & Related papers (2025-02-12T12:30:49Z) - Anticipatory Understanding of Resilient Agriculture to Climate [66.008020515555]
We present a framework to better identify food security hotspots using a combination of remote sensing, deep learning, crop yield modeling, and causal modeling of the food distribution system.
We focus our analysis on the wheat breadbasket of northern India, which supplies a large percentage of the world's population.
arXiv Detail & Related papers (2024-11-07T22:29:05Z) - MambaDS: Near-Surface Meteorological Field Downscaling with Topography Constrained Selective State Space Modeling [68.69647625472464]
Downscaling, a crucial task in meteorological forecasting, enables the reconstruction of high-resolution meteorological states for target regions.
Previous downscaling methods lacked tailored designs for meteorology and encountered structural limitations.
We propose a novel model called MambaDS, which enhances the utilization of multivariable correlations and topography information.
arXiv Detail & Related papers (2024-08-20T13:45:49Z) - MIS-ME: A Multi-modal Framework for Soil Moisture Estimation [0.5235143203977018]
We develop a dataset consisting of real-world images taken from ground stations and corresponding weather data.
We also propose MIS-ME - Meteorological & Image based Soil Moisture Estor.
Our analysis shows that MIS-ME achieves a MAPE of 10.14%, outperforming traditional unimodal approaches.
arXiv Detail & Related papers (2024-08-02T00:35:18Z) - Toward Adaptive Large Language Models Structured Pruning via Hybrid-grained Weight Importance Assessment [58.030196381554745]
We introduce the Hybrid-grained Weight Importance Assessment (HyWIA), a novel method that merges fine-grained and coarse-grained evaluations of weight importance for the pruning of large language models (LLMs)
Extensive experiments on LLaMA-V1/V2, Vicuna, Baichuan, and Bloom across various benchmarks demonstrate the effectiveness of HyWIA in pruning LLMs.
arXiv Detail & Related papers (2024-03-16T04:12:50Z) - Fairness Feedback Loops: Training on Synthetic Data Amplifies Bias [47.79659355705916]
Model-induced distribution shifts (MIDS) occur as previous model outputs pollute new model training sets over generations of models.
We introduce a framework that allows us to track multiple MIDS over many generations, finding that they can lead to loss in performance, fairness, and minoritized group representation.
Despite these negative consequences, we identify how models might be used for positive, intentional, interventions in their data ecosystems.
arXiv Detail & Related papers (2024-03-12T17:48:08Z) - Ensemble models outperform single model uncertainties and predictions
for operator-learning of hypersonic flows [43.148818844265236]
Training scientific machine learning (SciML) models on limited high-fidelity data offers one approach to rapidly predict behaviors for situations that have not been seen before.
High-fidelity data is itself in limited quantity to validate all outputs of the SciML model in unexplored input space.
We extend a DeepONet using three different uncertainty mechanisms: mean-variance estimation, evidential uncertainty, and ensembling.
arXiv Detail & Related papers (2023-10-31T18:07:29Z) - Long-term drought prediction using deep neural networks based on geospatial weather data [75.38539438000072]
High-quality drought forecasting up to a year in advance is critical for agriculture planning and insurance.
We tackle drought data by introducing an end-to-end approach that adopts a systematic end-to-end approach.
Key findings are the exceptional performance of a Transformer model, EarthFormer, in making accurate short-term (up to six months) forecasts.
arXiv Detail & Related papers (2023-09-12T13:28:06Z) - Winter Wheat Crop Yield Prediction on Multiple Heterogeneous Datasets
using Machine Learning [0.2580765958706853]
Winter wheat is one of the most important crops in the United Kingdom, and crop yield prediction is essential for the nation's food security.
Several studies have employed machine learning (ML) techniques to predict crop yield on a county or farm-based level.
The main objective of this study is to predict winter wheat crop yield using ML models on multiple heterogeneous datasets.
arXiv Detail & Related papers (2023-06-20T23:52:39Z) - Online Non-Destructive Moisture Content Estimation of Filter Media
During Drying Using Artificial Neural Networks [95.42181254494287]
Moisture content (MC) estimation is important in the manufacturing process of drying bulky filter media products.
An artificial neural network (ANN) based method is compared to state-of-the-art MC estimation methods reported in the literature.
Experimental results show that ANNs combined with oven settings data, drying time and product temperature can be used to reliably estimate the MC of bulky filter media products.
arXiv Detail & Related papers (2023-03-27T19:37:53Z) - Back2Future: Leveraging Backfill Dynamics for Improving Real-time
Predictions in Future [73.03458424369657]
In real-time forecasting in public health, data collection is a non-trivial and demanding task.
'Backfill' phenomenon and its effect on model performance has been barely studied in the prior literature.
We formulate a novel problem and neural framework Back2Future that aims to refine a given model's predictions in real-time.
arXiv Detail & Related papers (2021-06-08T14:48:20Z) - Comparison of Machine Learning Methods for Predicting Winter Wheat Yield
in Germany [0.0]
This study analyzed the performance of different machine learning methods for winter wheat yield prediction.
To address the seasonality, weekly features were used that explicitly take soil moisture conditions and meteorological events into account.
arXiv Detail & Related papers (2021-05-04T04:40:53Z) - Semi-supervised Soil Moisture Prediction through Graph Neural Networks [12.891517184512551]
We propose to convert the problem of soil moisture prediction as a semi-supervised learning on temporal graphs.
We propose a dynamic graph neural network which can use the dependency of related locations over a region to predict soil moisture.
Our algorithm, referred as DGLR, provides an end-to-end learning which can predict soil moisture over multiple locations in a region over time and also update the graph structure in between.
arXiv Detail & Related papers (2020-12-07T07:56:11Z) - Coupling Machine Learning and Crop Modeling Improves Crop Yield
Prediction in the US Corn Belt [2.580765958706854]
This study investigates whether coupling crop modeling and machine learning (ML) improves corn yield predictions in the US Corn Belt.
The main objectives are to explore whether a hybrid approach (crop modeling + ML) would result in better predictions, and determine the features from the crop modeling that are most effective to be integrated with ML for corn yield prediction.
arXiv Detail & Related papers (2020-07-28T16:22:44Z) - Sub-Seasonal Climate Forecasting via Machine Learning: Challenges,
Analysis, and Advances [44.28969320556008]
Sub-seasonal climate forecasting (SSF) focuses on predicting key climate variables such as temperature and precipitation in the 2-week to 2-month time scales.
In this paper, we study a variety of machine learning (ML) approaches for SSF over the US mainland.
arXiv Detail & Related papers (2020-06-14T18:39:27Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.