Comparing Machine Learning Techniques for Alfalfa Biomass Yield
Prediction
- URL: http://arxiv.org/abs/2210.11226v1
- Date: Thu, 20 Oct 2022 13:00:33 GMT
- Title: Comparing Machine Learning Techniques for Alfalfa Biomass Yield
Prediction
- Authors: Jonathan Vance, Khaled Rasheed, Ali Missaoui, Frederick Maier,
Christian Adkins, Chris Whitmire
- Abstract summary: alfalfa crop is globally important as livestock feed, so highly efficient planting and harvesting could benefit many industries.
Recent work using machine learning to predict yields for alfalfa and other crops has shown promise.
Previous efforts used remote sensing, weather, planting, and soil data to train machine learning models for yield prediction.
- Score: 0.8808021343665321
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: The alfalfa crop is globally important as livestock feed, so highly efficient
planting and harvesting could benefit many industries, especially as the global
climate changes and traditional methods become less accurate. Recent work using
machine learning (ML) to predict yields for alfalfa and other crops has shown
promise. Previous efforts used remote sensing, weather, planting, and soil data
to train machine learning models for yield prediction. However, while remote
sensing works well, the models require large amounts of data and cannot make
predictions until the harvesting season begins. Using weather and planting data
from alfalfa variety trials in Kentucky and Georgia, our previous work compared
feature selection techniques to find the best technique and best feature set.
In this work, we trained a variety of machine learning models, using cross
validation for hyperparameter optimization, to predict biomass yields, and we
showed better accuracy than similar work that employed more complex techniques.
Our best individual model was a random forest with a mean absolute error of
0.081 tons/acre and R{$^2$} of 0.941. Next, we expanded this dataset to include
Wisconsin and Mississippi, and we repeated our experiments, obtaining a higher
best R{$^2$} of 0.982 with a regression tree. We then isolated our testing
datasets by state to explore this problem's eligibility for domain adaptation
(DA), as we trained on multiple source states and tested on one target state.
This Trivial DA (TDA) approach leaves plenty of room for improvement through
exploring more complex DA techniques in forthcoming work.
Related papers
- A Deep Learning Model for Heterogeneous Dataset Analysis -- Application
to Winter Wheat Crop Yield Prediction [0.6595290783361958]
Time-series deep learning models, such as Long Short Term Memory (LSTM), have already been explored and applied to yield prediction.
The existing LSTM cannot handle heterogeneous datasets.
We propose an efficient deep learning model that can deal with heterogeneous datasets.
arXiv Detail & Related papers (2023-06-20T23:39:06Z) - Exploring the Effectiveness of Dataset Synthesis: An application of
Apple Detection in Orchards [68.95806641664713]
We explore the usability of Stable Diffusion 2.1-base for generating synthetic datasets of apple trees for object detection.
We train a YOLOv5m object detection model to predict apples in a real-world apple detection dataset.
Results demonstrate that the model trained on generated data is slightly underperforming compared to a baseline model trained on real-world images.
arXiv Detail & Related papers (2023-06-20T09:46:01Z) - Distill Gold from Massive Ores: Bi-level Data Pruning towards Efficient Dataset Distillation [96.92250565207017]
We study the data efficiency and selection for the dataset distillation task.
By re-formulating the dynamics of distillation, we provide insight into the inherent redundancy in the real dataset.
We find the most contributing samples based on their causal effects on the distillation.
arXiv Detail & Related papers (2023-05-28T06:53:41Z) - Productive Crop Field Detection: A New Dataset and Deep Learning
Benchmark Results [1.2233362977312945]
In precision agriculture, detecting productive crop fields is an essential practice that allows the farmer to evaluate operating performance.
Previous studies explore different methods to detect crop fields using advanced machine learning algorithms.
We propose a high-quality dataset generated by machine operation combined with Sentinel-2 images.
arXiv Detail & Related papers (2023-05-19T20:30:59Z) - ASPEST: Bridging the Gap Between Active Learning and Selective
Prediction [56.001808843574395]
Selective prediction aims to learn a reliable model that abstains from making predictions when uncertain.
Active learning aims to lower the overall labeling effort, and hence human dependence, by querying the most informative examples.
In this work, we introduce a new learning paradigm, active selective prediction, which aims to query more informative samples from the shifted target domain.
arXiv Detail & Related papers (2023-04-07T23:51:07Z) - Corn Yield Prediction based on Remotely Sensed Variables Using
Variational Autoencoder and Multiple Instance Regression [1.2697842097171117]
In the U.S., corn is the most produced crop and has been an essential part of the American diet.
To meet the demand for supply chain management and regional food security, accurate and timely large-scale corn yield prediction is attracting more attention in precision agriculture.
Recently, remote sensing technology and machine learning methods have been widely explored for crop yield prediction.
arXiv Detail & Related papers (2022-11-23T20:18:26Z) - Generative models-based data labeling for deep networks regression:
application to seed maturity estimation from UAV multispectral images [3.6868861317674524]
Monitoring seed maturity is an increasing challenge in agriculture due to climate change and more restrictive practices.
Traditional methods are based on limited sampling in the field and analysis in laboratory.
We propose a method for estimating parsley seed maturity using multispectral UAV imagery, with a new approach for automatic data labeling.
arXiv Detail & Related papers (2022-08-09T09:06:51Z) - Leveraging Unlabeled Data to Predict Out-of-Distribution Performance [63.740181251997306]
Real-world machine learning deployments are characterized by mismatches between the source (training) and target (test) distributions.
In this work, we investigate methods for predicting the target domain accuracy using only labeled source data and unlabeled target data.
We propose Average Thresholded Confidence (ATC), a practical method that learns a threshold on the model's confidence, predicting accuracy as the fraction of unlabeled examples.
arXiv Detail & Related papers (2022-01-11T23:01:12Z) - Comparison of Machine Learning Methods for Predicting Winter Wheat Yield
in Germany [0.0]
This study analyzed the performance of different machine learning methods for winter wheat yield prediction.
To address the seasonality, weekly features were used that explicitly take soil moisture conditions and meteorological events into account.
arXiv Detail & Related papers (2021-05-04T04:40:53Z) - Injecting Knowledge in Data-driven Vehicle Trajectory Predictors [82.91398970736391]
Vehicle trajectory prediction tasks have been commonly tackled from two perspectives: knowledge-driven or data-driven.
In this paper, we propose to learn a "Realistic Residual Block" (RRB) which effectively connects these two perspectives.
Our proposed method outputs realistic predictions by confining the residual range and taking into account its uncertainty.
arXiv Detail & Related papers (2021-03-08T16:03:09Z) - A CNN Approach to Simultaneously Count Plants and Detect Plantation-Rows
from UAV Imagery [56.10033255997329]
We propose a novel deep learning method based on a Convolutional Neural Network (CNN)
It simultaneously detects and geolocates plantation-rows while counting its plants considering highly-dense plantation configurations.
The proposed method achieved state-of-the-art performance for counting and geolocating plants and plant-rows in UAV images from different types of crops.
arXiv Detail & Related papers (2020-12-31T18:51:17Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.