End-to-end deep learning for directly estimating grape yield from
ground-based imagery
- URL: http://arxiv.org/abs/2208.02394v1
- Date: Thu, 4 Aug 2022 01:34:46 GMT
- Title: End-to-end deep learning for directly estimating grape yield from
ground-based imagery
- Authors: Alexander G. Olenskyj, Brent S. Sams, Zhenghao Fei, Vishal Singh,
Pranav V. Raja, Gail M. Bornhorst, J. Mason Earles
- Abstract summary: This study demonstrates the application of proximal imaging combined with deep learning for yield estimation in vineyards.
Three model architectures were tested: object detection, CNN regression, and transformer models.
The study showed the applicability of proximal imaging and deep learning for prediction of grapevine yield on a large scale.
- Score: 53.086864957064876
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: Yield estimation is a powerful tool in vineyard management, as it allows
growers to fine-tune practices to optimize yield and quality. However, yield
estimation is currently performed using manual sampling, which is
time-consuming and imprecise. This study demonstrates the application of
proximal imaging combined with deep learning for yield estimation in vineyards.
Continuous data collection using a vehicle-mounted sensing kit combined with
collection of ground truth yield data at harvest using a commercial yield
monitor allowed for the generation of a large dataset of 23,581 yield points
and 107,933 images. Moreover, this study was conducted in a mechanically
managed commercial vineyard, representing a challenging environment for image
analysis but a common set of conditions in the California Central Valley. Three
model architectures were tested: object detection, CNN regression, and
transformer models. The object detection model was trained on hand-labeled
images to localize grape bunches, and either bunch count or pixel area was
summed to correlate with grape yield. Conversely, regression models were
trained end-to-end to predict grape yield from image data without the need for
hand labeling. Results demonstrated that both a transformer as well as the
object detection model with pixel area processing performed comparably, with a
mean absolute percent error of 18% and 18.5%, respectively on a representative
holdout dataset. Saliency mapping was used to demonstrate the attention of the
CNN model was localized near the predicted location of grape bunches, as well
as on the top of the grapevine canopy. Overall, the study showed the
applicability of proximal imaging and deep learning for prediction of grapevine
yield on a large scale. Additionally, the end-to-end modeling approach was able
to perform comparably to the object detection approach while eliminating the
need for hand-labeling.
Related papers
- Deep Domain Adaptation: A Sim2Real Neural Approach for Improving Eye-Tracking Systems [80.62854148838359]
Eye image segmentation is a critical step in eye tracking that has great influence over the final gaze estimate.
We use dimensionality-reduction techniques to measure the overlap between the target eye images and synthetic training data.
Our methods result in robust, improved performance when tackling the discrepancy between simulation and real-world data samples.
arXiv Detail & Related papers (2024-03-23T22:32:06Z) - Integrating processed-based models and machine learning for crop yield
prediction [1.3107669223114085]
In this work we investigate potato yield prediction using a hybrid meta-modeling approach.
A crop growth model is employed to generate synthetic data for (pre)training a convolutional neural net.
When applied in silico, our meta-modeling approach yields better predictions than a baseline comprising a purely data-driven approach.
arXiv Detail & Related papers (2023-07-25T12:51:25Z) - Exploring the Effectiveness of Dataset Synthesis: An application of
Apple Detection in Orchards [68.95806641664713]
We explore the usability of Stable Diffusion 2.1-base for generating synthetic datasets of apple trees for object detection.
We train a YOLOv5m object detection model to predict apples in a real-world apple detection dataset.
Results demonstrate that the model trained on generated data is slightly underperforming compared to a baseline model trained on real-world images.
arXiv Detail & Related papers (2023-06-20T09:46:01Z) - Generative models-based data labeling for deep networks regression:
application to seed maturity estimation from UAV multispectral images [3.6868861317674524]
Monitoring seed maturity is an increasing challenge in agriculture due to climate change and more restrictive practices.
Traditional methods are based on limited sampling in the field and analysis in laboratory.
We propose a method for estimating parsley seed maturity using multispectral UAV imagery, with a new approach for automatic data labeling.
arXiv Detail & Related papers (2022-08-09T09:06:51Z) - Facilitated machine learning for image-based fruit quality assessment in
developing countries [68.8204255655161]
Automated image classification is a common task for supervised machine learning in food science.
We propose an alternative method based on pre-trained vision transformers (ViTs)
It can be easily implemented with limited resources on a standard device.
arXiv Detail & Related papers (2022-07-10T19:52:20Z) - Monitoring crop phenology with street-level imagery using computer
vision [0.0]
We present a framework to collect and extract crop type and phenological information from street level imagery using computer vision.
During the 2018 growing season, high definition pictures were captured with side-looking action cameras in the Flevoland province of the Netherlands.
arXiv Detail & Related papers (2021-12-16T20:36:45Z) - Enlisting 3D Crop Models and GANs for More Data Efficient and
Generalizable Fruit Detection [0.0]
We propose a method that generates agricultural images from a synthetic 3D crop model domain into real world crop domains.
The method uses a semantically constrained GAN (generative adversarial network) to preserve the fruit position and geometry.
Incremental training experiments in vineyard grape detection tasks show that the images generated from our method can significantly speed the domain process.
arXiv Detail & Related papers (2021-08-30T16:11:59Z) - Potato Crop Stress Identification in Aerial Images using Deep
Learning-based Object Detection [60.83360138070649]
The paper presents an approach for analyzing aerial images of a potato crop using deep neural networks.
The main objective is to demonstrate automated spatial recognition of a healthy versus stressed crop at a plant level.
Experimental validation demonstrated the ability for distinguishing healthy and stressed plants in field images, achieving an average Dice coefficient of 0.74.
arXiv Detail & Related papers (2021-06-14T21:57:40Z) - Contemplating real-world object classification [53.10151901863263]
We reanalyze the ObjectNet dataset recently proposed by Barbu et al. containing objects in daily life situations.
We find that applying deep models to the isolated objects, rather than the entire scene as is done in the original paper, results in around 20-30% performance improvement.
arXiv Detail & Related papers (2021-03-08T23:29:59Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.