Optimizing Navigation And Chemical Application in Precision Agriculture With Deep Reinforcement Learning And Conditional Action Tree
- URL: http://arxiv.org/abs/2503.17985v1
- Date: Sun, 23 Mar 2025 08:38:13 GMT
- Title: Optimizing Navigation And Chemical Application in Precision Agriculture With Deep Reinforcement Learning And Conditional Action Tree
- Authors: Mahsa Khosravi, Zhanhong Jiang, Joshua R Waite, Sarah Jonesc, Hernan Torres, Arti Singh, Baskar Ganapathysubramanian, Asheesh Kumar Singh, Soumik Sarkar,
- Abstract summary: This paper presents a novel reinforcement learning-based planning scheme for robotic management of biotic stresses in precision agriculture.<n>The framework employs a hierarchical decision-making structure with conditional action masking, where high-level actions direct the robot's exploration.
- Score: 6.642314074960705
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: This paper presents a novel reinforcement learning (RL)-based planning scheme for optimized robotic management of biotic stresses in precision agriculture. The framework employs a hierarchical decision-making structure with conditional action masking, where high-level actions direct the robot's exploration, while low-level actions optimize its navigation and efficient chemical spraying in affected areas. The key objectives of optimization include improving the coverage of infected areas with limited battery power and reducing chemical usage, thus preventing unnecessary spraying of healthy areas of the field. Our numerical experimental results demonstrate that the proposed method, Hierarchical Action Masking Proximal Policy Optimization (HAM-PPO), significantly outperforms baseline practices, such as LawnMower navigation + indiscriminate spraying (Carpet Spray), in terms of yield recovery and resource efficiency. HAM-PPO consistently achieves higher yield recovery percentages and lower chemical costs across a range of infection scenarios. The framework also exhibits robustness to observation noise and generalizability under diverse environmental conditions, adapting to varying infection ranges and spatial distribution patterns.
Related papers
- Optimizing Interplanetary Trajectories using Hybrid Meta-heuristic [1.03590082373586]
This paper proposes an advanced hybrid optimization (GMPA) algorithm to address the inherent limitations of the Grey Wolf Predators (GWO)<n>GMPA integrates essential features from the Marine Algorithm (MPA) into the GWO framework, enabling superior performance through enhanced exploration and exploitation balance.<n> Empirical evaluations demonstrate GMPA's superior effectiveness compared to traditional GWO and other advanced metaheuristic algorithms.
arXiv Detail & Related papers (2025-05-18T12:53:48Z) - Enhancing Treatment Effect Estimation via Active Learning: A Counterfactual Covering Perspective [61.284843894545475]
Complex algorithms for treatment effect estimation are ineffective when handling insufficiently labeled training sets.<n>We propose FCCM, which transforms the optimization objective into the textitFactual and textitCounterfactual Coverage Maximization to ensure effective radius reduction during data acquisition.<n> benchmarking FCCM against other baselines demonstrates its superiority across both fully synthetic and semi-synthetic datasets.
arXiv Detail & Related papers (2025-05-08T13:42:00Z) - Goat Optimization Algorithm: A Novel Bio-Inspired Metaheuristic for Global Optimization [1.2289361708127877]
This paper presents a novel bio-inspired metaheuristic optimization technique inspired by goats' adaptive foraging, strategic movement, and parasite avoidance behaviors.<n>The algorithm's performance is evaluated on standard unimodal benchmark functions.<n>The findings suggest that GOA is a promising advancement in bio-inspired optimization techniques.
arXiv Detail & Related papers (2025-03-04T06:44:07Z) - Global-Decision-Focused Neural ODEs for Proactive Grid Resilience Management [50.34345101758248]
We propose predict-all-then-optimize-globally (PATOG), a framework that integrates outage prediction with globally optimized interventions.<n>Our approach ensures spatially and temporally coherent decision-making, improving both predictive accuracy and operational efficiency.<n>Experiments on synthetic and real-world datasets demonstrate significant improvements in outage prediction consistency and grid resilience.
arXiv Detail & Related papers (2025-02-25T16:15:35Z) - DrugImproverGPT: A Large Language Model for Drug Optimization with Fine-Tuning via Structured Policy Optimization [53.27954325490941]
Finetuning a Large Language Model (LLM) is crucial for generating results towards specific objectives.<n>This research introduces a novel reinforcement learning algorithm to finetune a drug optimization LLM-based generative model.
arXiv Detail & Related papers (2025-02-11T04:00:21Z) - CROPS: A Deployable Crop Management System Over All Possible State Availabilities [11.831002170207547]
This paper introduces a deployable textbfCRop Management system textbfOver all textbfPossible textbfState availabilities (CROPS)
arXiv Detail & Related papers (2024-11-09T02:06:09Z) - A Comparative Study of Deep Reinforcement Learning for Crop Production Management [13.123171643387668]
Reinforcement learning (RL) has emerged as a promising tool for developing adaptive crop management policies.
In the gym-DSSAT crop model environment, one of the most widely used simulators for crop management, proximal policy optimization (PPO) and deep Q-networks (DQN) have shown promising results.
In this study, we evaluated PPO and DQN against static baseline policies across three different RL tasks, fertilization, irrigation, and mixed management, provided by the gym-DSSAT environment.
arXiv Detail & Related papers (2024-11-06T18:35:51Z) - Hierarchical Preference Optimization: Learning to achieve goals via feasible subgoals prediction [71.81851971324187]
This work introduces Hierarchical Preference Optimization (HPO), a novel approach to hierarchical reinforcement learning (HRL)
HPO addresses non-stationarity and infeasible subgoal generation issues when solving complex robotic control tasks.
Experiments on challenging robotic navigation and manipulation tasks demonstrate impressive performance of HPO, where it shows an improvement of up to 35% over the baselines.
arXiv Detail & Related papers (2024-11-01T04:58:40Z) - AgGym: An agricultural biotic stress simulation environment for ultra-precision management planning [8.205412609306713]
We present AgGym, a modular, crop and stress simulation framework to model the spread of biotic stresses in a field.
We show that AgGym can be customized with limited data to simulate yield outcomes under various biotic stress conditions.
Our proposed framework enables personalized decision support that can transform biotic stress management from being schedule based to opportunistic and prescriptive.
arXiv Detail & Related papers (2024-09-01T14:55:45Z) - The Effect of Different Optimization Strategies to Physics-Constrained
Deep Learning for Soil Moisture Estimation [5.804881282638357]
We propose a physics-constrained deep learning (P-DL) framework to integrate physics-based principles on water transport and water sensing signals.
We demonstrate the empirical convergence function Adams outperforms the other optimization methods in both mini-batch and full-batch training.
arXiv Detail & Related papers (2024-03-13T00:32:30Z) - Airport take-off and landing optimization through genetic algorithms [55.2480439325792]
This research addresses the crucial issue of pollution from aircraft operations, focusing on optimizing both gate allocation and runway scheduling simultaneously.
The study presents an innovative genetic algorithm-based method for minimizing pollution from fuel combustion during aircraft take-off and landing at airports.
arXiv Detail & Related papers (2024-02-29T14:53:55Z) - Learning Regions of Interest for Bayesian Optimization with Adaptive
Level-Set Estimation [84.0621253654014]
We propose a framework, called BALLET, which adaptively filters for a high-confidence region of interest.
We show theoretically that BALLET can efficiently shrink the search space, and can exhibit a tighter regret bound than standard BO.
arXiv Detail & Related papers (2023-07-25T09:45:47Z) - Diverse Policy Optimization for Structured Action Space [59.361076277997704]
We propose Diverse Policy Optimization (DPO) to model the policies in structured action space as the energy-based models (EBM)
A novel and powerful generative model, GFlowNet, is introduced as the efficient, diverse EBM-based policy sampler.
Experiments on ATSC and Battle benchmarks demonstrate that DPO can efficiently discover surprisingly diverse policies.
arXiv Detail & Related papers (2023-02-23T10:48:09Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.