An Online Optimization-Based Decision Support Tool for Small Farmers in
India: Learning in Non-stationary Environments
- URL: http://arxiv.org/abs/2311.17277v1
- Date: Tue, 28 Nov 2023 23:33:16 GMT
- Title: An Online Optimization-Based Decision Support Tool for Small Farmers in
India: Learning in Non-stationary Environments
- Authors: Tuxun Lu, Aviva Prins
- Abstract summary: Small farmers in India, who could greatly benefit from these tools, do not have access to them.
In this paper, we model an individual greenhouse as a Markov Decision Process (MDP) and adapt Li and Li's Follow the Leader (FWL) online learning algorithm to offer crop planning advice.
- Score: 1.3597551064547502
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Crop management decision support systems are specialized tools for farmers
that reduce the riskiness of revenue streams, especially valuable for use under
the current climate changes that impact agricultural productivity.
Unfortunately, small farmers in India, who could greatly benefit from these
tools, do not have access to them. In this paper, we model an individual
greenhouse as a Markov Decision Process (MDP) and adapt Li and Li (2019)'s
Follow the Weighted Leader (FWL) online learning algorithm to offer crop
planning advice. We successfully produce utility-preserving cropping pattern
suggestions in simulations. When we compare against an offline planning
algorithm, we achieve the same cumulative revenue with greatly reduced runtime.
Related papers
- To Measure or Not: A Cost-Sensitive, Selective Measuring Environment for Agricultural Management Decisions with Reinforcement Learning [0.0]
In most cases, it is not feasible to gather crop state measurements before every decision moment.
We apply reinforcement learning to recommend opportune moments to simultaneously measure crop features and apply nitrogen fertilizer.
Our results highlight the importance of measuring when crop feature measurements are not readily available.
arXiv Detail & Related papers (2025-01-22T12:03:53Z) - The Dual-use Dilemma in LLMs: Do Empowering Ethical Capacities Make a Degraded Utility? [54.18519360412294]
Large Language Models (LLMs) must balance between rejecting harmful requests for safety and accommodating legitimate ones for utility.
This paper presents a Direct Preference Optimization (DPO) based alignment framework that achieves better overall performance.
Our resulting model, LibraChem, outperforms leading LLMs including Claude-3, GPT-4o, and LLaMA-3 by margins of 13.44%, 7.16%, and 7.10% respectively.
arXiv Detail & Related papers (2025-01-20T06:35:01Z) - Adaptive Pruning for Large Language Models with Structural Importance Awareness [66.2690963378878]
Large language models (LLMs) have significantly improved language understanding and generation capabilities.
LLMs are difficult to deploy on resource-constrained edge devices due to their high computational and storage resource demands.
We propose structurally-aware adaptive pruning (SAAP) to significantly reduce the computational and memory costs while maintaining model performance.
arXiv Detail & Related papers (2024-12-19T18:08:04Z) - Comparative Analysis of Multi-Agent Reinforcement Learning Policies for Crop Planning Decision Support [0.873811641236639]
In India, majority of farmers are classified as small or marginal, making their livelihoods particularly vulnerable to economic losses due to market saturation and climate risks.
Existing decision support systems (DSS) often provide generic recommendations that fail to account for real-time market dynamics and the interactions among multiple farmers.
In this paper, we evaluate the viability of three multi-agent reinforcement learning (MARL) approaches for optimizing total farmer income and promoting fairness in crop planning.
arXiv Detail & Related papers (2024-12-03T00:30:19Z) - AgGym: An agricultural biotic stress simulation environment for ultra-precision management planning [8.205412609306713]
We present AgGym, a modular, crop and stress simulation framework to model the spread of biotic stresses in a field.
We show that AgGym can be customized with limited data to simulate yield outcomes under various biotic stress conditions.
Our proposed framework enables personalized decision support that can transform biotic stress management from being schedule based to opportunistic and prescriptive.
arXiv Detail & Related papers (2024-09-01T14:55:45Z) - MOTO: Offline Pre-training to Online Fine-tuning for Model-based Robot
Learning [52.101643259906915]
We study the problem of offline pre-training and online fine-tuning for reinforcement learning from high-dimensional observations.
Existing model-based offline RL methods are not suitable for offline-to-online fine-tuning in high-dimensional domains.
We propose an on-policy model-based method that can efficiently reuse prior data through model-based value expansion and policy regularization.
arXiv Detail & Related papers (2024-01-06T21:04:31Z) - AdaPlanner: Adaptive Planning from Feedback with Language Models [56.367020818139665]
Large language models (LLMs) have recently demonstrated the potential in acting as autonomous agents for sequential decision-making tasks.
We propose a closed-loop approach, AdaPlanner, which allows the LLM agent to refine its self-generated plan adaptively in response to environmental feedback.
To mitigate hallucination, we develop a code-style LLM prompt structure that facilitates plan generation across a variety of tasks, environments, and agent capabilities.
arXiv Detail & Related papers (2023-05-26T05:52:27Z) - Benchmarking Automated Machine Learning Methods for Price Forecasting
Applications [58.720142291102135]
We show the possibility of substituting manually created ML pipelines with automated machine learning (AutoML) solutions.
Based on the CRISP-DM process, we split the manual ML pipeline into a machine learning and non-machine learning part.
We show in a case study for the industrial use case of price forecasting, that domain knowledge combined with AutoML can weaken the dependence on ML experts.
arXiv Detail & Related papers (2023-04-28T10:27:38Z) - Privacy Adhering Machine Un-learning in NLP [66.17039929803933]
In real world industry use Machine Learning to build models on user data.
Such mandates require effort both in terms of data as well as model retraining.
continuous removal of data and model retraining steps do not scale.
We propose textitMachine Unlearning to tackle this challenge.
arXiv Detail & Related papers (2022-12-19T16:06:45Z) - DeepG2P: Fusing Multi-Modal Data to Improve Crop Production [1.7406327893433846]
We present a Natural Language Processing-based neural network architecture to process the G, E and M inputs and their interactions.
We show that by modeling DNA as natural language, our approach performs better than previous approaches when tested for new environments.
arXiv Detail & Related papers (2022-11-11T03:32:44Z) - Machine Learning aided Crop Yield Optimization [0.0]
We present a crop simulation environment with an OpenAI Gym interface, and apply modern deep reinforcement learning (DRL) algorithms to optimize yield.
We empirically show that DRL algorithms may be useful in discovering new policies and approaches to help optimize crop yield, while simultaneously minimizing constraining factors such as water and fertilizer usage.
arXiv Detail & Related papers (2021-11-01T14:14:11Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.