GPT-4 as an Agronomist Assistant? Answering Agriculture Exams Using
Large Language Models
- URL: http://arxiv.org/abs/2310.06225v2
- Date: Thu, 12 Oct 2023 17:06:17 GMT
- Title: GPT-4 as an Agronomist Assistant? Answering Agriculture Exams Using
Large Language Models
- Authors: Bruno Silva, Leonardo Nunes, Roberto Estev\~ao, Vijay Aski, Ranveer
Chandra
- Abstract summary: Large language models (LLMs) have demonstrated remarkable capabilities in natural language understanding across various domains.
We present a comprehensive evaluation of popular LLMs, such as Llama 2 and GPT, on their ability to answer agriculture-related questions.
We selected agriculture exams and benchmark datasets from three of the largest agriculture producer countries: Brazil, India, and the USA.
- Score: 1.3999521658236698
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Large language models (LLMs) have demonstrated remarkable capabilities in
natural language understanding across various domains, including healthcare and
finance. For some tasks, LLMs achieve similar or better performance than
trained human beings, therefore it is reasonable to employ human exams (e.g.,
certification tests) to assess the performance of LLMs. We present a
comprehensive evaluation of popular LLMs, such as Llama 2 and GPT, on their
ability to answer agriculture-related questions. In our evaluation, we also
employ RAG (Retrieval-Augmented Generation) and ER (Ensemble Refinement)
techniques, which combine information retrieval, generation capabilities, and
prompting strategies to improve the LLMs' performance. To demonstrate the
capabilities of LLMs, we selected agriculture exams and benchmark datasets from
three of the largest agriculture producer countries: Brazil, India, and the
USA. Our analysis highlights GPT-4's ability to achieve a passing score on
exams to earn credits for renewing agronomist certifications, answering 93% of
the questions correctly and outperforming earlier general-purpose models, which
achieved 88% accuracy. On one of our experiments, GPT-4 obtained the highest
performance when compared to human subjects. This performance suggests that
GPT-4 could potentially pass on major graduate education admission tests or
even earn credits for renewing agronomy certificates. We also explore the
models' capacity to address general agriculture-related questions and generate
crop management guidelines for Brazilian and Indian farmers, utilizing robust
datasets from the Brazilian Agency of Agriculture (Embrapa) and graduate
program exams from India. The results suggest that GPT-4, ER, and RAG can
contribute meaningfully to agricultural education, assessment, and crop
management practice, offering valuable insights to farmers and agricultural
professionals.
Related papers
- Enhancing Agricultural Machinery Management through Advanced LLM Integration [0.7366405857677226]
The integration of artificial intelligence into agricultural practices has the potential to revolutionize efficiency and sustainability in farming.
This paper introduces a novel approach that leverages large language models (LLMs), particularly GPT-4, to enhance decision-making processes in agricultural machinery management.
arXiv Detail & Related papers (2024-07-30T06:49:55Z) - AgriLLM: Harnessing Transformers for Farmer Queries [2.8592691160719554]
This work explores the transformative potential of Large Language Models (LLMs) in automating query resolution for agricultural farmers.
Using a subset of a vast dataset of real-world farmer queries collected in India, our study focuses on approximately 4 million queries from the state of Tamil Nadu.
arXiv Detail & Related papers (2024-06-21T07:37:41Z) - Application of Machine Learning in Agriculture: Recent Trends and Future Research Avenues [6.0460261046732455]
Food production is a vital global concern and the potential for an agritech revolution through artificial intelligence (AI) remains largely unexplored.
This paper presents a comprehensive review focused on the application of machine learning (ML) in agriculture, aiming to explore its transformative potential in farming practices and efficiency enhancement.
arXiv Detail & Related papers (2024-05-23T17:53:31Z) - A Survey on RAG Meeting LLMs: Towards Retrieval-Augmented Large Language Models [71.25225058845324]
Large Language Models (LLMs) have demonstrated revolutionary abilities in language understanding and generation.
Retrieval-Augmented Generation (RAG) can offer reliable and up-to-date external knowledge.
RA-LLMs have emerged to harness external and authoritative knowledge bases, rather than relying on the model's internal knowledge.
arXiv Detail & Related papers (2024-05-10T02:48:45Z) - GPT-4 as Evaluator: Evaluating Large Language Models on Pest Management in Agriculture [7.458004824488893]
The application of large language models (LLMs) in agriculture, particularly in pest management, remains nascent.
We aimed to prove the feasibility by evaluating the content of the pest management advice generated by LLMs, including the Generative Pre-trained Transformer (GPT) series from OpenAI and the FLAN series from Google.
We proposed an innovative approach, using GPT-4 as an evaluator, to score the generated content on Coherence, Logical Consistency, Fluency, Relevance, Comprehensibility, and Exhaustiveness.
arXiv Detail & Related papers (2024-03-18T15:08:01Z) - RAG vs Fine-tuning: Pipelines, Tradeoffs, and a Case Study on
Agriculture [2.4184993026516213]
We propose a pipeline for fine-tuning and RAG, and present the tradeoffs of both for popular Large Language Models.
Our results show the effectiveness of our dataset generation pipeline in capturing geographic-specific knowledge.
arXiv Detail & Related papers (2024-01-16T14:44:47Z) - LLaMA Beyond English: An Empirical Study on Language Capability Transfer [49.298360366468934]
We focus on how to effectively transfer the capabilities of language generation and following instructions to a non-English language.
We analyze the impact of key factors such as vocabulary extension, further pretraining, and instruction tuning on transfer.
We employ four widely used standardized testing benchmarks: C-Eval, MMLU, AGI-Eval, and GAOKAO-Bench.
arXiv Detail & Related papers (2024-01-02T06:29:02Z) - Large Language Models as Automated Aligners for benchmarking
Vision-Language Models [48.4367174400306]
Vision-Language Models (VLMs) have reached a new level of sophistication, showing notable competence in executing intricate cognition and reasoning tasks.
Existing evaluation benchmarks, primarily relying on rigid, hand-crafted datasets, face significant limitations in assessing the alignment of these increasingly anthropomorphic models with human intelligence.
In this work, we address the limitations via Auto-Bench, which delves into exploring LLMs as proficient curation, measuring the alignment betweenVLMs and human intelligence and value through automatic data curation and assessment.
arXiv Detail & Related papers (2023-11-24T16:12:05Z) - Is GPT-4 a Good Data Analyst? [67.35956981748699]
We consider GPT-4 as a data analyst to perform end-to-end data analysis with databases from a wide range of domains.
We design several task-specific evaluation metrics to systematically compare the performance between several professional human data analysts and GPT-4.
Experimental results show that GPT-4 can achieve comparable performance to humans.
arXiv Detail & Related papers (2023-05-24T11:26:59Z) - LLMs for Knowledge Graph Construction and Reasoning: Recent Capabilities and Future Opportunities [66.36633042421387]
Large Language Models (LLMs) for Knowledge Graph (KG) construction and reasoning evaluated.
We propose AutoKG, a multi-agent-based approach employing LLMs and external sources for KG construction and reasoning.
arXiv Detail & Related papers (2023-05-22T15:56:44Z) - Empowering Agrifood System with Artificial Intelligence: A Survey of the Progress, Challenges and Opportunities [86.89427012495457]
We review how AI techniques can transform agrifood systems and contribute to the modern agrifood industry.
We present a progress review of AI methods in agrifood systems, specifically in agriculture, animal husbandry, and fishery.
We highlight potential challenges and promising research opportunities for transforming modern agrifood systems with AI.
arXiv Detail & Related papers (2023-05-03T05:16:54Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.