Comparative Analysis of Drug-GPT and ChatGPT LLMs for Healthcare
Insights: Evaluating Accuracy and Relevance in Patient and HCP Contexts
- URL: http://arxiv.org/abs/2307.16850v1
- Date: Mon, 24 Jul 2023 19:27:11 GMT
- Title: Comparative Analysis of Drug-GPT and ChatGPT LLMs for Healthcare
Insights: Evaluating Accuracy and Relevance in Patient and HCP Contexts
- Authors: Giorgos Lysandrou, Roma English Owen, Kirsty Mursec, Grant Le Brun,
Elizabeth A. L. Fairley
- Abstract summary: This study presents a comparative analysis of three Generative Pre-trained Transformer (GPT) solutions in a question and answer (Q&A) setting.
The objective is to determine which model delivers the most accurate and relevant information in response to prompts related to patient experiences with atopic dermatitis (AD) and healthcare professional (HCP) discussions about diabetes.
- Score: 0.0
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: This study presents a comparative analysis of three Generative Pre-trained
Transformer (GPT) solutions in a question and answer (Q&A) setting: Drug-GPT 3,
Drug-GPT 4, and ChatGPT, in the context of healthcare applications. The
objective is to determine which model delivers the most accurate and relevant
information in response to prompts related to patient experiences with atopic
dermatitis (AD) and healthcare professional (HCP) discussions about diabetes.
The results demonstrate that while all three models are capable of generating
relevant and accurate responses, Drug-GPT 3 and Drug-GPT 4, which are supported
by curated datasets of patient and HCP social media and message board posts,
provide more targeted and in-depth insights. ChatGPT, a more general-purpose
model, generates broader and more general responses, which may be valuable for
readers seeking a high-level understanding of the topics but may lack the depth
and personal insights found in the answers generated by the specialized
Drug-GPT models. This comparative analysis highlights the importance of
considering the language model's perspective, depth of knowledge, and currency
when evaluating the usefulness of generated information in healthcare
applications.
Related papers
- Enhancing Health Information Retrieval with RAG by Prioritizing Topical Relevance and Factual Accuracy [0.7673339435080445]
This paper introduces a solution driven by Retrieval-Augmented Generation (RAG) to enhance the retrieval of health-related documents grounded in scientific evidence.
In particular, we propose a three-stage model: in the first stage, the user's query is employed to retrieve topically relevant passages with associated references from a knowledge base constituted by scientific literature.
In the second stage, these passages, alongside the initial query, are processed by LLMs to generate a contextually relevant rich text (GenText)
In the last stage, the documents to be retrieved are evaluated and ranked both from the point of
arXiv Detail & Related papers (2025-02-07T05:19:13Z) - Conversation AI Dialog for Medicare powered by Finetuning and Retrieval Augmented Generation [0.0]
Large language models (LLMs) have shown impressive capabilities in natural language processing tasks, including dialogue generation.
This research aims to conduct a novel comparative analysis of two prominent techniques, fine-tuning with LoRA and the Retrieval-Augmented Generation framework.
arXiv Detail & Related papers (2025-02-04T11:50:40Z) - Reasoning-Enhanced Healthcare Predictions with Knowledge Graph Community Retrieval [61.70489848327436]
KARE is a novel framework that integrates knowledge graph (KG) community-level retrieval with large language models (LLMs) reasoning.
Extensive experiments demonstrate that KARE outperforms leading models by up to 10.8-15.0% on MIMIC-III and 12.6-12.7% on MIMIC-IV for mortality and readmission predictions.
arXiv Detail & Related papers (2024-10-06T18:46:28Z) - A Systematic Evaluation of GPT-4V's Multimodal Capability for Medical
Image Analysis [87.25494411021066]
GPT-4V's multimodal capability for medical image analysis is evaluated.
It is found that GPT-4V excels in understanding medical images and generates high-quality radiology reports.
It is found that its performance for medical visual grounding needs to be substantially improved.
arXiv Detail & Related papers (2023-10-31T11:39:09Z) - Clairvoyance: A Pipeline Toolkit for Medical Time Series [95.22483029602921]
Time-series learning is the bread and butter of data-driven *clinical decision support*
Clairvoyance proposes a unified, end-to-end, autoML-friendly pipeline that serves as a software toolkit.
Clairvoyance is the first to demonstrate viability of a comprehensive and automatable pipeline for clinical time-series ML.
arXiv Detail & Related papers (2023-10-28T12:08:03Z) - Navigating Healthcare Insights: A Birds Eye View of Explainability with
Knowledge Graphs [0.0]
Knowledge graphs (KGs) are gaining prominence in Healthcare AI, especially in drug discovery and pharmaceutical research.
This overview summarizes recent literature on the impact of KGs in healthcare and their role in developing explainable AI models.
We emphasize the importance of making KGs more interpretable through knowledge-infused learning in healthcare.
arXiv Detail & Related papers (2023-09-28T16:57:03Z) - SynerGPT: In-Context Learning for Personalized Drug Synergy Prediction
and Drug Design [64.69434941796904]
We propose a novel setting and models for in-context drug synergy learning.
We are given a small "personalized dataset" of 10-20 drug synergy relationships in the context of specific cancer cell targets.
Our goal is to predict additional drug synergy relationships in that context.
arXiv Detail & Related papers (2023-06-19T17:03:46Z) - Informing clinical assessment by contextualizing post-hoc explanations
of risk prediction models in type-2 diabetes [50.8044927215346]
We consider a comorbidity risk prediction scenario and focus on contexts regarding the patients clinical state.
We employ several state-of-the-art LLMs to present contexts around risk prediction model inferences and evaluate their acceptability.
Our paper is one of the first end-to-end analyses identifying the feasibility and benefits of contextual explanations in a real-world clinical use case.
arXiv Detail & Related papers (2023-02-11T18:07:11Z) - Self-supervised Answer Retrieval on Clinical Notes [68.87777592015402]
We introduce CAPR, a rule-based self-supervision objective for training Transformer language models for domain-specific passage matching.
We apply our objective in four Transformer-based architectures: Contextual Document Vectors, Bi-, Poly- and Cross-encoders.
We report that CAPR outperforms strong baselines in the retrieval of domain-specific passages and effectively generalizes across rule-based and human-labeled passages.
arXiv Detail & Related papers (2021-08-02T10:42:52Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.