GenSpectrum Chat: Data Exploration in Public Health Using Large Language
Models
- URL: http://arxiv.org/abs/2305.13821v1
- Date: Tue, 23 May 2023 08:43:43 GMT
- Title: GenSpectrum Chat: Data Exploration in Public Health Using Large Language
Models
- Authors: Chaoran Chen, Tanja Stadler
- Abstract summary: The COVID-19 pandemic highlighted the importance of making epidemiological data easily accessible and explorable.
We developed the "GenSpectrum Chat" which uses GPT-4 as the underlying large language model (LLM) to explore SARS-CoV-2 genomic sequencing data.
- Score: 2.9823962001574187
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: Introduction: The COVID-19 pandemic highlighted the importance of making
epidemiological data and scientific insights easily accessible and explorable
for public health agencies, the general public, and researchers.
State-of-the-art approaches for sharing data and insights included regularly
updated reports and web dashboards. However, they face a trade-off between the
simplicity and flexibility of data exploration. With the capabilities of recent
large language models (LLMs) such as GPT-4, this trade-off can be overcome.
Results: We developed the chatbot "GenSpectrum Chat"
(https://cov-spectrum.org/chat) which uses GPT-4 as the underlying large
language model (LLM) to explore SARS-CoV-2 genomic sequencing data. Out of 500
inputs from real-world users, the chatbot provided a correct answer for 453
prompts; an incorrect answer for 13 prompts, and no answer although the
question was within scope for 34 prompts. We also tested the chatbot with
inputs from 10 different languages, and despite being provided solely with
English instructions and examples, it successfully processed prompts in all
tested languages.
Conclusion: LLMs enable new ways of interacting with information systems. In
the field of public health, GenSpectrum Chat can facilitate the analysis of
real-time pathogen genomic data. With our chatbot supporting interactive
exploration in different languages, we envision quick and direct access to the
latest evidence for policymakers around the world.
Related papers
- "What's Up, Doc?": Analyzing How Users Seek Health Information in Large-Scale Conversational AI Datasets [6.459488580102546]
HealthChat-11K is a curated dataset of 11K real-world conversations composed of 25K user messages.<n>Our analysis reveals insights into the nature of how and why users seek health information.
arXiv Detail & Related papers (2025-06-26T17:52:18Z) - Customizable LLM-Powered Chatbot for Behavioral Science Research [6.084958172018792]
Large Language Models (LLMs) produce text that closely resembles human communication.
The potential utility of chatbots transcends traditional applications, particularly in research contexts.
In this study, we present a Customizable LLM-Powered (CLPC) system designed to assist in behavioral science research.
arXiv Detail & Related papers (2025-01-09T19:27:28Z) - In Generative AI we Trust: Can Chatbots Effectively Verify Political
Information? [39.58317527488534]
This article presents a comparative analysis of the ability of two large language model (LLM)-based chatbots, ChatGPT and Bing Chat, to detect veracity of political information.
We use AI auditing methodology to investigate how chatbots evaluate true, false, and borderline statements on five topics: COVID-19, Russian aggression against Ukraine, the Holocaust, climate change, and LGBTQ+ related debates.
The results show high performance of ChatGPT for the baseline veracity evaluation task, with 72 percent of the cases evaluated correctly on average across languages without pre-training.
arXiv Detail & Related papers (2023-12-20T15:17:03Z) - Comprehensive Assessment of Toxicity in ChatGPT [49.71090497696024]
We evaluate the toxicity in ChatGPT by utilizing instruction-tuning datasets.
prompts in creative writing tasks can be 2x more likely to elicit toxic responses.
Certain deliberately toxic prompts, designed in earlier studies, no longer yield harmful responses.
arXiv Detail & Related papers (2023-11-03T14:37:53Z) - Performance of ChatGPT on USMLE: Unlocking the Potential of Large
Language Models for AI-Assisted Medical Education [0.0]
This study determined how reliable ChatGPT can be for answering complex medical and clinical questions.
The paper evaluated the obtained results using a 2-way ANOVA and posthoc analysis.
ChatGPT-generated answers were found to be more context-oriented than regular Google search results.
arXiv Detail & Related papers (2023-06-30T19:53:23Z) - XrayGPT: Chest Radiographs Summarization using Medical Vision-Language
Models [60.437091462613544]
We introduce XrayGPT, a novel conversational medical vision-language model.
It can analyze and answer open-ended questions about chest radiographs.
We generate 217k interactive and high-quality summaries from free-text radiology reports.
arXiv Detail & Related papers (2023-06-13T17:59:59Z) - LLaVA-Med: Training a Large Language-and-Vision Assistant for
Biomedicine in One Day [85.19963303642427]
We propose a cost-efficient approach for training a vision-language conversational assistant that can answer open-ended research questions of biomedical images.
The model first learns to align biomedical vocabulary using the figure-caption pairs as is, then learns to master open-ended conversational semantics.
This enables us to train a Large Language and Vision Assistant for BioMedicine in less than 15 hours (with eight A100s)
arXiv Detail & Related papers (2023-06-01T16:50:07Z) - HuatuoGPT, towards Taming Language Model to Be a Doctor [67.96794664218318]
HuatuoGPT is a large language model (LLM) for medical consultation.
We leverage both textitdistilled data from ChatGPT and textitreal-world data from doctors in the supervised fine-tuned stage.
arXiv Detail & Related papers (2023-05-24T11:56:01Z) - ChatGPT Beyond English: Towards a Comprehensive Evaluation of Large
Language Models in Multilingual Learning [70.57126720079971]
Large language models (LLMs) have emerged as the most important breakthroughs in natural language processing (NLP)
This paper evaluates ChatGPT on 7 different tasks, covering 37 diverse languages with high, medium, low, and extremely low resources.
Compared to the performance of previous models, our extensive experimental results demonstrate a worse performance of ChatGPT for different NLP tasks and languages.
arXiv Detail & Related papers (2023-04-12T05:08:52Z) - To ChatGPT, or not to ChatGPT: That is the question! [78.407861566006]
This study provides a comprehensive and contemporary assessment of the most recent techniques in ChatGPT detection.
We have curated a benchmark dataset consisting of prompts from ChatGPT and humans, including diverse questions from medical, open Q&A, and finance domains.
Our evaluation results demonstrate that none of the existing methods can effectively detect ChatGPT-generated content.
arXiv Detail & Related papers (2023-04-04T03:04:28Z) - How Close is ChatGPT to Human Experts? Comparison Corpus, Evaluation,
and Detection [8.107721810172112]
ChatGPT is able to respond effectively to a wide range of human questions.
People are starting to worry about the potential negative impacts that large language models (LLMs) like ChatGPT could have on society.
In this work, we collected tens of thousands of comparison responses from both human experts and ChatGPT.
arXiv Detail & Related papers (2023-01-18T15:23:25Z) - ChatGPT Makes Medicine Easy to Swallow: An Exploratory Case Study on
Simplified Radiology Reports [0.4194454151396506]
ChatGPT is a language model capable of generating text that appears human-like and authentic.
We asked 15 radiologists to assess the quality of radiology reports simplified by ChatGPT.
Most radiologists agreed that the simplified reports were factually correct, complete, and not potentially harmful to the patient.
arXiv Detail & Related papers (2022-12-30T18:55:16Z) - A Cross-lingual Natural Language Processing Framework for Infodemic
Management [0.6606016007748989]
The COVID-19 pandemic has put immense pressure on health systems which are further strained due to misinformation surrounding it.
We have exploited the potential of Natural Language Processing for identifying relevant information that needs to be disseminated amongst the masses.
We present a novel Cross-lingual Natural Language Processing framework to provide relevant information by matching daily news with trusted guidelines from the World Health Organization.
arXiv Detail & Related papers (2020-10-30T16:26:35Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.