CDialog: A Multi-turn Covid-19 Conversation Dataset for Entity-Aware
Dialog Generation
- URL: http://arxiv.org/abs/2212.06049v1
- Date: Wed, 16 Nov 2022 11:07:34 GMT
- Title: CDialog: A Multi-turn Covid-19 Conversation Dataset for Entity-Aware
Dialog Generation
- Authors: Deeksha Varshney, Aizan Zafar, Niranshu Kumar Behra and Asif Ekbal
- Abstract summary: We release a high-quality multi-turn Medical Dialog dataset relating to Covid-19 disease named CDialog.
We propose a novel neural medical dialog system based on the CDialog dataset to advance future research on developing automated medical dialog systems.
- Score: 18.047064216849204
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: The development of conversational agents to interact with patients and
deliver clinical advice has attracted the interest of many researchers,
particularly in light of the COVID-19 pandemic. The training of an end-to-end
neural based dialog system, on the other hand, is hampered by a lack of
multi-turn medical dialog corpus. We make the very first attempt to release a
high-quality multi-turn Medical Dialog dataset relating to Covid-19 disease
named CDialog, with over 1K conversations collected from the online medical
counselling websites. We annotate each utterance of the conversation with seven
different categories of medical entities, including diseases, symptoms, medical
tests, medical history, remedies, medications and other aspects as additional
labels. Finally, we propose a novel neural medical dialog system based on the
CDialog dataset to advance future research on developing automated medical
dialog systems. We use pre-trained language models for dialogue generation,
incorporating annotated medical entities, to generate a virtual doctor's
response that addresses the patient's query. Experimental results show that the
proposed dialog models perform comparably better when supplemented with entity
information and hence can improve the response quality.
Related papers
- RaDialog: A Large Vision-Language Model for Radiology Report Generation
and Conversational Assistance [53.20640629352422]
Conversational AI tools can generate and discuss clinically correct radiology reports for a given medical image.
RaDialog is the first thoroughly evaluated and publicly available large vision-language model for radiology report generation and interactive dialog.
Our method achieves state-of-the-art clinical correctness in report generation and shows impressive abilities in interactive tasks such as correcting reports and answering questions.
arXiv Detail & Related papers (2023-11-30T16:28:40Z) - MidMed: Towards Mixed-Type Dialogues for Medical Consultation [12.676937863407542]
Most medical dialogue systems assume that patients have clear goals (medicine querying, surgical operation querying, etc.) before medical consultation.
Due to the lack of medical knowledge, it is usually difficult for patients to determine clear goals with all necessary slots.
We propose a novel task and create a human-to-human mixed-type medical consultation dialogue corpus, termed MidMed.
arXiv Detail & Related papers (2023-06-05T14:36:31Z) - Medical Dialogue Generation via Dual Flow Modeling [9.328694317877169]
Medical dialogue systems (MDS) aim to provide patients with medical services, such as diagnosis and prescription.
Previous studies mainly addressed this by extracting the mentioned medical entities as critical dialogue history information.
In this work, we argue that it is also essential to capture the transitions of the medical entities and the doctor's dialogue acts in each turn.
arXiv Detail & Related papers (2023-05-29T14:23:34Z) - HybriDialogue: An Information-Seeking Dialogue Dataset Grounded on
Tabular and Textual Data [87.67278915655712]
We present a new dialogue dataset, HybriDialogue, which consists of crowdsourced natural conversations grounded on both Wikipedia text and tables.
The conversations are created through the decomposition of complex multihop questions into simple, realistic multiturn dialogue interactions.
arXiv Detail & Related papers (2022-04-28T00:52:16Z) - A Benchmark for Automatic Medical Consultation System: Frameworks, Tasks
and Datasets [70.32630628211803]
We propose two frameworks to support automatic medical consultation, namely doctor-patient dialogue understanding and task-oriented interaction.
A new large medical dialogue dataset with multi-level fine-grained annotations is introduced.
We report a set of benchmark results for each task, which shows the usability of the dataset and sets a baseline for future studies.
arXiv Detail & Related papers (2022-04-19T16:43:21Z) - MedDG: An Entity-Centric Medical Consultation Dataset for Entity-Aware
Medical Dialogue Generation [86.38736781043109]
We build and release a large-scale high-quality Medical Dialogue dataset related to 12 types of common Gastrointestinal diseases named MedDG.
We propose two kinds of medical dialogue tasks based on MedDG dataset. One is the next entity prediction and the other is the doctor response generation.
Experimental results show that the pre-train language models and other baselines struggle on both tasks with poor performance in our dataset.
arXiv Detail & Related papers (2020-10-15T03:34:33Z) - On the Generation of Medical Dialogues for COVID-19 [60.63485429268256]
People experiencing COVID19-related symptoms or exposed to risk factors have a pressing need to consult doctors.
Because of the shortage of medical professionals, many people cannot receive online consultations timely.
We aim to develop a medical dialogue system that can provide COVID19-related consultations.
arXiv Detail & Related papers (2020-05-11T21:23:43Z) - Towards Conversational Recommendation over Multi-Type Dialogs [78.52354759386296]
We propose a new task of conversational recommendation over multi-type dialogs, where the bots can proactively and naturally lead a conversation from a non-recommendation dialog to a recommendation dialog.
To facilitate the study of this task, we create a human-to-human Chinese dialog dataset emphDuRecDial (about 10k dialogs, 156k utterances)
In each dialog, the recommender proactively leads a multi-type dialog to approach recommendation targets and then makes multiple recommendations with rich interaction behavior.
arXiv Detail & Related papers (2020-05-08T11:01:21Z) - MedDialog: Two Large-scale Medical Dialogue Datasets [27.619211324563498]
We build two large-scale medical dialogue datasets: MedDialog-EN and MedDialog-CN.
MedDialog-EN is an English dataset containing 0.3 million conversations between patients and doctors and 0.5 million utterances.
MedDialog-CN is a Chinese dataset containing 1.1 million conversations and 4 million utterances.
arXiv Detail & Related papers (2020-04-07T13:07:09Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.