M^2-MedDialog: A Dataset and Benchmarks for Multi-domain Multi-service
Medical Dialogues
- URL: http://arxiv.org/abs/2109.00430v1
- Date: Wed, 1 Sep 2021 15:24:54 GMT
- Title: M^2-MedDialog: A Dataset and Benchmarks for Multi-domain Multi-service
Medical Dialogues
- Authors: Guojun Yan and Jiahuan Pei and Pengjie Ren and Zhumin Chen and
Zhaochun Ren and Huasheng Liang
- Abstract summary: Medical dialogue systems (MDSs) aim to assist doctors and patients with a range of professional medical services.
No dataset has so large-scale dialogues contains both multiple medical services and fine-grained medical labels.
We first build a Multiple-domain Multiple-service medical dialogue (M2-MedDialog)dataset, which contains 1,557 conversations between doctors and patients.
- Score: 25.58066103487436
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Medical dialogue systems (MDSs) aim to assist doctors and patients with a
range of professional medical services, i.e., diagnosis, consultation, and
treatment. However, one-stop MDS is still unexplored because: (1) no dataset
has so large-scale dialogues contains both multiple medical services and
fine-grained medical labels (i.e., intents, slots, values); (2) no model has
addressed a MDS based on multiple-service conversations in a unified framework.
In this work, we first build a Multiple-domain Multiple-service medical
dialogue (M^2-MedDialog)dataset, which contains 1,557 conversations between
doctors and patients, covering 276 types of diseases, 2,468 medical entities,
and 3 specialties of medical services. To the best of our knowledge, it is the
only medical dialogue dataset that includes both multiple medical services and
fine-grained medical labels. Then, we formulate a one-stop MDS as a
sequence-to-sequence generation problem. We unify a MDS with causal language
modeling and conditional causal language modeling, respectively. Specifically,
we employ several pretrained models (i.e., BERT-WWM, BERT-MED, GPT2, and MT5)
and their variants to get benchmarks on M^2-MedDialog dataset. We also propose
pseudo labeling and natural perturbation methods to expand M2-MedDialog dataset
and enhance the state-of-the-art pretrained models. We demonstrate the results
achieved by the benchmarks so far through extensive experiments on
M2-MedDialog. We release the dataset, the code, as well as the evaluation
scripts to facilitate future research in this important research direction.
Related papers
- MediTOD: An English Dialogue Dataset for Medical History Taking with Comprehensive Annotations [23.437292621092823]
We introduce MediTOD, a dataset of doctor-patient dialogues in English for the medical history-taking task.
We devise a questionnaire-based labeling scheme tailored to the medical domain.
Then, medical professionals create the dataset with high-quality comprehensive annotations.
arXiv Detail & Related papers (2024-10-18T06:38:22Z) - RuleAlign: Making Large Language Models Better Physicians with Diagnostic Rule Alignment [54.91736546490813]
We introduce the RuleAlign framework, designed to align Large Language Models with specific diagnostic rules.
We develop a medical dialogue dataset comprising rule-based communications between patients and physicians.
Experimental results demonstrate the effectiveness of the proposed approach.
arXiv Detail & Related papers (2024-08-22T17:44:40Z) - Towards Generalist Foundation Model for Radiology by Leveraging
Web-scale 2D&3D Medical Data [66.9359934608229]
This study aims to initiate the development of Radiology Foundation Model, termed as RadFM.
To the best of our knowledge, this is the first large-scale, high-quality, medical visual-language dataset, with both 2D and 3D scans.
We propose a new evaluation benchmark, RadBench, that comprises five tasks, including modality recognition, disease diagnosis, visual question answering, report generation and rationale diagnosis.
arXiv Detail & Related papers (2023-08-04T17:00:38Z) - Med-Flamingo: a Multimodal Medical Few-shot Learner [58.85676013818811]
We propose Med-Flamingo, a multimodal few-shot learner adapted to the medical domain.
Based on OpenFlamingo-9B, we continue pre-training on paired and interleaved medical image-text data from publications and textbooks.
We conduct the first human evaluation for generative medical VQA where physicians review the problems and blinded generations in an interactive app.
arXiv Detail & Related papers (2023-07-27T20:36:02Z) - Medical Dialogue Generation via Dual Flow Modeling [9.328694317877169]
Medical dialogue systems (MDS) aim to provide patients with medical services, such as diagnosis and prescription.
Previous studies mainly addressed this by extracting the mentioned medical entities as critical dialogue history information.
In this work, we argue that it is also essential to capture the transitions of the medical entities and the doctor's dialogue acts in each turn.
arXiv Detail & Related papers (2023-05-29T14:23:34Z) - CDialog: A Multi-turn Covid-19 Conversation Dataset for Entity-Aware
Dialog Generation [18.047064216849204]
We release a high-quality multi-turn Medical Dialog dataset relating to Covid-19 disease named CDialog.
We propose a novel neural medical dialog system based on the CDialog dataset to advance future research on developing automated medical dialog systems.
arXiv Detail & Related papers (2022-11-16T11:07:34Z) - "My nose is running.""Are you also coughing?": Building A Medical
Diagnosis Agent with Interpretable Inquiry Logics [80.55587329326046]
We propose a more interpretable decision process to implement the dialogue manager of DSMD.
We devise a model with highly transparent components to conduct the inference.
Experiments show that our method obtains 7.7%, 10.0%, 3.0% absolute improvement in diagnosis accuracy.
arXiv Detail & Related papers (2022-04-29T09:02:23Z) - MedDG: An Entity-Centric Medical Consultation Dataset for Entity-Aware
Medical Dialogue Generation [86.38736781043109]
We build and release a large-scale high-quality Medical Dialogue dataset related to 12 types of common Gastrointestinal diseases named MedDG.
We propose two kinds of medical dialogue tasks based on MedDG dataset. One is the next entity prediction and the other is the doctor response generation.
Experimental results show that the pre-train language models and other baselines struggle on both tasks with poor performance in our dataset.
arXiv Detail & Related papers (2020-10-15T03:34:33Z) - On the Generation of Medical Dialogues for COVID-19 [60.63485429268256]
People experiencing COVID19-related symptoms or exposed to risk factors have a pressing need to consult doctors.
Because of the shortage of medical professionals, many people cannot receive online consultations timely.
We aim to develop a medical dialogue system that can provide COVID19-related consultations.
arXiv Detail & Related papers (2020-05-11T21:23:43Z) - MedDialog: Two Large-scale Medical Dialogue Datasets [27.619211324563498]
We build two large-scale medical dialogue datasets: MedDialog-EN and MedDialog-CN.
MedDialog-EN is an English dataset containing 0.3 million conversations between patients and doctors and 0.5 million utterances.
MedDialog-CN is a Chinese dataset containing 1.1 million conversations and 4 million utterances.
arXiv Detail & Related papers (2020-04-07T13:07:09Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.