Semi-Supervised Variational Reasoning for Medical Dialogue Generation
- URL: http://arxiv.org/abs/2105.06071v1
- Date: Thu, 13 May 2021 04:14:35 GMT
- Title: Semi-Supervised Variational Reasoning for Medical Dialogue Generation
- Authors: Dongdong Li, Zhaochun Ren, Pengjie Ren, Zhumin Chen, Miao Fan, Jun Ma,
Maarten de Rijke
- Abstract summary: Two key characteristics are relevant for medical dialogue generation: patient states and physician actions.
We propose an end-to-end variational reasoning approach to medical dialogue generation.
A physician policy network composed of an action-classifier and two reasoning detectors is proposed for augmented reasoning ability.
- Score: 70.838542865384
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Medical dialogue generation aims to provide automatic and accurate responses
to assist physicians to obtain diagnosis and treatment suggestions in an
efficient manner. In medical dialogues two key characteristics are relevant for
response generation: patient states (such as symptoms, medication) and
physician actions (such as diagnosis, treatments). In medical scenarios
large-scale human annotations are usually not available, due to the high costs
and privacy requirements. Hence, current approaches to medical dialogue
generation typically do not explicitly account for patient states and physician
actions, and focus on implicit representation instead. We propose an end-to-end
variational reasoning approach to medical dialogue generation. To be able to
deal with a limited amount of labeled data, we introduce both patient state and
physician action as latent variables with categorical priors for explicit
patient state tracking and physician policy learning, respectively. We propose
a variational Bayesian generative approach to approximate posterior
distributions over patient states and physician actions. We use an efficient
stochastic gradient variational Bayes estimator to optimize the derived
evidence lower bound, where a 2-stage collapsed inference method is proposed to
reduce the bias during model training. A physician policy network composed of
an action-classifier and two reasoning detectors is proposed for augmented
reasoning ability. We conduct experiments on three datasets collected from
medical platforms. Our experimental results show that the proposed method
outperforms state-of-the-art baselines in terms of objective and subjective
evaluation metrics. Our experiments also indicate that our proposed
semi-supervised reasoning method achieves a comparable performance as
state-of-the-art fully supervised learning baselines for physician policy
learning.
Related papers
- Robust and Interpretable Medical Image Classifiers via Concept
Bottleneck Models [49.95603725998561]
We propose a new paradigm to build robust and interpretable medical image classifiers with natural language concepts.
Specifically, we first query clinical concepts from GPT-4, then transform latent image features into explicit concepts with a vision-language model.
arXiv Detail & Related papers (2023-10-04T21:57:09Z) - Generating medically-accurate summaries of patient-provider dialogue: A
multi-stage approach using large language models [6.252236971703546]
An effective summary is required to be coherent and accurately capture all the medically relevant information in the dialogue.
This paper tackles the problem of medical conversation summarization by discretizing the task into several smaller dialogue-understanding tasks.
arXiv Detail & Related papers (2023-05-10T08:48:53Z) - Clinically-Inspired Multi-Agent Transformers for Disease Trajectory
Forecasting from Multimodal Data [13.766035805437847]
We formulate the prognosis prediction problem as a one-to-many prediction problem.
Inspired by a clinical decision-making process with two agents, we predict prognosis with two transformer-based components.
We show the effectiveness of our approach in predicting the development of structural knee osteoarthritis changes and forecasting Alzheimer's disease clinical status directly from raw multi-modal data.
arXiv Detail & Related papers (2022-10-25T10:16:42Z) - Towards Trustworthy Automatic Diagnosis Systems by Emulating Doctors'
Reasoning with Deep Reinforcement Learning [2.314562406457073]
We propose to model the evidence acquisition and automatic diagnosis tasks using a deep reinforcement learning framework.
We show that our approach performs better than existing models while maintaining competitive pathology prediction accuracy.
arXiv Detail & Related papers (2022-10-13T17:17:17Z) - Conditional Generation Net for Medication Recommendation [73.09366442098339]
Medication recommendation targets to provide a proper set of medicines according to patients' diagnoses, which is a critical task in clinics.
We propose Conditional Generation Net (COGNet) which introduces a novel copy-or-predict mechanism to generate the set of medicines.
We validate the proposed model on the public MIMIC data set, and the experimental results show that the proposed model can outperform state-of-the-art approaches.
arXiv Detail & Related papers (2022-02-14T10:16:41Z) - Clinical Outcome Prediction from Admission Notes using Self-Supervised
Knowledge Integration [55.88616573143478]
Outcome prediction from clinical text can prevent doctors from overlooking possible risks.
Diagnoses at discharge, procedures performed, in-hospital mortality and length-of-stay prediction are four common outcome prediction targets.
We propose clinical outcome pre-training to integrate knowledge about patient outcomes from multiple public sources.
arXiv Detail & Related papers (2021-02-08T10:26:44Z) - MedDG: An Entity-Centric Medical Consultation Dataset for Entity-Aware
Medical Dialogue Generation [86.38736781043109]
We build and release a large-scale high-quality Medical Dialogue dataset related to 12 types of common Gastrointestinal diseases named MedDG.
We propose two kinds of medical dialogue tasks based on MedDG dataset. One is the next entity prediction and the other is the doctor response generation.
Experimental results show that the pre-train language models and other baselines struggle on both tasks with poor performance in our dataset.
arXiv Detail & Related papers (2020-10-15T03:34:33Z) - BiteNet: Bidirectional Temporal Encoder Network to Predict Medical
Outcomes [53.163089893876645]
We propose a novel self-attention mechanism that captures the contextual dependency and temporal relationships within a patient's healthcare journey.
An end-to-end bidirectional temporal encoder network (BiteNet) then learns representations of the patient's journeys.
We have evaluated the effectiveness of our methods on two supervised prediction and two unsupervised clustering tasks with a real-world EHR dataset.
arXiv Detail & Related papers (2020-09-24T00:42:36Z) - Dr. Summarize: Global Summarization of Medical Dialogue by Exploiting
Local Structures [4.3012765978447565]
We present a novel approach to medical conversation summarization that leverages the unique and independent local structures created when gathering a patient's medical history.
Our approach is preferred on twice the number of pointers to the baseline pointer generator model and captures most or all of the information in 80% of the conversations.
arXiv Detail & Related papers (2020-09-18T07:35:44Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.