"Nothing Abnormal": Disambiguating Medical Reports via Contrastive
Knowledge Infusion
- URL: http://arxiv.org/abs/2305.08300v1
- Date: Mon, 15 May 2023 02:01:20 GMT
- Title: "Nothing Abnormal": Disambiguating Medical Reports via Contrastive
Knowledge Infusion
- Authors: Zexue He, An Yan, Amilcare Gentili, Julian McAuley, Chun-Nan Hsu
- Abstract summary: We propose a rewriting algorithm based on contrastive pretraining and perturbation-based rewriting.
We create two datasets, OpenI-Annotated based on chest reports and VA-Annotated based on general medical reports.
Our proposed algorithm effectively rewrites input sentences in a less ambiguous way with high content fidelity.
- Score: 6.9551174393701345
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Sharing medical reports is essential for patient-centered care. A recent line
of work has focused on automatically generating reports with NLP methods.
However, different audiences have different purposes when writing/reading
medical reports -- for example, healthcare professionals care more about
pathology, whereas patients are more concerned with the diagnosis ("Is there
any abnormality?"). The expectation gap results in a common situation where
patients find their medical reports to be ambiguous and therefore unsure about
the next steps. In this work, we explore the audience expectation gap in
healthcare and summarize common ambiguities that lead patients to be confused
about their diagnosis into three categories: medical jargon, contradictory
findings, and misleading grammatical errors. Based on our analysis, we define a
disambiguation rewriting task to regenerate an input to be unambiguous while
preserving information about the original content. We further propose a
rewriting algorithm based on contrastive pretraining and perturbation-based
rewriting. In addition, we create two datasets, OpenI-Annotated based on chest
reports and VA-Annotated based on general medical reports, with available
binary labels for ambiguity and abnormality presence annotated by radiology
specialists. Experimental results on these datasets show that our proposed
algorithm effectively rewrites input sentences in a less ambiguous way with
high content fidelity. Our code and annotated data are released to facilitate
future research.
Related papers
- Contrastive Learning with Counterfactual Explanations for Radiology Report Generation [83.30609465252441]
We propose a textbfCountertextbfFactual textbfExplanations-based framework (CoFE) for radiology report generation.
Counterfactual explanations serve as a potent tool for understanding how decisions made by algorithms can be changed by asking what if'' scenarios.
Experiments on two benchmarks demonstrate that leveraging the counterfactual explanations enables CoFE to generate semantically coherent and factually complete reports.
arXiv Detail & Related papers (2024-07-19T17:24:25Z) - Predicting Unplanned Readmissions in the Intensive Care Unit: A
Multimodality Evaluation [2.2559617939136505]
A hospital readmission is when a patient who was discharged from the hospital is admitted again for the same or related care within a certain period.
We use state-of-the-art machine learning approaches in time-series analysis and natural language processing to predict Unplanned Readmissions in ICUs.
arXiv Detail & Related papers (2023-05-14T12:20:13Z) - Weakly Supervised Contrastive Learning for Chest X-Ray Report Generation [3.3978173451092437]
Radiology report generation aims at generating descriptive text from radiology images automatically.
A typical setting consists of training encoder-decoder models on image-report pairs with a cross entropy loss.
We propose a novel weakly supervised contrastive loss for medical report generation.
arXiv Detail & Related papers (2021-09-25T00:06:23Z) - MIMO: Mutual Integration of Patient Journey and Medical Ontology for
Healthcare Representation Learning [49.57261599776167]
We propose an end-to-end robust Transformer-based solution, Mutual Integration of patient journey and Medical Ontology (MIMO) for healthcare representation learning and predictive analytics.
arXiv Detail & Related papers (2021-07-20T07:04:52Z) - Semi-Supervised Variational Reasoning for Medical Dialogue Generation [70.838542865384]
Two key characteristics are relevant for medical dialogue generation: patient states and physician actions.
We propose an end-to-end variational reasoning approach to medical dialogue generation.
A physician policy network composed of an action-classifier and two reasoning detectors is proposed for augmented reasoning ability.
arXiv Detail & Related papers (2021-05-13T04:14:35Z) - Interpretable bias mitigation for textual data: Reducing gender bias in
patient notes while maintaining classification performance [0.11545092788508224]
We identify and remove gendered language from two clinical-note datasets.
We show minimal degradation in health condition classification tasks for low- to medium-levels of bias removal via data augmentation.
This work outlines an interpretable approach for using data augmentation to identify and reduce the potential for bias in natural language processing pipelines.
arXiv Detail & Related papers (2021-03-10T03:09:30Z) - Unifying Relational Sentence Generation and Retrieval for Medical Image
Report Composition [142.42920413017163]
Current methods often generate the most common sentences due to dataset bias for individual case.
We propose a novel framework that unifies template retrieval and sentence generation to handle both common and rare abnormality.
arXiv Detail & Related papers (2021-01-09T04:33:27Z) - BiteNet: Bidirectional Temporal Encoder Network to Predict Medical
Outcomes [53.163089893876645]
We propose a novel self-attention mechanism that captures the contextual dependency and temporal relationships within a patient's healthcare journey.
An end-to-end bidirectional temporal encoder network (BiteNet) then learns representations of the patient's journeys.
We have evaluated the effectiveness of our methods on two supervised prediction and two unsupervised clustering tasks with a real-world EHR dataset.
arXiv Detail & Related papers (2020-09-24T00:42:36Z) - Extracting Structured Data from Physician-Patient Conversations By
Predicting Noteworthy Utterances [39.888619005843246]
We describe a new dataset consisting of conversation transcripts, post-visit summaries, corresponding supporting evidence (in the transcript), and structured labels.
One methodological challenge is that the conversations are long (around 1500 words) making it difficult for modern deep-learning models to use them as input.
We find that by first filtering for (predicted) noteworthy utterances, we can significantly boost predictive performance for recognizing both diagnoses and RoS abnormalities.
arXiv Detail & Related papers (2020-07-14T16:10:37Z) - Auxiliary Signal-Guided Knowledge Encoder-Decoder for Medical Report
Generation [107.3538598876467]
We propose an Auxiliary Signal-Guided Knowledge-Decoder (ASGK) to mimic radiologists' working patterns.
ASGK integrates internal visual feature fusion and external medical linguistic information to guide medical knowledge transfer and learning.
arXiv Detail & Related papers (2020-06-06T01:00:15Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.