Improving the Factual Accuracy of Abstractive Clinical Text
Summarization using Multi-Objective Optimization
- URL: http://arxiv.org/abs/2204.00797v1
- Date: Sat, 2 Apr 2022 07:59:28 GMT
- Title: Improving the Factual Accuracy of Abstractive Clinical Text
Summarization using Multi-Objective Optimization
- Authors: Amanuel Alambo, Tanvi Banerjee, Krishnaprasad Thirunarayan, Mia Cajita
- Abstract summary: We propose a framework for improving the factual accuracy of abstractive summarization of clinical text using knowledge-guided multi-objective optimization.
In this study, we propose a framework for improving the factual accuracy of abstractive summarization of clinical text using knowledge-guided multi-objective optimization.
- Score: 3.977582258550673
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: While there has been recent progress in abstractive summarization as applied
to different domains including news articles, scientific articles, and blog
posts, the application of these techniques to clinical text summarization has
been limited. This is primarily due to the lack of large-scale training data
and the messy/unstructured nature of clinical notes as opposed to other domains
where massive training data come in structured or semi-structured form.
Further, one of the least explored and critical components of clinical text
summarization is factual accuracy of clinical summaries. This is specifically
crucial in the healthcare domain, cardiology in particular, where an accurate
summary generation that preserves the facts in the source notes is critical to
the well-being of a patient. In this study, we propose a framework for
improving the factual accuracy of abstractive summarization of clinical text
using knowledge-guided multi-objective optimization. We propose to jointly
optimize three cost functions in our proposed architecture during training:
generative loss, entity loss and knowledge loss and evaluate the proposed
architecture on 1) clinical notes of patients with heart failure (HF), which we
collect for this study; and 2) two benchmark datasets, Indiana University Chest
X-ray collection (IU X-Ray), and MIMIC-CXR, that are publicly available. We
experiment with three transformer encoder-decoder architectures and demonstrate
that optimizing different loss functions leads to improved performance in terms
of entity-level factual accuracy.
Related papers
- Adapting Abstract Meaning Representation Parsing to the Clinical Narrative -- the SPRING THYME parser [6.121530113520979]
This paper is dedicated to the design and evaluation of the first AMR tailored for clinical notes.
We adapted a state-of-the-art AMR utilizing continuous training.
Our approach incorporates data augmentation techniques to enhance the accuracy of AMR structure predictions.
arXiv Detail & Related papers (2024-05-15T07:32:43Z) - Attribute Structuring Improves LLM-Based Evaluation of Clinical Text
Summaries [62.32403630651586]
Large language models (LLMs) have shown the potential to generate accurate clinical text summaries, but still struggle with issues regarding grounding and evaluation.
Here, we explore a general mitigation framework using Attribute Structuring (AS), which structures the summary evaluation process.
AS consistently improves the correspondence between human annotations and automated metrics in clinical text summarization.
arXiv Detail & Related papers (2024-03-01T21:59:03Z) - XAI for In-hospital Mortality Prediction via Multimodal ICU Data [57.73357047856416]
We propose an efficient, explainable AI solution for predicting in-hospital mortality via multimodal ICU data.
We employ multimodal learning in our framework, which can receive heterogeneous inputs from clinical data and make decisions.
Our framework can be easily transferred to other clinical tasks, which facilitates the discovery of crucial factors in healthcare research.
arXiv Detail & Related papers (2023-12-29T14:28:04Z) - Self-Verification Improves Few-Shot Clinical Information Extraction [73.6905567014859]
Large language models (LLMs) have shown the potential to accelerate clinical curation via few-shot in-context learning.
They still struggle with issues regarding accuracy and interpretability, especially in mission-critical domains such as health.
Here, we explore a general mitigation framework using self-verification, which leverages the LLM to provide provenance for its own extraction and check its own outputs.
arXiv Detail & Related papers (2023-05-30T22:05:11Z) - Extrinsic Factors Affecting the Accuracy of Biomedical NER [0.1529342790344802]
Biomedical named entity recognition (NER) is a critial task that aims to identify structured information in clinical text.
NER in the biomedical domain is challenging due to limited data availability.
arXiv Detail & Related papers (2023-05-29T15:29:49Z) - Development and validation of a natural language processing algorithm to
pseudonymize documents in the context of a clinical data warehouse [53.797797404164946]
The study highlights the difficulties faced in sharing tools and resources in this domain.
We annotated a corpus of clinical documents according to 12 types of identifying entities.
We build a hybrid system, merging the results of a deep learning model as well as manual rules.
arXiv Detail & Related papers (2023-03-23T17:17:46Z) - A Multimodal Transformer: Fusing Clinical Notes with Structured EHR Data
for Interpretable In-Hospital Mortality Prediction [8.625186194860696]
We provide a novel multimodal transformer to fuse clinical notes and structured EHR data for better prediction of in-hospital mortality.
To improve interpretability, we propose an integrated gradients (IG) method to select important words in clinical notes.
We also investigate the significance of domain adaptive pretraining and task adaptive fine-tuning on the Clinical BERT.
arXiv Detail & Related papers (2022-08-09T03:49:52Z) - Self-supervised Answer Retrieval on Clinical Notes [68.87777592015402]
We introduce CAPR, a rule-based self-supervision objective for training Transformer language models for domain-specific passage matching.
We apply our objective in four Transformer-based architectures: Contextual Document Vectors, Bi-, Poly- and Cross-encoders.
We report that CAPR outperforms strong baselines in the retrieval of domain-specific passages and effectively generalizes across rule-based and human-labeled passages.
arXiv Detail & Related papers (2021-08-02T10:42:52Z) - A Meta-embedding-based Ensemble Approach for ICD Coding Prediction [64.42386426730695]
International Classification of Diseases (ICD) are the de facto codes used globally for clinical coding.
These codes enable healthcare providers to claim reimbursement and facilitate efficient storage and retrieval of diagnostic information.
Our proposed approach enhances the performance of neural models by effectively training word vectors using routine medical data as well as external knowledge from scientific articles.
arXiv Detail & Related papers (2021-02-26T17:49:58Z) - Drug and Disease Interpretation Learning with Biomedical Entity
Representation Transformer [9.152161078854146]
Concept normalization in free-form texts is a crucial step in every text-mining pipeline.
We propose a simple and effective two-stage neural approach based on fine-tuned BERT architectures.
arXiv Detail & Related papers (2021-01-22T20:01:25Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.