Automated Clinical Coding for Outpatient Departments
- URL: http://arxiv.org/abs/2312.13533v2
- Date: Sun, 24 Dec 2023 08:16:46 GMT
- Title: Automated Clinical Coding for Outpatient Departments
- Authors: Viktor Schlegel, Abhinav Ramesh Kashyap, Thanh-Tung Nguyen, Tsung-Han
Yang, Vijay Prakash Dwivedi, Wei-Hsian Yin, Jeng Wei, Stefan Winkler
- Abstract summary: This paper is the first to investigate how well state-of-the-art deep learning-based clinical coding approaches work in the outpatient setting at hospital scale.
We collect a large outpatient dataset comprising over 7 million notes documenting over half a million patients.
We adapt four state-of-the-art clinical coding approaches to this setting and evaluate their potential to assist coders.
- Score: 14.923343535929515
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Computerised clinical coding approaches aim to automate the process of
assigning a set of codes to medical records. While there is active research
pushing the state of the art on clinical coding for hospitalized patients, the
outpatient setting -- where doctors tend to non-hospitalised patients -- is
overlooked. Although both settings can be formalised as a multi-label
classification task, they present unique and distinct challenges, which raises
the question of whether the success of inpatient clinical coding approaches
translates to the outpatient setting. This paper is the first to investigate
how well state-of-the-art deep learning-based clinical coding approaches work
in the outpatient setting at hospital scale. To this end, we collect a large
outpatient dataset comprising over 7 million notes documenting over half a
million patients. We adapt four state-of-the-art clinical coding approaches to
this setting and evaluate their potential to assist coders. We find evidence
that clinical coding in outpatient settings can benefit from more innovations
in popular inpatient coding benchmarks. A deeper analysis of the factors
contributing to the success -- amount and form of data and choice of document
representation -- reveals the presence of easy-to-solve examples, the coding of
which can be completely automated with a low error rate.
Related papers
- Aligning AI Research with the Needs of Clinical Coding Workflows: Eight Recommendations Based on US Data Analysis and Critical Review [14.381199039813675]
This position paper aims to align AI coding research more closely with practical challenges of clinical coding.
Based on our analysis, we offer eight specific recommendations, suggesting ways to improve current evaluation methods.
arXiv Detail & Related papers (2024-12-23T23:39:05Z) - Large Language Models in the Clinic: A Comprehensive Benchmark [63.21278434331952]
We build a benchmark ClinicBench to better understand large language models (LLMs) in the clinic.
We first collect eleven existing datasets covering diverse clinical language generation, understanding, and reasoning tasks.
We then construct six novel datasets and clinical tasks that are complex but common in real-world practice.
We conduct an extensive evaluation of twenty-two LLMs under both zero-shot and few-shot settings.
arXiv Detail & Related papers (2024-04-25T15:51:06Z) - Exploring the Consistency, Quality and Challenges in Manual and
Automated Coding of Free-text Diagnoses from Hospital Outpatient Letters [4.743464936070594]
This work evaluates the quality and consistency of both manual and automated clinical coding of diagnoses from hospital outpatient letters.
A gold standard was constructed by a panel of clinicians from a subset of the annotated diagnoses.
Results indicate that humans slightly out-performed automated coding, while both performed notably better when there was only a single diagnosis contained in the free-text description.
arXiv Detail & Related papers (2023-11-17T20:32:24Z) - Medical Codes Prediction from Clinical Notes: From Human Coders to
Machines [0.21320960069210473]
Prediction of medical codes from clinical notes is a practical and essential need for every healthcare delivery organization.
The biggest challenge is directly identifying appropriate medical codes from several thousands of high-dimensional codes from unstructured free-text clinical notes.
Recent studies have shown the state-of-the-art code prediction results of full-fledged deep learning-based methods.
arXiv Detail & Related papers (2022-10-30T14:24:13Z) - Automated Clinical Coding: What, Why, and Where We Are? [17.086212195006894]
Clinical coding could potentially be supported by an automated system to improve the efficiency and accuracy of the process.
Our research reveals the gaps between the current deep learning-based approach applied to clinical coding and the need for explainability and consistency in real-world practice.
There is much to achieve to develop and deploy an AI-based automated system to support coding in the next five years and beyond.
arXiv Detail & Related papers (2022-03-21T16:17:38Z) - Clinical Outcome Prediction from Admission Notes using Self-Supervised
Knowledge Integration [55.88616573143478]
Outcome prediction from clinical text can prevent doctors from overlooking possible risks.
Diagnoses at discharge, procedures performed, in-hospital mortality and length-of-stay prediction are four common outcome prediction targets.
We propose clinical outcome pre-training to integrate knowledge about patient outcomes from multiple public sources.
arXiv Detail & Related papers (2021-02-08T10:26:44Z) - Inheritance-guided Hierarchical Assignment for Clinical Automatic
Diagnosis [50.15205065710629]
Clinical diagnosis, which aims to assign diagnosis codes for a patient based on the clinical note, plays an essential role in clinical decision-making.
We propose a novel framework to combine the inheritance-guided hierarchical assignment and co-occurrence graph propagation for clinical automatic diagnosis.
arXiv Detail & Related papers (2021-01-27T13:16:51Z) - Ensemble model for pre-discharge icd10 coding prediction [45.82374977939355]
We propose an ensemble model incorporating multiple clinical data sources for accurate code predictions.
We obtain multi-label classification accuracies of 0.73 and 0.58 for average precision, 0.56 and 0.35 for F1-scores and 0.71 and 0.4 accuracy in predicting principal diagnosis for inpatient and outpatient datasets respectively.
arXiv Detail & Related papers (2020-12-16T07:02:56Z) - Benchmarking Automated Clinical Language Simplification: Dataset,
Algorithm, and Evaluation [48.87254340298189]
We construct a new dataset named MedLane to support the development and evaluation of automated clinical language simplification approaches.
We propose a new model called DECLARE that follows the human annotation procedure and achieves state-of-the-art performance.
arXiv Detail & Related papers (2020-12-04T06:09:02Z) - BiteNet: Bidirectional Temporal Encoder Network to Predict Medical
Outcomes [53.163089893876645]
We propose a novel self-attention mechanism that captures the contextual dependency and temporal relationships within a patient's healthcare journey.
An end-to-end bidirectional temporal encoder network (BiteNet) then learns representations of the patient's journeys.
We have evaluated the effectiveness of our methods on two supervised prediction and two unsupervised clustering tasks with a real-world EHR dataset.
arXiv Detail & Related papers (2020-09-24T00:42:36Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.