Modeling Diagnostic Label Correlation for Automatic ICD Coding
- URL: http://arxiv.org/abs/2106.12800v1
- Date: Thu, 24 Jun 2021 07:26:30 GMT
- Title: Modeling Diagnostic Label Correlation for Automatic ICD Coding
- Authors: Shang-Chi Tsai, Chao-Wei Huang, Yun-Nung Chen
- Abstract summary: We propose a two-stage framework to improve automatic ICD coding by capturing the label correlation.
Specifically, we train a label set distribution estimator to rescore the probability of each label set candidate.
In the experiments, our proposed framework is able to improve upon best-performing predictors on the benchmark MIMIC datasets.
- Score: 37.79764232289666
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Given the clinical notes written in electronic health records (EHRs), it is
challenging to predict the diagnostic codes which is formulated as a
multi-label classification task. The large set of labels, the hierarchical
dependency, and the imbalanced data make this prediction task extremely hard.
Most existing work built a binary prediction for each label independently,
ignoring the dependencies between labels. To address this problem, we propose a
two-stage framework to improve automatic ICD coding by capturing the label
correlation. Specifically, we train a label set distribution estimator to
rescore the probability of each label set candidate generated by a base
predictor. This paper is the first attempt at learning the label set
distribution as a reranking module for medical code prediction. In the
experiments, our proposed framework is able to improve upon best-performing
predictors on the benchmark MIMIC datasets. The source code of this project is
available at https://github.com/MiuLab/ICD-Correlation.
Related papers
- Label Matching Semi-Supervised Object Detection [85.99282969977541]
Semi-supervised object detection has made significant progress with the development of mean teacher driven self-training.
Label mismatch problem is not yet fully explored in the previous works, leading to severe confirmation bias during self-training.
We propose a simple yet effective LabelMatch framework from two different yet complementary perspectives.
arXiv Detail & Related papers (2022-06-14T05:59:41Z) - Instance-Dependent Partial Label Learning [69.49681837908511]
Partial label learning is a typical weakly supervised learning problem.
Most existing approaches assume that the incorrect labels in each training example are randomly picked as the candidate labels.
In this paper, we consider instance-dependent and assume that each example is associated with a latent label distribution constituted by the real number of each label.
arXiv Detail & Related papers (2021-10-25T12:50:26Z) - Enhancing Label Correlation Feedback in Multi-Label Text Classification
via Multi-Task Learning [6.1538971100140145]
We introduce a novel approach with multi-task learning to enhance label correlation feedback.
We propose two auxiliary label co-occurrence prediction tasks to enhance label correlation learning.
arXiv Detail & Related papers (2021-06-06T12:26:14Z) - TransICD: Transformer Based Code-wise Attention Model for Explainable
ICD Coding [5.273190477622007]
International Classification of Disease (ICD) coding procedure has been shown to be effective and crucial to the billing system in medical sector.
Currently, ICD codes are assigned to a clinical note manually which is likely to cause many errors.
In this project, we apply a transformer-based architecture to capture the interdependence among the tokens of a document and then use a code-wise attention mechanism to learn code-specific representations of the entire document.
arXiv Detail & Related papers (2021-03-28T05:34:32Z) - From Extreme Multi-label to Multi-class: A Hierarchical Approach for
Automated ICD-10 Coding Using Phrase-level Attention [4.387302129801651]
Clinical coding is the task of assigning a set of alphanumeric codes, referred to as ICD (International Classification of Diseases), to a medical event based on the context captured in a clinical narrative.
We propose a novel approach for automatic ICD coding by reformulating the extreme multi-label problem into a simpler multi-class problem using a hierarchical solution.
arXiv Detail & Related papers (2021-02-18T03:19:14Z) - Uncertainty-Aware Label Refinement for Sequence Labeling [47.67853514765981]
We introduce a novel two-stage label decoding framework to model long-term label dependencies.
A base model first predicts draft labels, and then a novel two-stream self-attention model makes refinements on these draft predictions.
arXiv Detail & Related papers (2020-12-19T06:56:59Z) - A Study on the Autoregressive and non-Autoregressive Multi-label
Learning [77.11075863067131]
We propose a self-attention based variational encoder-model to extract the label-label and label-feature dependencies jointly.
Our model can therefore be used to predict all labels in parallel while still including both label-label and label-feature dependencies.
arXiv Detail & Related papers (2020-12-03T05:41:44Z) - Coping with Label Shift via Distributionally Robust Optimisation [72.80971421083937]
We propose a model that minimises an objective based on distributionally robust optimisation (DRO)
We then design and analyse a gradient descent-proximal mirror ascent algorithm tailored for large-scale problems to optimise the proposed objective.
arXiv Detail & Related papers (2020-10-23T08:33:04Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.