Detecting of a Patient's Condition From Clinical Narratives Using
Natural Language Representation
- URL: http://arxiv.org/abs/2104.03969v1
- Date: Thu, 8 Apr 2021 17:16:04 GMT
- Title: Detecting of a Patient's Condition From Clinical Narratives Using
Natural Language Representation
- Authors: Thanh-Dung Le, Jerome Rambaud, Guillaume Sans, Philippe Jouvet and
Rita Noumeir
- Abstract summary: This paper proposes a joint clinical natural language representation learning and supervised classification framework.
The novel framework jointly discovers distributional syntactic and latent semantic (representation learning) from contextual clinical narrative inputs.
The proposed framework yields an overall classification performance with accuracy, recall, and precision of 89 % and 88 %, 89 %, respectively.
- Score: 0.3149883354098941
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: This paper proposes a joint clinical natural language representation learning
and supervised classification framework based on machine learning for detecting
concept labels in clinical narratives at CHU Sainte Justine Hospital (CHUSJ).
The novel framework jointly discovers distributional syntactic and latent
semantic (representation learning) from contextual clinical narrative inputs
and, then, learns the knowledge representation for labeling in the contextual
output (supervised classification). First, for having an effective
representation learning approach with a small data set, mixing of numeric
values and texts. Four different methods are applied to capture the numerical
vital sign values. Then, different representation learning approaches are using
to discover the rich structure from clinical narrative data. Second, for an
automatic encounter with disease prediction, in this case, cardiac failure. The
binary classifiers are iteratively trained to learn the knowledge
representation of processed data in the preceding steps. The multilayer
perceptron neural network outperforms other discriminative and generative
classifiers. Consequently, the proposed framework yields an overall
classification performance with accuracy, recall, and precision of 89 % and 88
%, 89 %, respectively. Furthermore, a generative autoencoder (AE) learning
algorithm is then proposed to leverage the sparsity reduction. Affirmatively,
AE algorithm is overperforming other sparsity reduction techniques. And, the
classifier performances can successfully achieve up to 91 %, 91%, and 91%,
respectively, for accuracy, recall, and precision.
Related papers
- Local-to-Global Self-Supervised Representation Learning for Diabetic Retinopathy Grading [0.0]
This research aims to present a novel hybrid learning model using self-supervised learning and knowledge distillation.
In our algorithm, for the first time among all self-supervised learning and knowledge distillation models, the test dataset is 50% larger than the training dataset.
Compared to a similar state-of-the-art model, our results achieved higher accuracy and more effective representation spaces.
arXiv Detail & Related papers (2024-10-01T15:19:16Z) - Infusing clinical knowledge into tokenisers for language models [1.9921590146992474]
This study introduces a novel knowledge enhanced tokenisation mechanism, K-Tokeniser, for clinical text processing.
At initialisation stage, K-Tokeniser populates global representations of tokens based on semantic types of domain concepts.
To avoid pretraining using the new tokeniser, an embedding initialisation approach is proposed to generate representations for new tokens.
arXiv Detail & Related papers (2024-06-20T13:43:03Z) - Comparison of Machine Learning Classification Algorithms and Application
to the Framingham Heart Study [0.0]
The use of machine learning algorithms in healthcare can amplify social injustices and health inequities.
This research pertains to some generalizability impediments that occur during the development and the post-deployment of machine learning classification algorithms.
arXiv Detail & Related papers (2024-02-22T22:49:35Z) - Multi-task Explainable Skin Lesion Classification [54.76511683427566]
We propose a few-shot-based approach for skin lesions that generalizes well with few labelled data.
The proposed approach comprises a fusion of a segmentation network that acts as an attention module and classification network.
arXiv Detail & Related papers (2023-10-11T05:49:47Z) - XAL: EXplainable Active Learning Makes Classifiers Better Low-resource Learners [71.8257151788923]
We propose a novel Explainable Active Learning framework (XAL) for low-resource text classification.
XAL encourages classifiers to justify their inferences and delve into unlabeled data for which they cannot provide reasonable explanations.
Experiments on six datasets show that XAL achieves consistent improvement over 9 strong baselines.
arXiv Detail & Related papers (2023-10-09T08:07:04Z) - Detecting automatically the layout of clinical documents to enhance the
performances of downstream natural language processing [53.797797404164946]
We designed an algorithm to process clinical PDF documents and extract only clinically relevant text.
The algorithm consists of several steps: initial text extraction using a PDF, followed by classification into such categories as body text, left notes, and footers.
Medical performance was evaluated by examining the extraction of medical concepts of interest from the text in their respective sections.
arXiv Detail & Related papers (2023-05-23T08:38:33Z) - Adaptation of Autoencoder for Sparsity Reduction From Clinical Notes
Representation Learning [0.19573380763700707]
We propose an autoencoder learning algorithm to take advantage of sparsity reduction in clinical note representation.
The motivation was to determine how to compress sparse, high-dimensional data by reducing the dimension of the clinical note representation feature space.
The proposed approach provided overall performance gains of up to 3% for each evaluation.
arXiv Detail & Related papers (2022-09-26T16:37:37Z) - LifeLonger: A Benchmark for Continual Disease Classification [59.13735398630546]
We introduce LifeLonger, a benchmark for continual disease classification on the MedMNIST collection.
Task and class incremental learning of diseases address the issue of classifying new samples without re-training the models from scratch.
Cross-domain incremental learning addresses the issue of dealing with datasets originating from different institutions while retaining the previously obtained knowledge.
arXiv Detail & Related papers (2022-04-12T12:25:05Z) - SCARF: Self-Supervised Contrastive Learning using Random Feature
Corruption [72.35532598131176]
We propose SCARF, a technique for contrastive learning, where views are formed by corrupting a random subset of features.
We show that SCARF complements existing strategies and outperforms alternatives like autoencoders.
arXiv Detail & Related papers (2021-06-29T08:08:33Z) - Machine Learning Based on Natural Language Processing to Detect Cardiac
Failure in Clinical Narratives [0.2936007114555107]
The purpose of the study is to develop a machine learning algorithm that automatically detects whether a patient has a cardiac failure or a healthy condition.
A word representation learning technique was employed by using bag-of-word (BoW), term frequency inverse document frequency (TFIDF), and neural word embeddings (word2vec)
The proposed framework yielded an overall classification performance with acc, pre, rec, and f1 of 84% and 82%, 85%, and 83%, respectively.
arXiv Detail & Related papers (2021-04-08T17:28:43Z) - Semi-supervised Medical Image Classification with Relation-driven
Self-ensembling Model [71.80319052891817]
We present a relation-driven semi-supervised framework for medical image classification.
It exploits the unlabeled data by encouraging the prediction consistency of given input under perturbations.
Our method outperforms many state-of-the-art semi-supervised learning methods on both single-label and multi-label image classification scenarios.
arXiv Detail & Related papers (2020-05-15T06:57:54Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.