Machine Learning Based on Natural Language Processing to Detect Cardiac
Failure in Clinical Narratives
- URL: http://arxiv.org/abs/2104.03934v1
- Date: Thu, 8 Apr 2021 17:28:43 GMT
- Title: Machine Learning Based on Natural Language Processing to Detect Cardiac
Failure in Clinical Narratives
- Authors: Thanh-Dung Le, Rita Noumeir, Jerome Rambaud, Guillaume Sans, and
Philippe Jouvet
- Abstract summary: The purpose of the study is to develop a machine learning algorithm that automatically detects whether a patient has a cardiac failure or a healthy condition.
A word representation learning technique was employed by using bag-of-word (BoW), term frequency inverse document frequency (TFIDF), and neural word embeddings (word2vec)
The proposed framework yielded an overall classification performance with acc, pre, rec, and f1 of 84% and 82%, 85%, and 83%, respectively.
- Score: 0.2936007114555107
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The purpose of the study presented herein is to develop a machine learning
algorithm based on natural language processing that automatically detects
whether a patient has a cardiac failure or a healthy condition by using
physician notes in Research Data Warehouse at CHU Sainte Justine Hospital.
First, a word representation learning technique was employed by using
bag-of-word (BoW), term frequency inverse document frequency (TFIDF), and
neural word embeddings (word2vec). Each representation technique aims to retain
the words semantic and syntactic analysis in critical care data. It helps to
enrich the mutual information for the word representation and leads to an
advantage for further appropriate analysis steps. Second, a machine learning
classifier was used to detect the patients condition for either cardiac failure
or stable patient through the created word representation vector space from the
previous step. This machine learning approach is based on a supervised binary
classification algorithm, including logistic regression (LR), Gaussian
Naive-Bayes (GaussianNB), and multilayer perceptron neural network (MLPNN).
Technically, it mainly optimizes the empirical loss during training the
classifiers. As a result, an automatic learning algorithm would be accomplished
to draw a high classification performance, including accuracy (acc), precision
(pre), recall (rec), and F1 score (f1). The results show that the combination
of TFIDF and MLPNN always outperformed other combinations with all overall
performance. In the case without any feature selection, the proposed framework
yielded an overall classification performance with acc, pre, rec, and f1 of 84%
and 82%, 85%, and 83%, respectively. Significantly, if the feature selection
was well applied, the overall performance would finally improve up to 4% for
each evaluation.
Related papers
- Enhancing Electrocardiogram Signal Analysis Using NLP-Inspired Techniques: A Novel Approach with Embedding and Self-Attention [2.7651063843287718]
We propose a novel ECG analysis technique, based on embedding and self attention, to capture the spatial as well as the temporal dependencies of the ECG data.
An accuracy of 91% was achieved with a good F1-score for all the disease classes.
arXiv Detail & Related papers (2024-07-15T12:20:15Z) - Comparison of Machine Learning Classification Algorithms and Application
to the Framingham Heart Study [0.0]
The use of machine learning algorithms in healthcare can amplify social injustices and health inequities.
This research pertains to some generalizability impediments that occur during the development and the post-deployment of machine learning classification algorithms.
arXiv Detail & Related papers (2024-02-22T22:49:35Z) - Automatically measuring speech fluency in people with aphasia: first
achievements using read-speech data [55.84746218227712]
This study aims at assessing the relevance of a signalprocessingalgorithm, initially developed in the field of language acquisition, for the automatic measurement of speech fluency.
arXiv Detail & Related papers (2023-08-09T07:51:40Z) - Detecting automatically the layout of clinical documents to enhance the
performances of downstream natural language processing [53.797797404164946]
We designed an algorithm to process clinical PDF documents and extract only clinically relevant text.
The algorithm consists of several steps: initial text extraction using a PDF, followed by classification into such categories as body text, left notes, and footers.
Medical performance was evaluated by examining the extraction of medical concepts of interest from the text in their respective sections.
arXiv Detail & Related papers (2023-05-23T08:38:33Z) - Classification and Self-Supervised Regression of Arrhythmic ECG Signals
Using Convolutional Neural Networks [13.025714736073489]
We propose a deep neural network model capable of solving regression and classification tasks.
We tested the model on the MIT-BIH Arrhythmia database.
arXiv Detail & Related papers (2022-10-25T18:11:13Z) - Adaptation of Autoencoder for Sparsity Reduction From Clinical Notes
Representation Learning [0.19573380763700707]
We propose an autoencoder learning algorithm to take advantage of sparsity reduction in clinical note representation.
The motivation was to determine how to compress sparse, high-dimensional data by reducing the dimension of the clinical note representation feature space.
The proposed approach provided overall performance gains of up to 3% for each evaluation.
arXiv Detail & Related papers (2022-09-26T16:37:37Z) - Deep Feature Learning for Medical Acoustics [78.56998585396421]
The purpose of this paper is to compare different learnables in medical acoustics tasks.
A framework has been implemented to classify human respiratory sounds and heartbeats in two categories, i.e. healthy or affected by pathologies.
arXiv Detail & Related papers (2022-08-05T10:39:37Z) - Towards Diverse Evaluation of Class Incremental Learning: A Representation Learning Perspective [67.45111837188685]
Class incremental learning (CIL) algorithms aim to continually learn new object classes from incrementally arriving data.
We experimentally analyze neural network models trained by CIL algorithms using various evaluation protocols in representation learning.
arXiv Detail & Related papers (2022-06-16T11:44:11Z) - Do We Really Need a Learnable Classifier at the End of Deep Neural
Network? [118.18554882199676]
We study the potential of learning a neural network for classification with the classifier randomly as an ETF and fixed during training.
Our experimental results show that our method is able to achieve similar performances on image classification for balanced datasets.
arXiv Detail & Related papers (2022-03-17T04:34:28Z) - Detecting of a Patient's Condition From Clinical Narratives Using
Natural Language Representation [0.3149883354098941]
This paper proposes a joint clinical natural language representation learning and supervised classification framework.
The novel framework jointly discovers distributional syntactic and latent semantic (representation learning) from contextual clinical narrative inputs.
The proposed framework yields an overall classification performance with accuracy, recall, and precision of 89 % and 88 %, 89 %, respectively.
arXiv Detail & Related papers (2021-04-08T17:16:04Z) - Network Classifiers Based on Social Learning [71.86764107527812]
We propose a new way of combining independently trained classifiers over space and time.
The proposed architecture is able to improve prediction performance over time with unlabeled data.
We show that this strategy results in consistent learning with high probability, and it yields a robust structure against poorly trained classifiers.
arXiv Detail & Related papers (2020-10-23T11:18:20Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.