Mythological Medical Machine Learning: Boosting the Performance of a
Deep Learning Medical Data Classifier Using Realistic Physiological Models
- URL: http://arxiv.org/abs/2112.15442v1
- Date: Tue, 28 Dec 2021 17:55:37 GMT
- Title: Mythological Medical Machine Learning: Boosting the Performance of a
Deep Learning Medical Data Classifier Using Realistic Physiological Models
- Authors: Ismail Sadiq (1), Erick A. Perez-Alday (2), Amit J. Shah (2), Ali
Bahrami Rad (2), Reza Sameni (2), Gari D. Clifford (1,2)
- Abstract summary: We generate 180,000 artificial ECGs with or without significant T-wave Alternans (TWA) as a result of PTSD.
A deep neural network (DNN) trained on over 70,000 patients to classify 25 different rhythms was modified the output layer to a binary class.
The best performing approach was found by performing both transfer learning steps, using the pre-trained arrhythmia DNN, the artificial data and the real PTSD-related ECG data.
- Score: 0.0
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: Objective: To determine if a realistic, but computationally efficient model
of the electrocardiogram can be used to pre-train a deep neural network (DNN)
with a wide range of morphologies and abnormalities specific to a given
condition - T-wave Alternans (TWA) as a result of Post-Traumatic Stress
Disorder, or PTSD - and significantly boost performance on a small database of
rare individuals.
Approach: Using a previously validated artificial ECG model, we generated
180,000 artificial ECGs with or without significant TWA, with varying heart
rate, breathing rate, TWA amplitude, and ECG morphology. A DNN, trained on over
70,000 patients to classify 25 different rhythms, was modified the output layer
to a binary class (TWA or no-TWA, or equivalently, PTSD or no-PTSD), and
transfer learning was performed on the artificial ECG. In a final transfer
learning step, the DNN was trained and cross-validated on ECG from 12 PTSD and
24 controls for all combinations of using the three databases.
Main results: The best performing approach (AUROC = 0.77, Accuracy = 0.72,
F1-score = 0.64) was found by performing both transfer learning steps, using
the pre-trained arrhythmia DNN, the artificial data and the real PTSD-related
ECG data. Removing the artificial data from training led to the largest drop in
performance. Removing the arrhythmia data from training provided a modest, but
significant, drop in performance. The final model showed no significant drop in
performance on the artificial data, indicating no overfitting.
Significance: In healthcare, it is common to only have a small collection of
high-quality data and labels, or a larger database with much lower quality (and
less relevant) labels. The paradigm presented here, involving model-based
performance boosting, provides a solution through transfer learning on a large
realistic artificial database, and a partially relevant real database.
Related papers
- The effect of data augmentation and 3D-CNN depth on Alzheimer's Disease
detection [51.697248252191265]
This work summarizes and strictly observes best practices regarding data handling, experimental design, and model evaluation.
We focus on Alzheimer's Disease (AD) detection, which serves as a paradigmatic example of challenging problem in healthcare.
Within this framework, we train predictive 15 models, considering three different data augmentation strategies and five distinct 3D CNN architectures.
arXiv Detail & Related papers (2023-09-13T10:40:41Z) - Textual Data Augmentation for Patient Outcomes Prediction [67.72545656557858]
We propose a novel data augmentation method to generate artificial clinical notes in patients' Electronic Health Records.
We fine-tune the generative language model GPT-2 to synthesize labeled text with the original training data.
We evaluate our method on the most common patient outcome, i.e., the 30-day readmission rate.
arXiv Detail & Related papers (2022-11-13T01:07:23Z) - Classification and Self-Supervised Regression of Arrhythmic ECG Signals
Using Convolutional Neural Networks [13.025714736073489]
We propose a deep neural network model capable of solving regression and classification tasks.
We tested the model on the MIT-BIH Arrhythmia database.
arXiv Detail & Related papers (2022-10-25T18:11:13Z) - Synthetic ECG Signal Generation Using Generative Neural Networks [7.122393663641668]
We studied the synthetic ECG generation capability of 5 different models from the generative adversarial network (GAN) family.
The results show that all the tested models can to an extent successfully mass-generate acceptable heartbeats with high similarity in morphological features.
arXiv Detail & Related papers (2021-12-05T20:28:55Z) - Generalizing electrocardiogram delineation: training convolutional
neural networks with synthetic data augmentation [63.51064808536065]
Existing databases for ECG delineation are small, being insufficient in size and in the array of pathological conditions they represent.
This article delves has two main contributions. First, a pseudo-synthetic data generation algorithm was developed, based in probabilistically composing ECG traces given "pools" of fundamental segments, as cropped from the original databases, and a set of rules for their arrangement into coherent synthetic traces.
Second, two novel segmentation-based loss functions have been developed, which attempt at enforcing the prediction of an exact number of independent structures and at producing closer segmentation boundaries by focusing on a reduced number of samples.
arXiv Detail & Related papers (2021-11-25T10:11:41Z) - Improving the efficacy of Deep Learning models for Heart Beat detection
on heterogeneous datasets [0.0]
We investigate the issues related to applying a Deep Learning model on heterogeneous datasets.
We show that the performance of a model trained on data from healthy subjects decreases when applied to patients with cardiac conditions.
We then evaluate the use of Transfer Learning to adapt the model to the different datasets.
arXiv Detail & Related papers (2021-10-26T14:26:55Z) - SOUL: An Energy-Efficient Unsupervised Online Learning Seizure Detection
Classifier [68.8204255655161]
Implantable devices that record neural activity and detect seizures have been adopted to issue warnings or trigger neurostimulation to suppress seizures.
For an implantable seizure detection system, a low power, at-the-edge, online learning algorithm can be employed to dynamically adapt to neural signal drifts.
SOUL was fabricated in TSMC's 28 nm process occupying 0.1 mm2 and achieves 1.5 nJ/classification energy efficiency, which is at least 24x more efficient than state-of-the-art.
arXiv Detail & Related papers (2021-10-01T23:01:20Z) - Real-Time Patient-Specific ECG Classification by 1D Self-Operational
Neural Networks [24.226952040270564]
We propose 1D Self-organized Operational Neural Networks (1D Self-ONNs) for ECG classification.
1D Self-ONNs have the utmost advantage and superiority over conventional ONNs where the prior operator search within the operator set library is entirely avoided.
Our results over the MIT-BIH arrhythmia benchmark database demonstrate that 1D Self-ONNs can surpass 1D CNNs with a significant margin.
arXiv Detail & Related papers (2021-09-30T19:37:36Z) - Bootstrapping Your Own Positive Sample: Contrastive Learning With
Electronic Health Record Data [62.29031007761901]
This paper proposes a novel contrastive regularized clinical classification model.
We introduce two unique positive sampling strategies specifically tailored for EHR data.
Our framework yields highly competitive experimental results in predicting the mortality risk on real-world COVID-19 EHR data.
arXiv Detail & Related papers (2021-04-07T06:02:04Z) - ECG-DelNet: Delineation of Ambulatory Electrocardiograms with Mixed
Quality Labeling Using Neural Networks [69.25956542388653]
Deep learning (DL) algorithms are gaining weight in academic and industrial settings.
We demonstrate DL can be successfully applied to low interpretative tasks by embedding ECG detection and delineation onto a segmentation framework.
The model was trained using PhysioNet's QT database, comprised of 105 ambulatory ECG recordings.
arXiv Detail & Related papers (2020-05-11T16:29:12Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.