Electrocardiogram-based diagnosis of liver diseases: an externally validated and explainable machine learning approach
- URL: http://arxiv.org/abs/2412.03717v2
- Date: Tue, 20 May 2025 08:35:51 GMT
- Title: Electrocardiogram-based diagnosis of liver diseases: an externally validated and explainable machine learning approach
- Authors: Juan Miguel Lopez Alcaraz, Wilhelm Haverkamp, Nils Strodthoff,
- Abstract summary: Liver diseases present a significant global health challenge.<n>ECG can enable the detection of liver disease by capturing cardiovascular-hepatic interactions.<n>We trained tree-based machine learning models on ECG features to detect liver diseases.
- Score: 0.9503773054285559
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Background: Liver diseases present a significant global health challenge and often require costly, invasive diagnostics. Electrocardiography (ECG), a widely available and non-invasive tool, can enable the detection of liver disease by capturing cardiovascular-hepatic interactions. Methods: We trained tree-based machine learning models on ECG features to detect liver diseases using two large datasets: MIMIC-IV-ECG (467,729 patients, 2008-2019) and ECG-View II (775,535 patients, 1994-2013). The task was framed as binary classification, with performance evaluated via the area under the receiver operating characteristic curve (AUROC). To improve interpretability, we applied explainability methods to identify key predictive features. Findings: The models showed strong predictive performance with good generalizability. For example, AUROCs for alcoholic liver disease (K70) were 0.8025 (95% confidence interval (CI), 0.8020-0.8035) internally and 0.7644 (95% CI, 0.7641-0.7649) externally; for hepatic failure (K72), scores were 0.7404 (95% CI, 0.7389-0.7415) and 0.7498 (95% CI, 0.7494-0.7509), respectively. The explainability analysis consistently identified age and prolonged QTc intervals (corrected QT, reflecting ventricular repolarization) as key predictors. Features linked to autonomic regulation and electrical conduction abnormalities were also prominent, supporting known cardiovascular-liver connections and suggesting QTc as a potential biomarker. Interpretation: ECG-based machine learning offers a promising, interpretable approach for liver disease detection, particularly in resource-limited settings. By revealing clinically relevant biomarkers, this method supports non-invasive diagnostics, early detection, and risk stratification prior to targeted clinical assessments.
Related papers
- Manifold Learning for Personalized and Label-Free Detection of Cardiac Arrhythmias [0.9208007322096533]
We show that nonlinear dimensionality reduction (NLDR) can accommodate medically relevant features in ECG signals.<n>Using the MLII and V1 leads of the MIT-BIH dataset, we demonstrate that NLDR holds much promise for cardiac monitoring.
arXiv Detail & Related papers (2025-06-19T17:39:57Z) - Finetuning and Quantization of EEG-Based Foundational BioSignal Models on ECG and PPG Data for Blood Pressure Estimation [53.2981100111204]
Photoplethysmography and electrocardiography can potentially enable continuous blood pressure (BP) monitoring.<n>Yet accurate and robust machine learning (ML) models remains challenging due to variability in data quality and patient-specific factors.<n>In this work, we investigate whether a model pre-trained on one modality can effectively be exploited to improve the accuracy of a different signal type.<n>Our approach achieves near state-of-the-art accuracy for diastolic BP and surpasses by 1.5x the accuracy of prior works for systolic BP.
arXiv Detail & Related papers (2025-02-10T13:33:12Z) - Explainable and externally validated machine learning for neuropsychiatric diagnosis via electrocardiograms [0.8108972030676012]
Electrocardiogram (ECG) analysis has emerged as a promising tool for identifying physiological changes associated with neuropsychiatric conditions.<n>The potential of the ECG to accurately distinguish neuropsychiatric conditions, particularly among diverse patient populations, remains underexplored.<n>This study utilized ECG markers and basic demographic data to predict neuropsychiatric conditions using machine learning models.
arXiv Detail & Related papers (2025-02-07T13:37:13Z) - Explainable machine learning for neoplasms diagnosis via electrocardiograms: an externally validated study [0.9503773054285559]
Neoplasms remains a leading cause of mortality worldwide.
Current diagnostic methods are often invasive, costly, and inaccessible to many populations.
This study explores the application of machine learning models to analyze ECG features for the diagnosis of neoplasms.
arXiv Detail & Related papers (2024-12-10T18:34:08Z) - Integrating Deep Learning with Fundus and Optical Coherence Tomography for Cardiovascular Disease Prediction [47.7045293755736]
Early identification of patients at risk of cardiovascular diseases (CVD) is crucial for effective preventive care, reducing healthcare burden, and improving patients' quality of life.
This study demonstrates the potential of retinal optical coherence tomography ( OCT) imaging combined with fundus photographs for identifying future adverse cardiac events.
We propose a novel binary classification network based on a Multi-channel Variational Autoencoder (MCVAE), which learns a latent embedding of patients' fundus and OCT images to classify individuals into two groups: those likely to develop CVD in the future and those who are not.
arXiv Detail & Related papers (2024-10-18T12:37:51Z) - Self-supervised inter-intra period-aware ECG representation learning for detecting atrial fibrillation [41.82319894067087]
We propose an inter-intra period-aware ECG representation learning approach.
Considering ECGs of atrial fibrillation patients exhibit the irregularity in RR intervals and the absence of P-waves, we develop specific pre-training tasks for interperiod and intraperiod representations.
Our approach demonstrates remarkable AUC performances on the BTCH dataset, textiti.e., 0.953/0.996 for paroxysmal/persistent atrial fibrillation detection.
arXiv Detail & Related papers (2024-10-08T10:03:52Z) - Estimation of Cardiac and Non-cardiac Diagnosis from Electrocardiogram Features [1.068128849363198]
We use publicly available datasets to investigate the feasibility of inferring general diagnostic conditions from ECG features.
We train a tree-based model (XGBoost) based on ECG features and basic demographic features to estimate a wide range of diagnoses.
arXiv Detail & Related papers (2024-08-30T14:42:03Z) - Self-supervised Anomaly Detection Pretraining Enhances Long-tail ECG Diagnosis [32.37717219026923]
Current computer-aided ECG diagnostic systems struggle with the underdetection of rare but critical cardiac anomalies.
This study introduces a novel approach using self-supervised anomaly detection pretraining to address this limitation.
The anomaly detection model is specifically designed to detect and localize subtle deviations from normal cardiac patterns.
arXiv Detail & Related papers (2024-08-30T09:48:47Z) - Explainable Biomedical Hypothesis Generation via Retrieval Augmented Generation enabled Large Language Models [46.05020842978823]
Large Language Models (LLMs) have emerged as powerful tools to navigate this complex data landscape.
RAGGED is a comprehensive workflow designed to support investigators with knowledge integration and hypothesis generation.
arXiv Detail & Related papers (2024-07-17T07:44:18Z) - Anomaly Detection in Electrocardiograms: Advancing Clinical Diagnosis Through Self-Supervised Learning [32.37717219026923]
Existing systems often miss rare cardiac anomalies that could be precursors to serious, life-threatening issues or alterations in the cardiac macro/microstructure.
We focus on self-supervised anomaly detection (AD), training exclusively on normal ECGs to recognize deviations indicating anomalies.
We introduce a novel self-supervised learning framework for ECG AD, utilizing a vast dataset of normal ECGs to autonomously detect and localize cardiac anomalies.
arXiv Detail & Related papers (2024-04-07T12:15:53Z) - Unlocking the Diagnostic Potential of ECG through Knowledge Transfer
from Cardiac MRI [6.257859765229826]
We propose the first self-supervised contrastive approach that transfers domain-specific information from CMR images to ECG embeddings.
Our approach combines multimodal contrastive learning with masked data modeling to enable holistic cardiac screening solely from ECG data.
arXiv Detail & Related papers (2023-08-09T10:05:11Z) - Learning to diagnose cirrhosis from radiological and histological labels
with joint self and weakly-supervised pretraining strategies [62.840338941861134]
We propose to leverage transfer learning from large datasets annotated by radiologists, to predict the histological score available on a small annex dataset.
We compare different pretraining methods, namely weakly-supervised and self-supervised ones, to improve the prediction of the cirrhosis.
This method outperforms the baseline classification of the METAVIR score, reaching an AUC of 0.84 and a balanced accuracy of 0.75.
arXiv Detail & Related papers (2023-02-16T17:06:23Z) - Transfer Knowledge from Natural Language to Electrocardiography: Can We
Detect Cardiovascular Disease Through Language Models? [16.220138060415305]
We propose an approach for cardiovascular disease diagnosis and automatic ECG diagnosis report generation.
The learned embeddings are evaluated on two downstream tasks: (1) automatic ECG diagnosis report generation, and (2) zero-shot cardiovascular disease detection.
Our approach is able to generate high-quality cardiac diagnosis reports and also achieves competitive zero-shot classification performance even compared with supervised baselines.
arXiv Detail & Related papers (2023-01-21T21:58:00Z) - Hierarchical Deep Learning with Generative Adversarial Network for
Automatic Cardiac Diagnosis from ECG Signals [2.5008947886814186]
We propose a two-level hierarchical deep learning framework with Generative Adversarial Network (GAN) for automatic diagnosis of ECG signals.
The first-level model is composed of a Memory-Augmented Deep auto-Encoder with GAN, which aims to differentiate abnormal signals from normal ECGs for anomaly detection.
The second-level learning aims at robust multi-class classification for different arrhythmias identification.
arXiv Detail & Related papers (2022-10-19T12:29:05Z) - Self-supervised contrastive learning of echocardiogram videos enables
label-efficient cardiac disease diagnosis [48.64462717254158]
We developed a self-supervised contrastive learning approach, EchoCLR, to catered to echocardiogram videos.
When fine-tuned on small portions of labeled data, EchoCLR pretraining significantly improved classification performance for left ventricular hypertrophy (LVH) and aortic stenosis (AS)
EchoCLR is unique in its ability to learn representations of medical videos and demonstrates that SSL can enable label-efficient disease classification from small, labeled datasets.
arXiv Detail & Related papers (2022-07-23T19:17:26Z) - COVID-Net Biochem: An Explainability-driven Framework to Building
Machine Learning Models for Predicting Survival and Kidney Injury of COVID-19
Patients from Clinical and Biochemistry Data [66.43957431843324]
We introduce COVID-Net Biochem, a versatile and explainable framework for constructing machine learning models.
We apply this framework to predict COVID-19 patient survival and the likelihood of developing Acute Kidney Injury during hospitalization.
arXiv Detail & Related papers (2022-04-24T07:38:37Z) - SIM-ECG: A Signal Importance Mask-driven ECGClassification System [11.030532126096006]
Heart disease is the number one killer, and ECGs can assist in the early diagnosis and prevention of deadly outcomes.
Current systems are not as accurate as skilled ECG readers, and black-box approaches to providing diagnosis result in a lack of trust by medical personnel.
We propose a signal importance mask feedback-based machine learning system that continuously accepts feedback, improves accuracy, and ex-plains the resulting diagnosis.
arXiv Detail & Related papers (2021-10-28T01:27:37Z) - Estimation of atrial fibrillation from lead-I ECGs: Comparison with
cardiologists and machine learning model (CurAlive), a clinical validation
study [0.0]
This study presents a method to detect atrial fibrillation with lead-I ECGs using artificial intelligence.
The aim of the study is to compare the accuracy of the diagnoses estimated by cardiologists and artificial intelligence over lead-I ECGs.
arXiv Detail & Related papers (2021-04-15T12:50:16Z) - Variational Knowledge Distillation for Disease Classification in Chest
X-Rays [102.04931207504173]
We propose itvariational knowledge distillation (VKD), which is a new probabilistic inference framework for disease classification based on X-rays.
We demonstrate the effectiveness of our method on three public benchmark datasets with paired X-ray images and EHRs.
arXiv Detail & Related papers (2021-03-19T14:13:56Z) - Identification of Ischemic Heart Disease by using machine learning
technique based on parameters measuring Heart Rate Variability [50.591267188664666]
In this study, 18 non-invasive features (age, gender, left ventricular ejection fraction and 15 obtained from HRV) of 243 subjects were used to train and validate a series of several ANN.
The best result was obtained using 7 input parameters and 7 hidden nodes with an accuracy of 98.9% and 82% for the training and validation dataset.
arXiv Detail & Related papers (2020-10-29T19:14:41Z) - Dynamic Graph Correlation Learning for Disease Diagnosis with Incomplete
Labels [66.57101219176275]
Disease diagnosis on chest X-ray images is a challenging multi-label classification task.
We propose a Disease Diagnosis Graph Convolutional Network (DD-GCN) that presents a novel view of investigating the inter-dependency among different diseases.
Our method is the first to build a graph over the feature maps with a dynamic adjacency matrix for correlation learning.
arXiv Detail & Related papers (2020-02-26T17:10:48Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.