Intra-Inter Subject Self-supervised Learning for Multivariate Cardiac
Signals
- URL: http://arxiv.org/abs/2109.08908v1
- Date: Sat, 18 Sep 2021 11:37:10 GMT
- Title: Intra-Inter Subject Self-supervised Learning for Multivariate Cardiac
Signals
- Authors: Xiang Lan, Dianwen Ng, Shenda Hong, Mengling Feng
- Abstract summary: We propose an Intra-inter Subject self-supervised Learning (ISL) model that is customized for multivariate cardiac signals.
Our proposed ISL model integrates medical knowledge into self-supervision to effectively learn from intra-inter subject differences.
In a semi-supervised transfer learning scenario, our pre-trained ISL model leads about 10% improvement over supervised training when only 1% labeled data is available.
- Score: 5.344233761474541
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Learning information-rich and generalizable representations effectively from
unlabeled multivariate cardiac signals to identify abnormal heart rhythms
(cardiac arrhythmias) is valuable in real-world clinical settings but often
challenging due to its complex temporal dynamics. Cardiac arrhythmias can vary
significantly in temporal patterns even for the same patient ($i.e.$, intra
subject difference). Meanwhile, the same type of cardiac arrhythmia can show
different temporal patterns among different patients due to different cardiac
structures ($i.e.$, inter subject difference). In this paper, we address the
challenges by proposing an Intra-inter Subject self-supervised Learning (ISL)
model that is customized for multivariate cardiac signals. Our proposed ISL
model integrates medical knowledge into self-supervision to effectively learn
from intra-inter subject differences. In intra subject self-supervision, ISL
model first extracts heartbeat-level features from each subject using a
channel-wise attentional CNN-RNN encoder. Then a stationarity test module is
employed to capture the temporal dependencies between heartbeats. In inter
subject self-supervision, we design a set of data augmentations according to
the clinical characteristics of cardiac signals and perform contrastive
learning among subjects to learn distinctive representations for various types
of patients. Extensive experiments on three real-world datasets were conducted.
In a semi-supervised transfer learning scenario, our pre-trained ISL model
leads about 10% improvement over supervised training when only 1% labeled data
is available, suggesting strong generalizability and robustness of the model.
Related papers
- Multi-View Stenosis Classification Leveraging Transformer-Based Multiple-Instance Learning Using Real-World Clinical Data [76.89269238957593]
Coronary artery stenosis is a leading cause of cardiovascular disease, diagnosed by analyzing the coronary arteries from multiple angiography views.<n>We propose SegmentMIL, a transformer-based multi-view multiple-instance learning framework for patient-level stenosis classification.
arXiv Detail & Related papers (2026-02-02T13:07:52Z) - Acoustic Index: A Novel AI-Driven Parameter for Cardiac Disease Risk Stratification Using Echocardiography [0.0]
We introduce the Acoustic Index, a novel AI-derived echocardiographic parameter designed to quantify cardiac dysfunction from standard ultrasound views.<n>The model combines Extended Dynamic Mode Decomposition (EDMD) based on Koopman operator theory with a hybrid neural network that incorporates clinical metadata.<n>In a prospective cohort of 736 patients, encompassing various cardiac pathologies and normal controls, the Acoustic Index achieved an area under the curve (AUC) of 0.89 in an independent test set.<n>Cross-validation across five folds confirmed the robustness of the model, showing that both sensitivity and specificity exceeded 0.8 when evaluated on independent data.
arXiv Detail & Related papers (2025-07-17T21:27:28Z) - Global and Local Contrastive Learning for Joint Representations from Cardiac MRI and ECG [40.407824759778784]
PTACL (Patient and Temporal Alignment Contrastive Learning) is a multimodal contrastive learning framework that enhances ECG representations by integrating-temporal information from CMR.<n>We evaluate PTACL on paired ECG-CMR data from 27,951 subjects in the UK Biobank.<n>Our results highlight the potential of PTACL to enhance non-invasive cardiac diagnostics using ECG.
arXiv Detail & Related papers (2025-06-24T17:19:39Z) - Sensing Cardiac Health Across Scenarios and Devices: A Multi-Modal Foundation Model Pretrained on Heterogeneous Data from 1.7 Million Individuals [36.08910150609342]
We present a cardiac sensing foundation model (CSFM) that learns unified representations from vast, heterogeneous health records.<n>Our model is pretrained on an innovative multi-modal integration of data from multiple large-scale datasets.<n> CSFM consistently outperforms traditional one-modal-one-task approaches.
arXiv Detail & Related papers (2025-06-23T20:58:12Z) - Manifold Learning for Personalized and Label-Free Detection of Cardiac Arrhythmias [0.9208007322096533]
We show that nonlinear dimensionality reduction (NLDR) can accommodate medically relevant features in ECG signals.<n>Using the MLII and V1 leads of the MIT-BIH dataset, we demonstrate that NLDR holds much promise for cardiac monitoring.
arXiv Detail & Related papers (2025-06-19T17:39:57Z) - ArrhythmiaVision: Resource-Conscious Deep Learning Models with Visual Explanations for ECG Arrhythmia Classification [0.0]
We propose ArrhythmiNet V1 and V2, optimized for efficient, real-time arrhythmia classification on edge devices.<n>Inspired by MobileNet's depthwise separable convolutional design, these models maintain memory footprints of just 302.18 KB and 157.76 KB, respectively.<n>Our findings demonstrate the feasibility of combining interpretability, predictive accuracy, and computational efficiency in practical, wearable, and embedded ECG monitoring systems.
arXiv Detail & Related papers (2025-04-30T18:22:45Z) - CTPD: Cross-Modal Temporal Pattern Discovery for Enhanced Multimodal Electronic Health Records Analysis [46.56667527672019]
We introduce a Cross-Modal Temporal Pattern Discovery (CTPD) framework, designed to efficiently extract meaningful cross-modal temporal patterns from multimodal EHR data.
Our approach introduces shared initial temporal pattern representations which are refined using slot attention to generate temporal semantic embeddings.
arXiv Detail & Related papers (2024-11-01T15:54:07Z) - Self-supervised inter-intra period-aware ECG representation learning for detecting atrial fibrillation [41.82319894067087]
We propose an inter-intra period-aware ECG representation learning approach.
Considering ECGs of atrial fibrillation patients exhibit the irregularity in RR intervals and the absence of P-waves, we develop specific pre-training tasks for interperiod and intraperiod representations.
Our approach demonstrates remarkable AUC performances on the BTCH dataset, textiti.e., 0.953/0.996 for paroxysmal/persistent atrial fibrillation detection.
arXiv Detail & Related papers (2024-10-08T10:03:52Z) - Adversarial Contrastive Learning Based Physics-Informed Temporal Networks for Cuffless Blood Pressure Estimation [37.94387581519217]
We introduce a novel physics-informed temporal network(PITN) with adversarial contrastive learning to enable precise BP estimation with very limited data.
We then employ adversarial training to generate extra physiological time series data, improving PITN's robustness in the face of sparse subject-specific data.
arXiv Detail & Related papers (2024-08-16T02:17:21Z) - A Generalizable Deep Learning System for Cardiac MRI [29.429744474335347]
We describe a foundational vision system for cardiac MRI, capable of representing the breadth of human cardiovascular disease and health.
Our deep learning model is trained via self-supervised contrastive learning, by which visual concepts in cine-sequence cardiac MRI scans are learned from the raw text of the accompanying radiology reports.
We show that our deep learning system is capable of not only understanding the staggering complexity of human cardiovascular disease, but can be directed towards clinical problems of interest yielding impressive, clinical grade diagnostic accuracy with a fraction of the training data typically required for such tasks.
arXiv Detail & Related papers (2023-12-01T05:27:29Z) - Successive Subspace Learning for Cardiac Disease Classification with
Two-phase Deformation Fields from Cine MRI [36.044984400761535]
This work proposes a lightweight successive subspace learning framework for CVD classification.
It is based on an interpretable feedforward design, in conjunction with a cardiac atlas.
Compared with 3D CNN-based approaches, our framework achieves superior classification performance with 140$times$ fewer parameters.
arXiv Detail & Related papers (2023-01-21T15:00:59Z) - Multimodality Multi-Lead ECG Arrhythmia Classification using
Self-Supervised Learning [5.675787521359948]
We propose SSL-based multimodality ECG classification.
Our proposed network follows SSL learning paradigm and consists of two modules corresponding to pre-stream task, and down-stream task.
To evaluate the effectiveness of our approach, ten-fold cross validation on the 12-lead PhysioNet 2020 dataset has been conducted.
arXiv Detail & Related papers (2022-09-30T18:45:34Z) - Self-supervised contrastive learning of echocardiogram videos enables
label-efficient cardiac disease diagnosis [48.64462717254158]
We developed a self-supervised contrastive learning approach, EchoCLR, to catered to echocardiogram videos.
When fine-tuned on small portions of labeled data, EchoCLR pretraining significantly improved classification performance for left ventricular hypertrophy (LVH) and aortic stenosis (AS)
EchoCLR is unique in its ability to learn representations of medical videos and demonstrates that SSL can enable label-efficient disease classification from small, labeled datasets.
arXiv Detail & Related papers (2022-07-23T19:17:26Z) - Development of Interpretable Machine Learning Models to Detect
Arrhythmia based on ECG Data [0.0]
This thesis builds Convolutional Neural Network (CNN) and Long Short-Term Memory (LSTM) classifiers based on state-of-the-art models.
Both global and local interpretability methods are exploited to understand the interaction between dependent and independent variables.
It was found that Grad-Cam was the most effective interpretability technique at explaining predictions of proposed CNN and LSTM models.
arXiv Detail & Related papers (2022-05-05T17:29:33Z) - Deep Learning Analysis of Cardiac MRI in Legacy Datasets: Multi-Ethnic
Study of Atherosclerosis [0.4585572408645652]
The Multi-Ethnic Study of Atherosclerosis (MESA) was the first large cohort study to incorporate cardiovascular MRI in over 5000 participants, and there is now a wealth of follow-up data over 20 years.
We describe an automated atlas construction pipeline using deep learning methods applied to the legacy cardiac MRI data in MESA.
arXiv Detail & Related papers (2021-10-28T14:18:21Z) - A Visual Domain Transfer Learning Approach for Heartbeat Sound
Classification [0.0]
Heart disease is the most common reason for human mortality that causes almost one-third of deaths throughout the world.
Detecting the disease early increases the chances of survival of the patient and there are several ways a sign of heart disease can be detected early.
This research proposes to convert cleansed and normalized heart sound into visual mel scale spectrograms and then using visual domain transfer learning approaches to automatically extract features and categorize between heart sounds.
arXiv Detail & Related papers (2021-07-28T09:41:38Z) - A Study of Domain Generalization on Ultrasound-based Multi-Class
Segmentation of Arteries, Veins, Ligaments, and Nerves Using Transfer
Learning [12.869637050331306]
We study the US-based segmentation of multiple classes through transfer learning by fine-tuning different contiguous blocks within the model.
We propose a simple method for predicting generalization on unseen datasets and observe statistically significant differences between the fine-tuning methods.
arXiv Detail & Related papers (2020-11-13T16:59:20Z) - Autoencoders for Unsupervised Anomaly Segmentation in Brain MR Images: A
Comparative Study [43.26668942258135]
New approaches in the field of Unsupervised Anomaly Detection (UAD) in brain MRI.
Main principle behind these works is to learn a model of normal anatomy by learning to compress and recover healthy data.
concept is of great interest to the medical image analysis community as it i) relieves from the need of vast amounts of manually segmented training data.
arXiv Detail & Related papers (2020-04-07T11:12:07Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.