Related papers: Guiding Masked Representation Learning to Capture Spatio-Temporal Relationship of Electrocardiogram

Guiding Masked Representation Learning to Capture Spatio-Temporal Relationship of Electrocardiogram

URL: http://arxiv.org/abs/2402.09450v3
Date: Tue, 19 Mar 2024 16:17:00 GMT
Title: Guiding Masked Representation Learning to Capture Spatio-Temporal Relationship of Electrocardiogram
Authors: Yeongyeon Na, Minje Park, Yunwon Tae, Sunghoon Joo,
Abstract summary: We introduce ST-MEM (S-Temporal Masked Electrocardiogram Modeling), designed to learntemporal features by reconstructing 12-lead ECG data. ST-MEM outperforms other SSL baseline methods in various experimental settings for arrhythmia.
Score: 2.2842904085777045
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Electrocardiograms (ECG) are widely employed as a diagnostic tool for monitoring electrical signals originating from a heart. Recent machine learning research efforts have focused on the application of screening various diseases using ECG signals. However, adapting to the application of screening disease is challenging in that labeled ECG data are limited. Achieving general representation through self-supervised learning (SSL) is a well-known approach to overcome the scarcity of labeled data; however, a naive application of SSL to ECG data, without considering the spatial-temporal relationships inherent in ECG signals, may yield suboptimal results. In this paper, we introduce ST-MEM (Spatio-Temporal Masked Electrocardiogram Modeling), designed to learn spatio-temporal features by reconstructing masked 12-lead ECG data. ST-MEM outperforms other SSL baseline methods in various experimental settings for arrhythmia classification tasks. Moreover, we demonstrate that ST-MEM is adaptable to various lead combinations. Through quantitative and qualitative analysis, we show a spatio-temporal relationship within ECG data. Our code is available at https://github.com/bakqui/ST-MEM.

Related papers

Global and Local Contrastive Learning for Joint Representations from Cardiac MRI and ECG [40.407824759778784]
PTACL (Patient and Temporal Alignment Contrastive Learning) is a multimodal contrastive learning framework that enhances ECG representations by integrating-temporal information from CMR.<n>We evaluate PTACL on paired ECG-CMR data from 27,951 subjects in the UK Biobank.<n>Our results highlight the potential of PTACL to enhance non-invasive cardiac diagnostics using ECG.
arXiv Detail & Related papers (2025-06-24T17:19:39Z)
xLSTM-ECG: Multi-label ECG Classification via Feature Fusion with xLSTM [14.02717596836022]
We propose xLSTM-ECG, a novel approach for multi-label classification of ECG signals. To the best of our knowledge, this work represents the first design and application of xLSTM modules specifically adapted for multi-label ECG classification.
arXiv Detail & Related papers (2025-04-14T16:12:46Z)
GEM: Empowering MLLM for Grounded ECG Understanding with Time Series and Images [43.65650710265957]
We introduce GEM, the first MLLM unifying ECG time series, 12-lead ECG images and text for grounded and clinician-aligned ECG interpretation. GEM enables feature-grounded analysis, evidence-driven reasoning, and a clinician-like diagnostic process through three core innovations. We propose the Grounded ECG task, a clinically motivated benchmark designed to assess the MLLM's capability in grounded ECG understanding.
arXiv Detail & Related papers (2025-03-08T05:48:53Z)
Synthetic Time Series Data Generation for Healthcare Applications: A PCG Case Study [43.28613210217385]
We employ and compare three state-of-the-art generative models to generate PCG data. Our results demonstrate that the generated PCG data closely resembles the original datasets. In our future work, we plan to incorporate this method into a data augmentation pipeline to synthesize abnormal PCG signals with heart murmurs.
arXiv Detail & Related papers (2024-12-17T18:07:40Z)
Learning General Representation of 12-Lead Electrocardiogram with a Joint-Embedding Predictive Architecture [0.0]
We introduce ECG-JEPA, a self-supervised learning model for 12-lead ECG analysis. It learns semantic representations of ECG data by predicting in the hidden latent space. ECG-JEPA achieves state-of-the-art performance in various downstream tasks including ECG classification and feature prediction.
arXiv Detail & Related papers (2024-10-11T06:30:48Z)
Self-supervised inter-intra period-aware ECG representation learning for detecting atrial fibrillation [41.82319894067087]
We propose an inter-intra period-aware ECG representation learning approach. Considering ECGs of atrial fibrillation patients exhibit the irregularity in RR intervals and the absence of P-waves, we develop specific pre-training tasks for interperiod and intraperiod representations. Our approach demonstrates remarkable AUC performances on the BTCH dataset, textiti.e., 0.953/0.996 for paroxysmal/persistent atrial fibrillation detection.
arXiv Detail & Related papers (2024-10-08T10:03:52Z)
MEIT: Multi-Modal Electrocardiogram Instruction Tuning on Large Language Models for Report Generation [41.324530807795256]
Electrocardiogram (ECG) is the primary non-invasive diagnostic tool for monitoring cardiac conditions. Recent studies have concentrated on classifying cardiac conditions using ECG data but have overlooked ECG report generation. We propose the Multimodal ECG Instruction Tuning (MEIT) framework, the first attempt to tackle ECG report generation with LLMs and multimodal instructions.
arXiv Detail & Related papers (2024-03-07T23:20:56Z)
ECG-SL: Electrocardiogram(ECG) Segment Learning, a deep learning method for ECG signal [19.885905393439014]
We propose a novel ECG-Segment based Learning (ECG-SL) framework to explicitly model the periodic nature of ECG signals. Based on the structural features, a temporal model is designed to learn the temporal information for various clinical tasks. The proposed method outperforms the baseline model and shows competitive performances compared with task-specific methods in three clinical applications.
arXiv Detail & Related papers (2023-10-01T23:17:55Z)
DGSD: Dynamical Graph Self-Distillation for EEG-Based Auditory Spatial Attention Detection [49.196182908826565]
Auditory Attention Detection (AAD) aims to detect target speaker from brain signals in a multi-speaker environment. Current approaches primarily rely on traditional convolutional neural network designed for processing Euclidean data like images. This paper proposes a dynamical graph self-distillation (DGSD) approach for AAD, which does not require speech stimuli as input.
arXiv Detail & Related papers (2023-09-07T13:43:46Z)
Improving Multiple Sclerosis Lesion Segmentation Across Clinical Sites: A Federated Learning Approach with Noise-Resilient Training [75.40980802817349]
Deep learning models have shown promise for automatically segmenting MS lesions, but the scarcity of accurately annotated data hinders progress in this area. We introduce a Decoupled Hard Label Correction (DHLC) strategy that considers the imbalanced distribution and fuzzy boundaries of MS lesions. We also introduce a Centrally Enhanced Label Correction (CELC) strategy, which leverages the aggregated central model as a correction teacher for all sites.
arXiv Detail & Related papers (2023-08-31T00:36:10Z)
ECG-CL: A Comprehensive Electrocardiogram Interpretation Method Based on Continual Learning [20.465733855762835]
Electrocardiogram (ECG) monitoring is one of the most powerful technique of cardiovascular disease (CVD) early identification. Classic rule-based algorithms are now completely outperformed by deep learning based methods. We propose a multi-resolution model that can sustain high-resolution low-level semantic information throughout.
arXiv Detail & Related papers (2023-04-10T15:19:00Z)
Frozen Language Model Helps ECG Zero-Shot Learning [12.974685769614062]
We propose Multimodal ECG-Text Self-supervised pre-training (METS) We use a trainable ECG encoder and a frozen language model to embed paired ECG and automatically machine-generated clinical reports separately. In downstream classification tasks, METS achieves around 10% improvement in performance without using any annotated data.
arXiv Detail & Related papers (2023-03-22T05:01:14Z)
Multimodality Multi-Lead ECG Arrhythmia Classification using Self-Supervised Learning [5.675787521359948]
We propose SSL-based multimodality ECG classification. Our proposed network follows SSL learning paradigm and consists of two modules corresponding to pre-stream task, and down-stream task. To evaluate the effectiveness of our approach, ten-fold cross validation on the 12-lead PhysioNet 2020 dataset has been conducted.
arXiv Detail & Related papers (2022-09-30T18:45:34Z)
GeoECG: Data Augmentation via Wasserstein Geodesic Perturbation for Robust Electrocardiogram Prediction [20.8603653664403]
We propose a physiologically-inspired data augmentation method to improve performance and increase the robustness of heart disease detection based on ECG signals. We obtain augmented samples by perturbing the data distribution towards other classes along the geodesic in Wasserstein space. Learning from 12-lead ECG signals, our model is able to distinguish five categories of cardiac conditions.
arXiv Detail & Related papers (2022-08-02T03:14:13Z)
Uncovering the structure of clinical EEG signals with self-supervised learning [64.4754948595556]
Supervised learning paradigms are often limited by the amount of labeled data that is available. This phenomenon is particularly problematic in clinically-relevant data, such as electroencephalography (EEG) By extracting information from unlabeled data, it might be possible to reach competitive performance with deep neural networks.
arXiv Detail & Related papers (2020-07-31T14:34:47Z)
ECG-DelNet: Delineation of Ambulatory Electrocardiograms with Mixed Quality Labeling Using Neural Networks [69.25956542388653]
Deep learning (DL) algorithms are gaining weight in academic and industrial settings. We demonstrate DL can be successfully applied to low interpretative tasks by embedding ECG detection and delineation onto a segmentation framework. The model was trained using PhysioNet's QT database, comprised of 105 ambulatory ECG recordings.
arXiv Detail & Related papers (2020-05-11T16:29:12Z)

This list is automatically generated from the titles and abstracts of the papers in this site.