Contrastive Learning for Multi Label ECG Classification with Jaccard Score Based Sigmoid Loss
- URL: http://arxiv.org/abs/2602.10553v1
- Date: Wed, 11 Feb 2026 05:58:34 GMT
- Title: Contrastive Learning for Multi Label ECG Classification with Jaccard Score Based Sigmoid Loss
- Authors: Junichiro Takahashi, Masataka Sato, Satoshi Kodeta, Norihiko Takeda,
- Abstract summary: We focus on constructing a robust ECG encoder for multimodal pretraining using real world hospital data.<n>We employ SigLIP, a CLIP based model with a sigmoid based loss function enabling multi label prediction.<n>Experiments demonstrate that incorporating medical knowledge in the language model and applying the modified loss significantly improve multi label ECG classification.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Recent advances in large language models (LLMs) have enabled the development of multimodal medical AI. While models such as MedGemini achieve high accuracy on VQA tasks like USMLE MM, their performance on ECG based tasks remains limited, and some models, such as MedGemma, do not support ECG data at all. Interpreting ECGs is inherently challenging, and diagnostic accuracy can vary depending on the interpreter's experience. Although echocardiography provides rich diagnostic information, it requires specialized equipment and personnel, limiting its availability. In this study, we focus on constructing a robust ECG encoder for multimodal pretraining using real world hospital data. We employ SigLIP, a CLIP based model with a sigmoid based loss function enabling multi label prediction, and introduce a modified loss function tailored to the multi label nature of ECG data. Experiments demonstrate that incorporating medical knowledge in the language model and applying the modified loss significantly improve multi label ECG classification. To further enhance performance, we increase the embedding dimensionality and apply random cropping to mitigate data drift. Finally, per label analysis reveals which ECG findings are easier or harder to predict. Our study provides a foundational framework for developing medical models that utilize ECG data.
Related papers
- EfficientECG: Cross-Attention with Feature Fusion for Efficient Electrocardiogram Classification [7.5367987995144565]
We study novel deep learning technologies to effectively manage and analyse ECG data.<n>Our deep learning approaches can automatically extract the features of ECG data through end-to-end training.<n>Our evaluations on ECG datasets validate our model against state-of-the-art works in terms of high precision, multi-feature fusion, and lightweights.
arXiv Detail & Related papers (2025-12-03T13:54:33Z) - EnECG: Efficient Ensemble Learning for Electrocardiogram Multi-task Foundation Model [46.84040404474695]
EnECG is an ensemble-based framework that integrates multiple specialized foundation models, each excelling in different aspects of ECG interpretation.<n>We show that EnECG can help reduce computational and memory costs while maintaining the strong representational power of foundation models.<n>This framework not only enhances feature extraction and predictive performance but also ensures practical efficiency for real-world clinical applications.
arXiv Detail & Related papers (2025-11-28T07:22:33Z) - Simulator and Experience Enhanced Diffusion Model for Comprehensive ECG Generation [52.19347532840774]
We propose SE-Diff, a novel physiological simulator and experience enhanced diffusion model for ECG generation.<n> SE-Diff integrates a lightweight ordinary differential equation (ODE)-based ECG simulator into the diffusion process via a beat decoder.<n>Extensive experiments on real-world ECG datasets demonstrate that SE-Diff improves both signal fidelity and text-ECG semantic alignment.
arXiv Detail & Related papers (2025-11-13T02:57:10Z) - Heartcare Suite: Multi-dimensional Understanding of ECG with Raw Multi-lead Signal Modeling [50.58126509704037]
Heartcare Suite is a framework for fine-grained electrocardiogram (ECG) understanding.<n>Heartcare-220K is a high-quality, structured, and comprehensive multimodal ECG dataset.<n>Heartcare-Bench is a benchmark to guide the optimization of Medical Multimodal Large Language Models (Med-MLLMs) in ECG scenarios.
arXiv Detail & Related papers (2025-06-06T07:56:41Z) - GEM: Empowering MLLM for Grounded ECG Understanding with Time Series and Images [44.50428701650495]
We introduce GEM, the first MLLM unifying ECG time series, 12-lead ECG images and text for grounded and clinician-aligned ECG interpretation.<n> GEM enables feature-grounded analysis, evidence-driven reasoning, and a clinician-like diagnostic process through three core innovations.<n>We propose the Grounded ECG task, a clinically motivated benchmark designed to assess the MLLM's capability in grounded ECG understanding.
arXiv Detail & Related papers (2025-03-08T05:48:53Z) - Conditional Electrocardiogram Generation Using Hierarchical Variational Autoencoders [0.0]
We propose a conditional Nouveau VAE model for ECG signal generation (cNVAE-ECG)<n>This paper proposes the publicly available conditional Nouveau VAE model for ECG signal generation (cNVAE-ECG)
arXiv Detail & Related papers (2025-03-03T13:30:36Z) - Teach Multimodal LLMs to Comprehend Electrocardiographic Images [10.577263066644194]
We introduce ECGInstruct, a comprehensive ECG image instruction tuning dataset of over one million samples.
We also develop PULSE, an MLLM tailored for ECG image comprehension.
Our experiments show that PULSE sets a new state-of-the-art, outperforming general MLLMs with an average accuracy improvement of 15% to 30%.
arXiv Detail & Related papers (2024-10-21T20:26:41Z) - An Electrocardiogram Foundation Model Built on over 10 Million Recordings with External Evaluation across Multiple Domains [17.809094003643523]
ECG Foundation Model (ECGFounder) trained on over 10 million ECGs with 150 label categories from Harvard-Emory ECG Database.<n>ECGFounder achieves expert-level performance on internal validation sets, with AUROC exceeding 0.95 for eighty diagnoses.<n>When fine-tuned, ECGFounder outperforms baseline models in demographic analysis, clinical event detection, and cross-modality cardiac rhythm diagnosis.
arXiv Detail & Related papers (2024-10-05T12:12:02Z) - Semi-Supervised Learning for Multi-Label Cardiovascular Diseases
Prediction:A Multi-Dataset Study [17.84069222975825]
Current ECG-based diagnosis systems show promising performance owing to the rapid development of deep learning techniques.
Label scarcity problem, the co-occurrence of multiple CVDs and the poor performance on unseen datasets hinder the widespread application of deep learning-based models.
We propose a multi-label semi-supervised model (ECGMatch) to recognize multiple CVDs simultaneously with limited supervision.
arXiv Detail & Related papers (2023-06-18T07:46:19Z) - Generalizing electrocardiogram delineation: training convolutional
neural networks with synthetic data augmentation [63.51064808536065]
Existing databases for ECG delineation are small, being insufficient in size and in the array of pathological conditions they represent.
This article delves has two main contributions. First, a pseudo-synthetic data generation algorithm was developed, based in probabilistically composing ECG traces given "pools" of fundamental segments, as cropped from the original databases, and a set of rules for their arrangement into coherent synthetic traces.
Second, two novel segmentation-based loss functions have been developed, which attempt at enforcing the prediction of an exact number of independent structures and at producing closer segmentation boundaries by focusing on a reduced number of samples.
arXiv Detail & Related papers (2021-11-25T10:11:41Z) - ECG-DelNet: Delineation of Ambulatory Electrocardiograms with Mixed
Quality Labeling Using Neural Networks [69.25956542388653]
Deep learning (DL) algorithms are gaining weight in academic and industrial settings.
We demonstrate DL can be successfully applied to low interpretative tasks by embedding ECG detection and delineation onto a segmentation framework.
The model was trained using PhysioNet's QT database, comprised of 105 ambulatory ECG recordings.
arXiv Detail & Related papers (2020-05-11T16:29:12Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.