Related papers: UniECG: Understanding and Generating ECG in One Unified Model

UniECG: Understanding and Generating ECG in One Unified Model

URL: http://arxiv.org/abs/2509.18588v1
Date: Tue, 23 Sep 2025 03:15:53 GMT
Title: UniECG: Understanding and Generating ECG in One Unified Model
Authors: Jiarui Jin, Haoyu Wang, Xiang Lan, Jun Li, Gaofeng Cheng, Hongyan Li, Shenda Hong,
Abstract summary: We propose UniECG, the first unified model for ECG capable of concurrently performing evidence-based ECG interpretation and text-conditioned ECG generation tasks.<n>UniECG can autonomously choose to interpret or generate an ECG based on user input, significantly extending the capability boundaries of current ECG models.
Score: 26.641666246045133
License: http://creativecommons.org/licenses/by-nc-nd/4.0/
Abstract: Recent unified models such as GPT-5 have achieved encouraging progress on vision-language tasks. However, these unified models typically fail to correctly understand ECG signals and provide accurate medical diagnoses, nor can they correctly generate ECG signals. To address these limitations, we propose UniECG, the first unified model for ECG capable of concurrently performing evidence-based ECG interpretation and text-conditioned ECG generation tasks. Through a decoupled two-stage training approach, the model first learns evidence-based interpretation skills (ECG-to-Text), and then injects ECG generation capabilities (Text-to-ECG) via latent space alignment. UniECG can autonomously choose to interpret or generate an ECG based on user input, significantly extending the capability boundaries of current ECG models. Our code and checkpoints will be made publicly available at https://github.com/PKUDigitalHealth/UniECG upon acceptance.

Related papers

ECG-R1: Protocol-Guided and Modality-Agnostic MLLM for Reliable ECG Interpretation [36.244601234085856]
Existing multimodal large language models (MLLMs) remain unreliable for ECG interpretation.<n>ECG-R1 is the first reasoning MLLM designed for reliable ECG interpretation.<n>Code and data are publicly available at hrefhttp://ai.heartvoice.com.cn/ECG-R1here.
arXiv Detail & Related papers (2026-02-04T07:17:55Z)
Unveiling the Heart-Brain Connection: An Analysis of ECG in Cognitive Performance [0.1631115063641726]
ECG signals can reliably reflect cognitive load and serve as proxies for EEG-based indicators.<n>We propose a cross-modal XGBoost framework to project the ECG features onto EEG-representative cognitive spaces.<n>Our findings underpin ECG as an interpretable, real-time, wearable solution for everyday cognitive monitoring.
arXiv Detail & Related papers (2026-01-04T08:06:19Z)
EnECG: Efficient Ensemble Learning for Electrocardiogram Multi-task Foundation Model [46.84040404474695]
EnECG is an ensemble-based framework that integrates multiple specialized foundation models, each excelling in different aspects of ECG interpretation.<n>We show that EnECG can help reduce computational and memory costs while maintaining the strong representational power of foundation models.<n>This framework not only enhances feature extraction and predictive performance but also ensures practical efficiency for real-world clinical applications.
arXiv Detail & Related papers (2025-11-28T07:22:33Z)
Simulator and Experience Enhanced Diffusion Model for Comprehensive ECG Generation [52.19347532840774]
We propose SE-Diff, a novel physiological simulator and experience enhanced diffusion model for ECG generation.<n> SE-Diff integrates a lightweight ordinary differential equation (ODE)-based ECG simulator into the diffusion process via a beat decoder.<n>Extensive experiments on real-world ECG datasets demonstrate that SE-Diff improves both signal fidelity and text-ECG semantic alignment.
arXiv Detail & Related papers (2025-11-13T02:57:10Z)
ECG-aBcDe: Overcoming Model Dependence, Encoding ECG into a Universal Language for Any LLM [7.632459372363093]
Large Language Models (LLMs) hold significant promise for electrocardiogram (ECG) analysis.<n>Current methods suffer from model-specific ECG encoders, hindering transfer across LLMs.<n>We introduce ECG-aBcDe, a novel encoding method that transforms ECG signals into a universal ECG language readily interpretable by any LLM.
arXiv Detail & Related papers (2025-09-16T03:41:02Z)
EEG-MedRAG: Enhancing EEG-based Clinical Decision-Making via Hierarchical Hypergraph Retrieval-Augmented Generation [45.031633614714]
EEG-MedRAG is a three-layer hypergraph-based retrieval-augmented generation framework.<n>It unifies EEG domain knowledge, individual patient cases, and a large-scale repository into a traversable n-ary relational hypergraph.<n>We introduce the first cross-disease, cross-role EEG clinical QA benchmark, spanning seven disorders and five authentic clinical perspectives.
arXiv Detail & Related papers (2025-08-19T11:12:58Z)
GEM: Empowering MLLM for Grounded ECG Understanding with Time Series and Images [43.65650710265957]
We introduce GEM, the first MLLM unifying ECG time series, 12-lead ECG images and text for grounded and clinician-aligned ECG interpretation.<n> GEM enables feature-grounded analysis, evidence-driven reasoning, and a clinician-like diagnostic process through three core innovations.<n>We propose the Grounded ECG task, a clinically motivated benchmark designed to assess the MLLM's capability in grounded ECG understanding.
arXiv Detail & Related papers (2025-03-08T05:48:53Z)
ECG-Byte: A Tokenizer for End-to-End Generative Electrocardiogram Language Modeling [20.484166589932702]
Large Language Models (LLMs) have demonstrated exceptional versatility across domains, including applications to electrocardiograms (ECGs)<n>We propose ECG-Byte, an adapted byte pair encoding (BPE) tokenizer pipeline for autoregressive language modeling of ECGs.<n>We achieve competitive NLG performance while training 3 times faster and using just 48% of the data required by traditional two-stage methods.
arXiv Detail & Related papers (2024-12-18T22:13:21Z)
An Electrocardiogram Foundation Model Built on over 10 Million Recordings with External Evaluation across Multiple Domains [17.809094003643523]
ECG Foundation Model (ECGFounder) trained on over 10 million ECGs with 150 label categories from Harvard-Emory ECG Database.<n>ECGFounder achieves expert-level performance on internal validation sets, with AUROC exceeding 0.95 for eighty diagnoses.<n>When fine-tuned, ECGFounder outperforms baseline models in demographic analysis, clinical event detection, and cross-modality cardiac rhythm diagnosis.
arXiv Detail & Related papers (2024-10-05T12:12:02Z)
ECG-FM: An Open Electrocardiogram Foundation Model [3.8270632390229777]
We present ECG-FM, an open foundation model for ECG analysis, and conduct a study using a dataset of 1.5 million ECGs.<n>ECG-FM is a transformer-based model pretrained using a hybrid contrastive and generative self-supervised learning approach.<n>We affirm that ECG-FM is robust, label-efficient, and functionally discriminative by showcasing data scaling experiments, performing a latent space analysis, and generating saliency maps.
arXiv Detail & Related papers (2024-08-09T17:06:49Z)
MEIT: Multimodal Electrocardiogram Instruction Tuning on Large Language Models for Report Generation [28.35107188450758]
Electrocardiogram (ECG) is the primary non-invasive diagnostic tool for monitoring cardiac conditions.<n>Recent studies have concentrated on classifying cardiac conditions using ECG data but have overlooked ECG report generation.<n>We propose the Multimodal ECG Instruction Tuning (MEIT) framework, the first attempt to tackle ECG report generation with LLMs and multimodal instructions.
arXiv Detail & Related papers (2024-03-07T23:20:56Z)
ETP: Learning Transferable ECG Representations via ECG-Text Pre-training [10.856365645831728]
ECG-Text Pre-training (ETP) is an innovative framework designed to learn cross-modal representations that link ECG signals with textual reports. ETP employs an ECG encoder along with a pre-trained language model to align ECG signals with their corresponding textual reports.
arXiv Detail & Related papers (2023-09-06T19:19:26Z)
PulseNet: Deep Learning ECG-signal classification using random augmentation policy and continous wavelet transform for canines [46.09869227806991]
evaluating canine electrocardiograms (ECG) require skilled veterinarians. Current availability of veterinary cardiologists for ECG interpretation and diagnostic support is limited. We implement a deep convolutional neural network (CNN) approach for classifying canine electrocardiogram sequences as either normal or abnormal.
arXiv Detail & Related papers (2023-05-17T09:06:39Z)
ECG-DelNet: Delineation of Ambulatory Electrocardiograms with Mixed Quality Labeling Using Neural Networks [69.25956542388653]
Deep learning (DL) algorithms are gaining weight in academic and industrial settings. We demonstrate DL can be successfully applied to low interpretative tasks by embedding ECG detection and delineation onto a segmentation framework. The model was trained using PhysioNet's QT database, comprised of 105 ambulatory ECG recordings.
arXiv Detail & Related papers (2020-05-11T16:29:12Z)

This list is automatically generated from the titles and abstracts of the papers in this site.