Related papers: Comparing Deep Neural Network for Multi-Label ECG Diagnosis From Scanned ECG

Comparing Deep Neural Network for Multi-Label ECG Diagnosis From Scanned ECG

URL: http://arxiv.org/abs/2502.14909v2
Date: Thu, 06 Mar 2025 05:18:12 GMT
Title: Comparing Deep Neural Network for Multi-Label ECG Diagnosis From Scanned ECG
Authors: Cuong V. Nguyen, Hieu X. Nguyen, Dung D. Pham Minh, Cuong D. Do,
Abstract summary: We evaluate the performance of multiple deep neural network architectures, including AlexNet, VGG, ResNet, and Vision Transformer, on scanned ECG datasets.<n>Our comparative analysis examines model accuracy, robustness to image artifacts, and generalizability across different ECG conditions.<n>The findings highlight the strengths and limitations of each architecture, providing insights into the feasibility of image-based ECG diagnosis.
Score: 1.2499537119440243
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Automated ECG diagnosis has seen significant advancements with deep learning techniques, but real-world applications still face challenges when dealing with scanned paper ECGs. In this study, we explore multi-label classification of ECGs extracted from scanned images, moving beyond traditional binary classification (normal/abnormal). We evaluate the performance of multiple deep neural network architectures, including AlexNet, VGG, ResNet, and Vision Transformer, on scanned ECG datasets. Our comparative analysis examines model accuracy, robustness to image artifacts, and generalizability across different ECG conditions. Additionally, we investigate whether ECG signals extracted from scanned images retain sufficient diagnostic information for reliable automated classification. The findings highlight the strengths and limitations of each architecture, providing insights into the feasibility of image-based ECG diagnosis and its potential integration into clinical workflows.

Related papers

A Deep Learning Pipeline Using Synthetic Data to Improve Interpretation of Paper ECG Images [8.559073054541754]
Cardiovascular diseases (CVDs) are the leading global cause of death, and early detection is essential to improve patient outcomes.<n>We propose a deep learning framework designed specifically to classify paper-like ECG images into five main diagnostic categories.<n>Our method was the winning entry to the 2024 British Heart Foundation Open Data Science Challenge.
arXiv Detail & Related papers (2025-07-29T16:16:17Z)
Heartcare Suite: Multi-dimensional Understanding of ECG with Raw Multi-lead Signal Modeling [50.58126509704037]
Heartcare Suite is a framework for fine-grained electrocardiogram (ECG) understanding.<n>Heartcare-220K is a high-quality, structured, and comprehensive multimodal ECG dataset.<n>Heartcare-Bench is a benchmark to guide the optimization of Medical Multimodal Large Language Models (Med-MLLMs) in ECG scenarios.
arXiv Detail & Related papers (2025-06-06T07:56:41Z)
GEM: Empowering MLLM for Grounded ECG Understanding with Time Series and Images [43.65650710265957]
We introduce GEM, the first MLLM unifying ECG time series, 12-lead ECG images and text for grounded and clinician-aligned ECG interpretation. GEM enables feature-grounded analysis, evidence-driven reasoning, and a clinician-like diagnostic process through three core innovations. We propose the Grounded ECG task, a clinically motivated benchmark designed to assess the MLLM's capability in grounded ECG understanding.
arXiv Detail & Related papers (2025-03-08T05:48:53Z)
CognitionCapturer: Decoding Visual Stimuli From Human EEG Signal With Multimodal Information [61.1904164368732]
We propose CognitionCapturer, a unified framework that fully leverages multimodal data to represent EEG signals. Specifically, CognitionCapturer trains Modality Experts for each modality to extract cross-modal information from the EEG modality. The framework does not require any fine-tuning of the generative models and can be extended to incorporate more modalities.
arXiv Detail & Related papers (2024-12-13T16:27:54Z)
Teach Multimodal LLMs to Comprehend Electrocardiographic Images [10.577263066644194]
We introduce ECGInstruct, a comprehensive ECG image instruction tuning dataset of over one million samples. We also develop PULSE, an MLLM tailored for ECG image comprehension. Our experiments show that PULSE sets a new state-of-the-art, outperforming general MLLMs with an average accuracy improvement of 15% to 30%.
arXiv Detail & Related papers (2024-10-21T20:26:41Z)
ECG-Image-Database: A Dataset of ECG Images with Real-World Imaging and Scanning Artifacts; A Foundation for Computerized ECG Image Digitization and Analysis [4.263536786122581]
ECG-Image-Database is a large and diverse collection of electrocardiogram (ECG) images generated from ECG time-series data. We used ECG-Image-Kit, an open-source Python toolkit, to generate realistic images of 12-lead ECG printouts from raw ECG time-series. The resulting dataset includes 35,595 software-labeled ECG images with a wide range of imaging artifacts and distortions.
arXiv Detail & Related papers (2024-09-25T04:30:19Z)
VizECGNet: Visual ECG Image Network for Cardiovascular Diseases Classification with Multi-Modal Training and Knowledge Distillation [0.7405975743268344]
In practice, ECG data is stored as either digitized signals or printed images. We propose VizECGNet, which uses only printed ECG graphics to determine the prognosis of multiple cardiovascular diseases.
arXiv Detail & Related papers (2024-08-06T01:34:43Z)
MEIT: Multi-Modal Electrocardiogram Instruction Tuning on Large Language Models for Report Generation [41.324530807795256]
Electrocardiogram (ECG) is the primary non-invasive diagnostic tool for monitoring cardiac conditions. Recent studies have concentrated on classifying cardiac conditions using ECG data but have overlooked ECG report generation. We propose the Multimodal ECG Instruction Tuning (MEIT) framework, the first attempt to tackle ECG report generation with LLMs and multimodal instructions.
arXiv Detail & Related papers (2024-03-07T23:20:56Z)
Graph Neural Networks for Topological Feature Extraction in ECG Classification [11.337163242503166]
We propose three techniques for classifying heartbeats using graph neural networks. The three proposed techniques are capable of making arrhythmia classification predictions with the accuracy of 99.38, 98.76, and 91.93 percent, respectively.
arXiv Detail & Related papers (2023-11-02T16:14:34Z)
DGSD: Dynamical Graph Self-Distillation for EEG-Based Auditory Spatial Attention Detection [49.196182908826565]
Auditory Attention Detection (AAD) aims to detect target speaker from brain signals in a multi-speaker environment. Current approaches primarily rely on traditional convolutional neural network designed for processing Euclidean data like images. This paper proposes a dynamical graph self-distillation (DGSD) approach for AAD, which does not require speech stimuli as input.
arXiv Detail & Related papers (2023-09-07T13:43:46Z)
LOTUS: Learning to Optimize Task-based US representations [39.81131738128329]
Anatomical segmentation of organs in ultrasound images is essential to many clinical applications. Existing deep neural networks require a large amount of labeled data for training in order to achieve clinically acceptable performance. In this paper, we propose a novel approach for learning to optimize task-based ultra-sound image representations.
arXiv Detail & Related papers (2023-07-29T16:29:39Z)
ECG-Image-Kit: A Synthetic Image Generation Toolbox to Facilitate Deep Learning-Based Electrocardiogram Digitization [3.4579920352329787]
We introduce ECG-Image-Kit, an open-source toolbox for generating synthetic multi-lead ECG images with realistic artifacts from time-series data. As a case study, we used ECG-Image-Kit to create a dataset of 21,801 ECG images from the PhysioNet QT database. We trained a combination of a traditional computer vision and deep neural network model on this dataset to convert synthetic images into time-series data.
arXiv Detail & Related papers (2023-07-04T22:42:55Z)
DreamDiffusion: Generating High-Quality Images from Brain EEG Signals [42.30835251506628]
DreamDiffusion is a novel method for generating high-quality images directly from brain electroencephalogram (EEG) signals. The proposed method overcomes the challenges of using EEG signals for image generation, such as noise, limited information, and individual differences.
arXiv Detail & Related papers (2023-06-29T13:33:02Z)
Automated Cardiovascular Record Retrieval by Multimodal Learning between Electrocardiogram and Clinical Report [28.608260758775316]
We introduce a novel approach to ECG interpretation, leveraging recent breakthroughs in Large Language Models (LLMs) and Vision-Transformer (ViT) models. We propose an alternative method of automatically identifying the most similar clinical cases based on the input ECG data. Our findings could serve as a crucial resource for providing diagnostic services in underdeveloped regions.
arXiv Detail & Related papers (2023-04-13T06:32:25Z)
Auto Lead Extraction and Digitization of ECG Paper Records using cGAN [0.23624125155742054]
ECG signals are generally stored in paper form, which makes it difficult to store and analyze the data. We propose a deep learning-based model for individually extracting all 12 leads from 12-lead ECG images. We also propose a method to convert the paper ECG format into a storable digital format.
arXiv Detail & Related papers (2022-11-12T18:36:29Z)
Preservation of High Frequency Content for Deep Learning-Based Medical Image Classification [74.84221280249876]
An efficient analysis of large amounts of chest radiographs can aid physicians and radiologists. We propose a novel Discrete Wavelet Transform (DWT)-based method for the efficient identification and encoding of visual information.
arXiv Detail & Related papers (2022-05-08T15:29:54Z)
ECG-DelNet: Delineation of Ambulatory Electrocardiograms with Mixed Quality Labeling Using Neural Networks [69.25956542388653]
Deep learning (DL) algorithms are gaining weight in academic and industrial settings. We demonstrate DL can be successfully applied to low interpretative tasks by embedding ECG detection and delineation onto a segmentation framework. The model was trained using PhysioNet's QT database, comprised of 105 ambulatory ECG recordings.
arXiv Detail & Related papers (2020-05-11T16:29:12Z)
Opportunities and Challenges of Deep Learning Methods for Electrocardiogram Data: A Systematic Review [62.490310870300746]
The electrocardiogram (ECG) is one of the most commonly used diagnostic tools in medicine and healthcare. Deep learning methods have achieved promising results on predictive healthcare tasks using ECG signals. This paper presents a systematic review of deep learning methods for ECG data from both modeling and application perspectives.
arXiv Detail & Related papers (2019-12-28T02:44:29Z)

This list is automatically generated from the titles and abstracts of the papers in this site.