ECG-Image-Database: A Dataset of ECG Images with Real-World Imaging and Scanning Artifacts; A Foundation for Computerized ECG Image Digitization and Analysis
- URL: http://arxiv.org/abs/2409.16612v1
- Date: Wed, 25 Sep 2024 04:30:19 GMT
- Title: ECG-Image-Database: A Dataset of ECG Images with Real-World Imaging and Scanning Artifacts; A Foundation for Computerized ECG Image Digitization and Analysis
- Authors: Matthew A. Reyna, Deepanshi, James Weigle, Zuzana Koscova, Kiersten Campbell, Kshama Kodthalu Shivashankara, Soheil Saghafi, Sepideh Nikookar, Mohsen Motie-Shirazi, Yashar Kiarashi, Salman Seyedi, Gari D. Clifford, Reza Sameni,
- Abstract summary: ECG-Image-Database is a large and diverse collection of electrocardiogram (ECG) images generated from ECG time-series data.
We used ECG-Image-Kit, an open-source Python toolkit, to generate realistic images of 12-lead ECG printouts from raw ECG time-series.
The resulting dataset includes 35,595 software-labeled ECG images with a wide range of imaging artifacts and distortions.
- Score: 4.263536786122581
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: We introduce the ECG-Image-Database, a large and diverse collection of electrocardiogram (ECG) images generated from ECG time-series data, with real-world scanning, imaging, and physical artifacts. We used ECG-Image-Kit, an open-source Python toolkit, to generate realistic images of 12-lead ECG printouts from raw ECG time-series. The images include realistic distortions such as noise, wrinkles, stains, and perspective shifts, generated both digitally and physically. The toolkit was applied to 977 12-lead ECG records from the PTB-XL database and 1,000 from Emory Healthcare to create high-fidelity synthetic ECG images. These unique images were subjected to both programmatic distortions using ECG-Image-Kit and physical effects like soaking, staining, and mold growth, followed by scanning and photography under various lighting conditions to create real-world artifacts. The resulting dataset includes 35,595 software-labeled ECG images with a wide range of imaging artifacts and distortions. The dataset provides ground truth time-series data alongside the images, offering a reference for developing machine and deep learning models for ECG digitization and classification. The images vary in quality, from clear scans of clean papers to noisy photographs of degraded papers, enabling the development of more generalizable digitization algorithms. ECG-Image-Database addresses a critical need for digitizing paper-based and non-digital ECGs for computerized analysis, providing a foundation for developing robust machine and deep learning models capable of converting ECG images into time-series. The dataset aims to serve as a reference for ECG digitization and computerized annotation efforts. ECG-Image-Database was used in the PhysioNet Challenge 2024 on ECG image digitization and classification.
Related papers
- Comparing Deep Neural Network for Multi-Label ECG Diagnosis From Scanned ECG [1.2499537119440243]
We evaluate the performance of multiple deep neural network architectures, including AlexNet, VGG, ResNet, and Vision Transformer, on scanned ECG datasets.
Our comparative analysis examines model accuracy, robustness to image artifacts, and generalizability across different ECG conditions.
The findings highlight the strengths and limitations of each architecture, providing insights into the feasibility of image-based ECG diagnosis.
arXiv Detail & Related papers (2025-02-19T02:56:27Z) - Teach Multimodal LLMs to Comprehend Electrocardiographic Images [10.577263066644194]
We introduce ECGInstruct, a comprehensive ECG image instruction tuning dataset of over one million samples.
We also develop PULSE, an MLLM tailored for ECG image comprehension.
Our experiments show that PULSE sets a new state-of-the-art, outperforming general MLLMs with an average accuracy improvement of 15% to 30%.
arXiv Detail & Related papers (2024-10-21T20:26:41Z) - Combining Hough Transform and Deep Learning Approaches to Reconstruct ECG Signals From Printouts [2.374912052693646]
This work presents our team's winning contribution to the 2024 George B. Moody PhysioNet Challenge.
The Challenge had two goals: reconstruct ECG signals from printouts and classify them for cardiac diseases.
Our model achieved an average CV signal-to-noise ratio of 17.02 and an official Challenge score of 12.15 on the hidden set, securing first place in the competition.
arXiv Detail & Related papers (2024-10-18T05:36:24Z) - VizECGNet: Visual ECG Image Network for Cardiovascular Diseases Classification with Multi-Modal Training and Knowledge Distillation [0.7405975743268344]
In practice, ECG data is stored as either digitized signals or printed images.
We propose VizECGNet, which uses only printed ECG graphics to determine the prognosis of multiple cardiovascular diseases.
arXiv Detail & Related papers (2024-08-06T01:34:43Z) - End-to-End Model-based Deep Learning for Dual-Energy Computed Tomography Material Decomposition [53.14236375171593]
We propose a deep learning procedure called End-to-End Material Decomposition (E2E-DEcomp) for quantitative material decomposition.
We show the effectiveness of the proposed direct E2E-DEcomp method on the AAPM spectral CT dataset.
arXiv Detail & Related papers (2024-06-01T16:20:59Z) - CMRxRecon: An open cardiac MRI dataset for the competition of
accelerated image reconstruction [62.61209705638161]
There has been growing interest in deep learning-based CMR imaging algorithms.
Deep learning methods require large training datasets.
This dataset includes multi-contrast, multi-view, multi-slice and multi-coil CMR imaging data from 300 subjects.
arXiv Detail & Related papers (2023-09-19T15:14:42Z) - ECG-Image-Kit: A Synthetic Image Generation Toolbox to Facilitate Deep
Learning-Based Electrocardiogram Digitization [3.4579920352329787]
We introduce ECG-Image-Kit, an open-source toolbox for generating synthetic multi-lead ECG images with realistic artifacts from time-series data.
As a case study, we used ECG-Image-Kit to create a dataset of 21,801 ECG images from the PhysioNet QT database.
We trained a combination of a traditional computer vision and deep neural network model on this dataset to convert synthetic images into time-series data.
arXiv Detail & Related papers (2023-07-04T22:42:55Z) - Subjective and Objective Quality Assessment for in-the-Wild Computer
Graphics Images [57.02760260360728]
We build a large-scale in-the-wild CGIQA database consisting of 6,000 CGIs (CGIQA-6k)
We propose an effective deep learning-based no-reference (NR) IQA model by utilizing both distortion and aesthetic quality representation.
Experimental results show that the proposed method outperforms all other state-of-the-art NR IQA methods on the constructed CGIQA-6k database.
arXiv Detail & Related papers (2023-03-14T16:32:24Z) - Auto Lead Extraction and Digitization of ECG Paper Records using cGAN [0.23624125155742054]
ECG signals are generally stored in paper form, which makes it difficult to store and analyze the data.
We propose a deep learning-based model for individually extracting all 12 leads from 12-lead ECG images.
We also propose a method to convert the paper ECG format into a storable digital format.
arXiv Detail & Related papers (2022-11-12T18:36:29Z) - ECG-DelNet: Delineation of Ambulatory Electrocardiograms with Mixed
Quality Labeling Using Neural Networks [69.25956542388653]
Deep learning (DL) algorithms are gaining weight in academic and industrial settings.
We demonstrate DL can be successfully applied to low interpretative tasks by embedding ECG detection and delineation onto a segmentation framework.
The model was trained using PhysioNet's QT database, comprised of 105 ambulatory ECG recordings.
arXiv Detail & Related papers (2020-05-11T16:29:12Z) - Pathological Retinal Region Segmentation From OCT Images Using Geometric
Relation Based Augmentation [84.7571086566595]
We propose improvements over previous GAN-based medical image synthesis methods by jointly encoding the intrinsic relationship of geometry and shape.
The proposed method outperforms state-of-the-art segmentation methods on the public RETOUCH dataset having images captured from different acquisition procedures.
arXiv Detail & Related papers (2020-03-31T11:50:43Z) - Opportunities and Challenges of Deep Learning Methods for
Electrocardiogram Data: A Systematic Review [62.490310870300746]
The electrocardiogram (ECG) is one of the most commonly used diagnostic tools in medicine and healthcare.
Deep learning methods have achieved promising results on predictive healthcare tasks using ECG signals.
This paper presents a systematic review of deep learning methods for ECG data from both modeling and application perspectives.
arXiv Detail & Related papers (2019-12-28T02:44:29Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.