Investigating Deep Learning Models for Ejection Fraction Estimation from Echocardiography Videos
- URL: http://arxiv.org/abs/2512.22657v1
- Date: Sat, 27 Dec 2025 17:11:17 GMT
- Title: Investigating Deep Learning Models for Ejection Fraction Estimation from Echocardiography Videos
- Authors: Shravan Saranyan, Pramit Saha,
- Abstract summary: Left ventricular ejection fraction (LVEF) is a key indicator of cardiac function.<n>Deep learning approaches offer the potential to achieve performance comparable to that of experienced human experts.
- Score: 2.86829428083307
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Left ventricular ejection fraction (LVEF) is a key indicator of cardiac function and plays a central role in the diagnosis and management of cardiovascular disease. Echocardiography, as a readily accessible and non-invasive imaging modality, is widely used in clinical practice to estimate LVEF. However, manual assessment of cardiac function from echocardiograms is time-consuming and subject to considerable inter-observer variability. Deep learning approaches offer a promising alternative, with the potential to achieve performance comparable to that of experienced human experts. In this study, we investigate the effectiveness of several deep learning architectures for LVEF estimation from echocardiography videos, including 3D Inception, two-stream, and CNN-RNN models. We systematically evaluate architectural modifications and fusion strategies to identify configurations that maximize prediction accuracy. Models were trained and evaluated on the EchoNet-Dynamic dataset, comprising 10,030 echocardiogram videos. Our results demonstrate that modified 3D Inception architectures achieve the best overall performance, with a root mean squared error (RMSE) of 6.79%. Across architectures, we observe a tendency toward overfitting, with smaller and simpler models generally exhibiting improved generalization. Model performance was also found to be highly sensitive to hyperparameter choices, particularly convolutional kernel sizes and normalization strategies. While this study focuses on echocardiography-based LVEF estimation, the insights gained regarding architectural design and training strategies may be applicable to a broader range of medical and non-medical video analysis tasks.
Related papers
- VISION-ICE: Video-based Interpretation and Spatial Identification of Arrhythmia Origins via Neural Networks in Intracardiac Echocardiography [3.3977894536938273]
Arrhythmia source localization is formulated as a three-class classification task, distinguishing normal sinus rhythm, left-sided, and right-sided arrhythmias.<n>We developed a 3D Convolutional Neural Network trained to discriminate among the three aforementioned classes.<n>In ten-fold cross-validation, the model achieved a mean accuracy of 66.2% when evaluated on four previously unseen patients.
arXiv Detail & Related papers (2026-02-04T06:02:03Z) - Left Atrial Segmentation with nnU-Net Using MRI [0.0]
Deep learning methods have recently demonstrated superior performance in medical image segmentation tasks.<n>In this study, we applied the nnU-Net framework, an automated, self-configuring deep learning segmentation architecture, to the Left Atrial Challenge 2013 dataset.<n>The network exhibited robust generalization across variations in left atrial shape, pulmonary contrast, and image quality, accurately delineating both the atrial body and proximal veins.
arXiv Detail & Related papers (2025-11-06T05:23:45Z) - CardiacCLIP: Video-based CLIP Adaptation for LVEF Prediction in a Few-shot Manner [14.429336783145644]
Left ventricular ejection fraction (LVEF) serves as a key indicator of heart function.<n>Existing LVEF estimation methods depend on large-scale annotated video datasets.<n>We propose CardiacCLIP, a video-based framework that enhances LVEF prediction through attention-based frame aggregation and multi-resolution input scaling.
arXiv Detail & Related papers (2025-09-21T12:52:08Z) - Sensing Cardiac Health Across Scenarios and Devices: A Multi-Modal Foundation Model Pretrained on Heterogeneous Data from 1.7 Million Individuals [36.08910150609342]
We present a cardiac sensing foundation model (CSFM) that learns unified representations from vast, heterogeneous health records.<n>Our model is pretrained on an innovative multi-modal integration of data from multiple large-scale datasets.<n> CSFM consistently outperforms traditional one-modal-one-task approaches.
arXiv Detail & Related papers (2025-06-23T20:58:12Z) - Patient-Specific Dynamic Digital-Physical Twin for Coronary Intervention Training: An Integrated Mixed Reality Approach [33.92599418560439]
Existing training systems lack accurate simulation of cardiac physiological dynamics.<n>This study develops a comprehensive dynamic cardiac model research framework based on 4D-CTA.
arXiv Detail & Related papers (2025-05-16T06:13:55Z) - EchoWorld: Learning Motion-Aware World Models for Echocardiography Probe Guidance [79.66329903007869]
We present EchoWorld, a motion-aware world modeling framework for probe guidance.<n>It encodes anatomical knowledge and motion-induced visual dynamics.<n>It is trained on more than one million ultrasound images from over 200 routine scans.
arXiv Detail & Related papers (2025-04-17T16:19:05Z) - Preserving Cardiac Integrity: A Topology-Infused Approach to Whole Heart Segmentation [6.495726693226574]
Whole heart segmentation (WHS) supports cardiovascular disease diagnosis, disease monitoring, treatment planning, and prognosis.
This paper introduces a new topology-preserving module that is integrated into deep neural networks.
The implementation achieves anatomically plausible segmentation by using learned topology-preserving fields, which are based entirely on 3D convolution and are therefore very effective for 3D voxel data.
arXiv Detail & Related papers (2024-10-14T14:32:05Z) - ECG Arrhythmia Detection Using Disease-specific Attention-based Deep Learning Model [0.0]
We propose a disease-specific attention-based deep learning model (DANet) for arrhythmia detection from short ECG recordings.
The novel idea is to introduce a soft-coding or hard-coding waveform enhanced module into existing deep neural networks.
For the soft-coding DANet, we also develop a learning framework combining self-supervised pre-training with two-stage supervised training.
arXiv Detail & Related papers (2024-07-25T13:27:10Z) - A Generalizable Deep Learning System for Cardiac MRI [29.429744474335347]
We describe a foundational vision system for cardiac MRI, capable of representing the breadth of human cardiovascular disease and health.
Our deep learning model is trained via self-supervised contrastive learning, by which visual concepts in cine-sequence cardiac MRI scans are learned from the raw text of the accompanying radiology reports.
We show that our deep learning system is capable of not only understanding the staggering complexity of human cardiovascular disease, but can be directed towards clinical problems of interest yielding impressive, clinical grade diagnostic accuracy with a fraction of the training data typically required for such tasks.
arXiv Detail & Related papers (2023-12-01T05:27:29Z) - Successive Subspace Learning for Cardiac Disease Classification with
Two-phase Deformation Fields from Cine MRI [36.044984400761535]
This work proposes a lightweight successive subspace learning framework for CVD classification.
It is based on an interpretable feedforward design, in conjunction with a cardiac atlas.
Compared with 3D CNN-based approaches, our framework achieves superior classification performance with 140$times$ fewer parameters.
arXiv Detail & Related papers (2023-01-21T15:00:59Z) - HeartBEiT: Vision Transformer for Electrocardiogram Data Improves
Diagnostic Performance at Low Sample Sizes [28.88454028731653]
We create the first vision-based transformer model, HeartBEiT, for electrocardiogram waveform analysis.
We show that HeartBEiT has significantly higher performance at lower sample sizes compared to other models.
We also show that HeartBEiT improves explainability of diagnosis by highlighting biologically relevant regions of the EKG vs. standard CNNs.
arXiv Detail & Related papers (2022-12-13T16:39:21Z) - A Global Benchmark of Algorithms for Segmenting Late Gadolinium-Enhanced
Cardiac Magnetic Resonance Imaging [90.29017019187282]
" 2018 Left Atrium Challenge" using 154 3D LGE-MRIs, currently the world's largest cardiac LGE-MRI dataset.
Analyse of the submitted algorithms using technical and biological metrics was performed.
Results show the top method achieved a dice score of 93.2% and a mean surface to a surface distance of 0.7 mm.
arXiv Detail & Related papers (2020-04-26T08:49:17Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.