Hierarchical Spatio-temporal Segmentation Network for Ejection Fraction Estimation in Echocardiography Videos
- URL: http://arxiv.org/abs/2508.18681v1
- Date: Tue, 26 Aug 2025 05:04:49 GMT
- Title: Hierarchical Spatio-temporal Segmentation Network for Ejection Fraction Estimation in Echocardiography Videos
- Authors: Dongfang Wang, Jian Yang, Yizhe Zhang, Tao Zhou,
- Abstract summary: Our model aims to improve Ejection Fraction estimation accuracy by synerizing local detail modeling.<n>The hierarchical design avoids issues such as local accumulation when relying solely on single frames or details when using multi-frame data.
- Score: 20.353975738483417
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Automated segmentation of the left ventricular endocardium in echocardiography videos is a key research area in cardiology. It aims to provide accurate assessment of cardiac structure and function through Ejection Fraction (EF) estimation. Although existing studies have achieved good segmentation performance, their results do not perform well in EF estimation. In this paper, we propose a Hierarchical Spatio-temporal Segmentation Network (\ourmodel) for echocardiography video, aiming to improve EF estimation accuracy by synergizing local detail modeling with global dynamic perception. The network employs a hierarchical design, with low-level stages using convolutional networks to process single-frame images and preserve details, while high-level stages utilize the Mamba architecture to capture spatio-temporal relationships. The hierarchical design balances single-frame and multi-frame processing, avoiding issues such as local error accumulation when relying solely on single frames or neglecting details when using only multi-frame data. To overcome local spatio-temporal limitations, we propose the Spatio-temporal Cross Scan (STCS) module, which integrates long-range context through skip scanning across frames and positions. This approach helps mitigate EF calculation biases caused by ultrasound image noise and other factors.
Related papers
- Point Tracking as a Temporal Cue for Robust Myocardial Segmentation in Echocardiography Videos [2.7509305461575875]
Myocardium segmentation in echocardiography videos is a challenging task due to low contrast, noise, and anatomical variability.<n>Traditional deep learning models either process frames independently, ignoring temporal information, or rely on memory-based feature propagation.<n>We propose Point-Seg, a transformer-based segmentation framework that integrates point tracking as a temporal cue.
arXiv Detail & Related papers (2026-01-14T06:23:36Z) - Left Atrial Segmentation with nnU-Net Using MRI [0.0]
Deep learning methods have recently demonstrated superior performance in medical image segmentation tasks.<n>In this study, we applied the nnU-Net framework, an automated, self-configuring deep learning segmentation architecture, to the Left Atrial Challenge 2013 dataset.<n>The network exhibited robust generalization across variations in left atrial shape, pulmonary contrast, and image quality, accurately delineating both the atrial body and proximal veins.
arXiv Detail & Related papers (2025-11-06T05:23:45Z) - A DyL-Unet framework based on dynamic learning for Temporally Consistent Echocardiographic Segmentation [0.328418927821443]
We propose DyL-UNet, a dynamic learning-based temporal consistency U-Net segmentation architecture.<n>The framework constructs an Echo-Dynamics Graph (EDG) through dynamic learning to extract dynamic information from videos.<n>Experiments on the CAMUS and EchoNet-Dynamic datasets demonstrate that DyL-UNet maintains segmentation accuracy comparable to existing methods.
arXiv Detail & Related papers (2025-09-23T14:17:01Z) - CardiacCLIP: Video-based CLIP Adaptation for LVEF Prediction in a Few-shot Manner [14.429336783145644]
Left ventricular ejection fraction (LVEF) serves as a key indicator of heart function.<n>Existing LVEF estimation methods depend on large-scale annotated video datasets.<n>We propose CardiacCLIP, a video-based framework that enhances LVEF prediction through attention-based frame aggregation and multi-resolution input scaling.
arXiv Detail & Related papers (2025-09-21T12:52:08Z) - Multi-Scale Feature Fusion with Image-Driven Spatial Integration for Left Atrium Segmentation from Cardiac MRI Images [0.0]
We propose a framework that integrates DINOv2 as an encoder with a UNet-style decoder, incorporating multi-scale feature fusion and input image integration.<n>We validate our approach on the LAScarQS 2022 dataset and demonstrate improved performance with a 92.3% Dice and 84.1% IoU score for giant architecture.
arXiv Detail & Related papers (2025-02-10T16:12:46Z) - Automatic Left Ventricular Cavity Segmentation via Deep Spatial Sequential Network in 4D Computed Tomography Studies [20.358413194053103]
We propose a new method to automatically segment temporal cardiac images.<n>We introduce a spatial sequential (SS) network to learn the deformation and motion characteristics of the LVC in an unsupervised manner.<n>Our results on a cardiac computed tomography (CT) dataset demonstrated that our network with bi-directional learning (SS-BL) method outperformed existing methods for LVC segmentation.
arXiv Detail & Related papers (2024-12-17T12:29:32Z) - Echocardiography Segmentation Using Neural ODE-based Diffeomorphic
Registration Field [0.0]
We present a novel method for diffevolution image registration using neural ordinary differential equations (Neural ODE)
The proposed method, Echo-ODE, introduces several key improvements compared to the previous state-of-the-art.
The results show that our method surpasses the previous state-of-the-art in multiple aspects.
arXiv Detail & Related papers (2023-06-16T08:37:27Z) - Reliable Joint Segmentation of Retinal Edema Lesions in OCT Images [55.83984261827332]
In this paper, we propose a novel reliable multi-scale wavelet-enhanced transformer network.
We develop a novel segmentation backbone that integrates a wavelet-enhanced feature extractor network and a multi-scale transformer module.
Our proposed method achieves better segmentation accuracy with a high degree of reliability as compared to other state-of-the-art segmentation approaches.
arXiv Detail & Related papers (2022-12-01T07:32:56Z) - Fuzzy Attention Neural Network to Tackle Discontinuity in Airway
Segmentation [67.19443246236048]
Airway segmentation is crucial for the examination, diagnosis, and prognosis of lung diseases.
Some small-sized airway branches (e.g., bronchus and terminaloles) significantly aggravate the difficulty of automatic segmentation.
This paper presents an efficient method for airway segmentation, comprising a novel fuzzy attention neural network and a comprehensive loss function.
arXiv Detail & Related papers (2022-09-05T16:38:13Z) - Light-weight spatio-temporal graphs for segmentation and ejection
fraction prediction in cardiac ultrasound [5.597394612661975]
We propose an automated method called EchoGraphs for predicting ejection fraction and segmenting the left ventricle.
Models for direct coordinate regression based on Graph Conal Networks (GCNs) are used to detect the keypoints.
Compared to semantic segmentation, GCNs show accurate segmentation and improvements in robustness and inference runtime.
arXiv Detail & Related papers (2022-07-06T10:03:44Z) - Segmentation of Cardiac Structures via Successive Subspace Learning with
Saab Transform from Cine MRI [29.894633364282555]
We propose a machine learning model, successive subspace learning with the subspace approximation with adjusted bias (Saab) transform, for accurate and efficient segmentation from cine MRI.
Our framework performed better than state-of-the-art U-Net models with 200$times$ fewer parameters in the left ventricle, right ventricle, and myocardium.
arXiv Detail & Related papers (2021-07-22T14:50:48Z) - Few-shot Medical Image Segmentation using a Global Correlation Network
with Discriminative Embedding [60.89561661441736]
We propose a novel method for few-shot medical image segmentation.
We construct our few-shot image segmentor using a deep convolutional network trained episodically.
We enhance discriminability of deep embedding to encourage clustering of the feature domains of the same class.
arXiv Detail & Related papers (2020-12-10T04:01:07Z) - Weakly-supervised Learning For Catheter Segmentation in 3D Frustum
Ultrasound [74.22397862400177]
We propose a novel Frustum ultrasound based catheter segmentation method.
The proposed method achieved the state-of-the-art performance with an efficiency of 0.25 second per volume.
arXiv Detail & Related papers (2020-10-19T13:56:22Z) - AutoHR: A Strong End-to-end Baseline for Remote Heart Rate Measurement
with Neural Searching [76.4844593082362]
We investigate the reason why existing end-to-end networks perform poorly in challenging conditions and establish a strong baseline for remote HR measurement with architecture search (NAS)
Comprehensive experiments are performed on three benchmark datasets on both intra-temporal and cross-dataset testing.
arXiv Detail & Related papers (2020-04-26T05:43:21Z) - Segmentation-based Method combined with Dynamic Programming for Brain
Midline Delineation [42.91127174025364]
The midline related pathological image features are crucial for evaluating the severity of brain compression caused by stroke or TBI.
Most of the previous methods model the midline by localizing the anatomical points, which are hard to detect or even missing in severe cases.
In this paper, we formulate the brain midline delineation as a segmentation task and propose a three-stage framework.
arXiv Detail & Related papers (2020-02-27T05:12:18Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.