VasoMIM: Vascular Anatomy-Aware Masked Image Modeling for Vessel Segmentation
- URL: http://arxiv.org/abs/2508.10794v1
- Date: Thu, 14 Aug 2025 16:17:02 GMT
- Title: VasoMIM: Vascular Anatomy-Aware Masked Image Modeling for Vessel Segmentation
- Authors: De-Xing Huang, Xiao-Hu Zhou, Mei-Jiang Gui, Xiao-Liang Xie, Shi-Qi Liu, Shuang-Yi Wang, Tian-Yu Xiang, Rui-Ze Ma, Nu-Fang Xiao, Zeng-Guang Hou,
- Abstract summary: We introduce Vascular anatomy-aware Masked Image Modeling (VasoMIM) for X-ray angiograms.<n>VasoMIM integrates anatomical knowledge into the pre-training process.<n>It comprises two complementary components: anatomy-guided masking strategy and anatomical consistency loss.<n>VasoMIM achieves state-of-the-art performance across three datasets.
- Score: 11.157006681644136
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Accurate vessel segmentation in X-ray angiograms is crucial for numerous clinical applications. However, the scarcity of annotated data presents a significant challenge, which has driven the adoption of self-supervised learning (SSL) methods such as masked image modeling (MIM) to leverage large-scale unlabeled data for learning transferable representations. Unfortunately, conventional MIM often fails to capture vascular anatomy because of the severe class imbalance between vessel and background pixels, leading to weak vascular representations. To address this, we introduce Vascular anatomy-aware Masked Image Modeling (VasoMIM), a novel MIM framework tailored for X-ray angiograms that explicitly integrates anatomical knowledge into the pre-training process. Specifically, it comprises two complementary components: anatomy-guided masking strategy and anatomical consistency loss. The former preferentially masks vessel-containing patches to focus the model on reconstructing vessel-relevant regions. The latter enforces consistency in vascular semantics between the original and reconstructed images, thereby improving the discriminability of vascular representations. Empirically, VasoMIM achieves state-of-the-art performance across three datasets. These findings highlight its potential to facilitate X-ray angiogram analysis.
Related papers
- Vascular anatomy-aware self-supervised pre-training for X-ray angiogram analysis [18.553513485588564]
We introduce a vascular anatomy-aware masked image modeling (VasoMIM) framework that explicitly integrates domain-specific anatomical knowledge.<n>VasoMIM comprises two key designs: an anatomy-guided masking strategy and an anatomical consistency loss.<n>In conjunction with VasoMIM, we curate XA-170K, the largest X-ray angiogram pre-training dataset to date.
arXiv Detail & Related papers (2026-02-12T03:52:44Z) - Multi-View Stenosis Classification Leveraging Transformer-Based Multiple-Instance Learning Using Real-World Clinical Data [76.89269238957593]
Coronary artery stenosis is a leading cause of cardiovascular disease, diagnosed by analyzing the coronary arteries from multiple angiography views.<n>We propose SegmentMIL, a transformer-based multi-view multiple-instance learning framework for patient-level stenosis classification.
arXiv Detail & Related papers (2026-02-02T13:07:52Z) - DGSSA: Domain generalization with structural and stylistic augmentation for retinal vessel segmentation [17.396365010722423]
Retinal vascular morphology is crucial for diagnosing diseases such as diabetes, glaucoma, and hypertension.<n>Traditional segmentation methods assume that training and testing data share similar distributions, which can lead to poor performance on unseen domains.<n>This paper presents a novel approach, DGSSA, for retinal vessel image segmentation that enhances model generalization by combining structural and style augmentation strategies.
arXiv Detail & Related papers (2025-01-07T01:47:57Z) - Enhanced MRI Representation via Cross-series Masking [48.09478307927716]
Cross-Series Masking (CSM) Strategy for effectively learning MRI representation in a self-supervised manner.<n>Method achieves state-of-the-art performance on both public and in-house datasets.
arXiv Detail & Related papers (2024-12-10T10:32:09Z) - Spatiotemporal Disentanglement of Arteriovenous Malformations in Digital
Subtraction Angiography [37.44819725897024]
The presented method aims to enhance Digital Subtraction Angiography (DSA) image series by highlighting critical information via automatic classification of vessels.
The method was tested on clinical DSA images series and demonstrated efficient differentiation between arteries and veins.
arXiv Detail & Related papers (2024-02-15T00:29:53Z) - Diffusion Models for Counterfactual Generation and Anomaly Detection in Brain Images [39.94162291765236]
We present a weakly supervised method to generate a healthy version of a diseased image and then use it to obtain a pixel-wise anomaly map.
We employ a diffusion model trained on healthy samples and combine Denoising Diffusion Probabilistic Model (DDPM) and Denoising Implicit Model (DDIM) at each step of the sampling process.
arXiv Detail & Related papers (2023-08-03T21:56:50Z) - C-DARL: Contrastive diffusion adversarial representation learning for
label-free blood vessel segmentation [39.79157116429435]
This paper presents a self-supervised vessel segmentation method, dubbed the contrastive diffusion adversarial representation learning (C-DARL) model.
Our model is composed of a diffusion module and a generation module that learns the distribution of multi-domain blood vessel data.
To validate the efficacy, C-DARL is trained using various vessel datasets, including coronary angiograms, abdominal digital subtraction angiograms, and retinal imaging.
arXiv Detail & Related papers (2023-07-31T23:09:01Z) - Patched Diffusion Models for Unsupervised Anomaly Detection in Brain MRI [55.78588835407174]
We propose a method that reformulates the generation task of diffusion models as a patch-based estimation of healthy brain anatomy.
We evaluate our approach on data of tumors and multiple sclerosis lesions and demonstrate a relative improvement of 25.1% compared to existing baselines.
arXiv Detail & Related papers (2023-03-07T09:40:22Z) - Diffusion Adversarial Representation Learning for Self-supervised Vessel
Segmentation [36.65094442100924]
Vessel segmentation in medical images is one of the important tasks in the diagnosis of vascular diseases and therapy planning.
We introduce a novel diffusion adversarial representation learning (DARL) model that leverages a denoising diffusion probabilistic model with adversarial learning.
Our method significantly outperforms existing unsupervised and self-supervised methods in vessel segmentation.
arXiv Detail & Related papers (2022-09-29T06:06:15Z) - Learning-Based Keypoint Registration for Fetoscopic Mosaicking [65.02392513942533]
In Twin-to-Twin Transfusion Syndrome (TTTS), abnormal vascular anastomoses in the monochorionic placenta can produce uneven blood flow between the two fetuses.
We propose a learning-based framework for in-vivo fetoscopy frame registration for field-of-view expansion.
arXiv Detail & Related papers (2022-07-26T21:21:12Z) - RAVIR: A Dataset and Methodology for the Semantic Segmentation and
Quantitative Analysis of Retinal Arteries and Veins in Infrared Reflectance
Imaging [7.316426736150123]
We present a novel dataset, dubbed RAVIR, for the semantic segmentation of Retinal Arteries and Veins in Infrared Reflectance (IR) imaging.
We propose a novel deep learning-based methodology, denoted as SegRAVIR, for the semantic segmentation of retinal arteries and veins.
Our experiments validate the effectiveness of SegRAVIR and demonstrate its superior performance in comparison to state-of-the-art models.
arXiv Detail & Related papers (2022-03-28T17:30:29Z) - Symmetry-Enhanced Attention Network for Acute Ischemic Infarct
Segmentation with Non-Contrast CT Images [50.55978219682419]
We propose a symmetry enhanced attention network (SEAN) for acute ischemic infarct segmentation.
Our proposed network automatically transforms an input CT image into the standard space where the brain tissue is bilaterally symmetric.
The proposed SEAN outperforms some symmetry-based state-of-the-art methods in terms of both dice coefficient and infarct localization.
arXiv Detail & Related papers (2021-10-11T07:13:26Z) - Retinopathy of Prematurity Stage Diagnosis Using Object Segmentation and
Convolutional Neural Networks [68.96150598294072]
Retinopathy of Prematurity (ROP) is an eye disorder primarily affecting premature infants with lower weights.
It causes proliferation of vessels in the retina and could result in vision loss and, eventually, retinal detachment, leading to blindness.
In recent years, there has been a significant effort to automate the diagnosis using deep learning.
This paper builds upon the success of previous models and develops a novel architecture, which combines object segmentation and convolutional neural networks (CNN)
Our proposed system first trains an object segmentation model to identify the demarcation line at a pixel level and adds the resulting mask as an additional "color" channel in
arXiv Detail & Related papers (2020-04-03T14:07:41Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.