3D Conditional Image Synthesis of Left Atrial LGE MRI from Composite Semantic Masks
- URL: http://arxiv.org/abs/2601.04588v1
- Date: Thu, 08 Jan 2026 04:35:40 GMT
- Title: 3D Conditional Image Synthesis of Left Atrial LGE MRI from Composite Semantic Masks
- Authors: Yusri Al-Sanaani, Rebecca Thornhill, Sreeraman Rajan
- Abstract summary: We develop a pipeline to synthesize high-fidelity 3D LGE MRI volumes using 3D conditional generators. SPADE-LDM generates the most realistic and structurally accurate images. When augmented with synthetic LGE images, the Dice score for LA cavity segmentation with a 3D U-Net model improved.
- Score: 1.452875650827562
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Segmentation of the left atrial (LA) wall and endocardium from late gadolinium-enhanced (LGE) MRI is essential for quantifying atrial fibrosis in patients with atrial fibrillation. The development of accurate machine learning-based segmentation models remains challenging due to the limited availability of data and the complexity of anatomical structures. In this work, we investigate 3D conditional generative models as a potential solution for augmenting scarce LGE training data and improving LA segmentation performance. We develop a pipeline to synthesize high-fidelity 3D LGE MRI volumes from composite semantic label maps combining anatomical expert annotations with unsupervised tissue clusters, using three 3D conditional generators (Pix2Pix GAN, SPADE-GAN, and SPADE-LDM). The synthetic images are evaluated for realism and their impact on downstream LA segmentation. SPADE-LDM generates the most realistic and structurally accurate images, achieving an FID of 4.063 and surpassing GAN models, which have FIDs of 40.821 and 7.652 for Pix2Pix and SPADE-GAN, respectively. When augmented with synthetic LGE images, the Dice score for LA cavity segmentation with a 3D U-Net model improved from 0.908 to 0.936, showing a statistically significant improvement (p < 0.05) over the baseline. These findings demonstrate the potential of label-conditioned 3D synthesis to enhance the segmentation of under-represented cardiac structures.
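The abstract relies on two ideas that a short example may make concrete: composite label maps that merge expert annotations with unsupervised tissue clusters, and the Dice score used to evaluate LA cavity segmentation. The following is a minimal sketch, not the authors' implementation. The function names, the flat-list voxel representation, and the merge rule (expert labels take priority, cluster ids are shifted past the expert label range) are all assumptions made for illustration.

```python
from typing import List, Sequence


def composite_mask(expert: Sequence[int], clusters: Sequence[int],
                   num_expert_classes: int) -> List[int]:
    """Merge expert labels with tissue-cluster ids (hypothetical merge rule).

    Assumes expert labels are 0 (unlabeled) or 1..num_expert_classes, and
    cluster ids start at 0. Unlabeled voxels receive their cluster id shifted
    past the expert range so the two label sets never collide.
    """
    if len(expert) != len(clusters):
        raise ValueError("expert and clusters must have the same length")
    offset = num_expert_classes + 1
    return [e if e != 0 else offset + c for e, c in zip(expert, clusters)]


def dice_score(pred: Sequence[int], truth: Sequence[int], label: int = 1) -> float:
    """Dice = 2|A and B| / (|A| + |B|) for one label over flattened voxels."""
    if len(pred) != len(truth):
        raise ValueError("pred and truth must have the same length")
    pred_count = sum(1 for p in pred if p == label)
    truth_count = sum(1 for t in truth if t == label)
    overlap = sum(1 for p, t in zip(pred, truth) if p == label and t == label)
    denom = pred_count + truth_count
    # Convention assumed here: two empty masks count as perfect agreement.
    return 1.0 if denom == 0 else 2.0 * overlap / denom


# Toy flattened volume: two expert classes (1, 2) and three tissue clusters.
expert_labels = [0, 1, 2, 0]
tissue_clusters = [0, 1, 1, 2]
merged = composite_mask(expert_labels, tissue_clusters, num_expert_classes=2)

# Toy segmentation result compared against a reference mask for label 1.
score = dice_score([1, 1, 0, 0], [1, 0, 0, 0], label=1)
```

In practice the volumes would be 3D arrays and the clusters would come from an unsupervised method chosen by the authors; the abstract does not specify either detail, so the flat lists here only illustrate the bookkeeping.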
Related papers
- Exploiting Completeness Perception with Diffusion Transformer for Unified 3D MRI Synthesis [9.857855424798732]
We propose CoPeDiT, a latent diffusion model equipped with completeness perception for unified synthesis of 3D MRIs. CoPeDiT significantly outperforms state-of-the-art methods, achieving superior robustness, generalizability, and flexibility.
arXiv Detail & Related papers (2026-02-20T18:05:39Z)
- ERGO: Excess-Risk-Guided Optimization for High-Fidelity Monocular 3D Gaussian Splatting [63.138778159026934]
We propose an adaptive optimization framework guided by excess risk decomposition, termed ERGO. ERGO dynamically estimates the view-specific excess risk and adaptively adjusts loss weights during optimization. Experiments on the Google Scanned Objects dataset and the OmniObject3D dataset demonstrate the superiority of ERGO over existing state-of-the-art methods.
arXiv Detail & Related papers (2026-02-10T20:44:43Z)
- Accelerating 3D Photoacoustic Computed Tomography with End-to-End Physics-Aware Neural Operators [74.65171736966131]
Photoacoustic computed tomography (PACT) combines optical contrast with ultrasonic resolution, achieving deep-tissue imaging beyond the optical diffusion limit. Current implementations require dense transducer arrays and prolonged acquisition times, limiting clinical translation. We introduce Pano, an end-to-end physics-aware model that directly learns the inverse acoustic mapping from sensor measurements to volumetric reconstructions.
arXiv Detail & Related papers (2025-09-11T23:12:55Z)
- AnatoMaskGAN: GNN-Driven Slice Feature Fusion and Noise Augmentation for Medical Semantic Image Synthesis [1.5295022700131624]
AnatoMaskGAN embeds slice-related spatial features to precisely aggregate inter-slice contextual dependencies. We introduce diverse image-augmentation strategies and optimize deep feature learning to improve performance on complex medical images.
arXiv Detail & Related papers (2025-08-15T10:19:38Z)
- SAGCNet: Spatial-Aware Graph Completion Network for Missing Slice Imputation in Population CMR Imaging [3.0273769091742144]
Volumetric MRI synthesis methods have been developed to impute missing slices from available ones. The inherent 3D nature of volumetric MRI data, such as cardiac magnetic resonance (CMR), poses significant challenges for missing slice imputation approaches. We present the Spatial-Aware Graph Completion Network (SAGCNet) to overcome the dependency on complete volumetric data.
arXiv Detail & Related papers (2025-08-09T16:56:07Z)
- TRACE: Temporally Reliable Anatomically-Conditioned 3D CT Generation with Enhanced Efficiency [40.82927972746919]
TRACE is a framework that generates 3D medical images with temporal alignment. Overlapping frame pairs are linked into a flexible-length sequence, which is reconstructed into a temporally and anatomically aligned 3D volume.
arXiv Detail & Related papers (2025-07-01T14:35:39Z)
- CLAIM: Clinically-Guided LGE Augmentation for Realistic and Diverse Myocardial Scar Synthesis and Segmentation [3.052913696182197]
We introduce CLAIM, a Clinically-Guided LGE Augmentation framework for realistic and diverse myocardial scar synthesis and segmentation. At its core is the SMILE module, which conditions a diffusion-based generator on the clinically adopted AHA 17-segment model to synthesize images with anatomically consistent and spatially diverse scar patterns. Experimental results show that CLAIM produces anatomically coherent scar patterns and achieves higher Dice similarity with real scar distributions compared to baseline models.
arXiv Detail & Related papers (2025-06-18T15:21:34Z)
- Memory-Efficient 3D High-Resolution Medical Image Synthesis Using CRF-Guided GANs [47.873227167456136]
We propose a novel end-to-end GAN architecture that uses a conditional random field (CRF) to model dependencies. Our architecture outperforms the state of the art while using less memory and having lower complexity.
arXiv Detail & Related papers (2025-03-13T21:31:15Z)
- Guided Synthesis of Labeled Brain MRI Data Using Latent Diffusion Models for Segmentation of Enlarged Ventricles [0.4188114563181614]
Deep learning models in medical contexts face challenges like data scarcity, inhomogeneity, and privacy concerns.
This study focuses on improving ventricular segmentation in brain MRI images using synthetic data.
arXiv Detail & Related papers (2024-11-02T19:44:10Z)
- Towards Synergistic Deep Learning Models for Volumetric Cirrhotic Liver Segmentation in MRIs [1.5228650878164722]
Liver cirrhosis, a leading cause of global mortality, requires precise segmentation of ROIs for effective disease monitoring and treatment planning.
Existing segmentation models often fail to capture complex feature interactions and generalize across diverse datasets.
We propose a novel synergistic theory that leverages complementary latent spaces for enhanced feature interaction modeling.
arXiv Detail & Related papers (2024-08-08T14:41:32Z)
- A Data Augmentation Pipeline to Generate Synthetic Labeled Datasets of 3D Echocardiography Images using a GAN [6.0419497882916655]
We propose an image generation pipeline to synthesize 3D echocardiographic images with corresponding ground truth labels.
The proposed method utilizes detailed anatomical segmentations of the heart as ground truth label sources.
arXiv Detail & Related papers (2024-03-08T15:26:27Z)
- SDR-Former: A Siamese Dual-Resolution Transformer for Liver Lesion Classification Using 3D Multi-Phase Imaging [59.78761085714715]
This study proposes a novel Siamese Dual-Resolution Transformer (SDR-Former) framework for liver lesion classification.
The proposed framework has been validated through comprehensive experiments on two clinical datasets.
To support the scientific community, we are releasing our extensive multi-phase MR dataset for liver lesion analysis to the public.
arXiv Detail & Related papers (2024-02-27T06:32:56Z)
- On the Localization of Ultrasound Image Slices within Point Distribution Models [84.27083443424408]
Thyroid disorders are most commonly diagnosed using high-resolution Ultrasound (US).
Longitudinal tracking is a pivotal diagnostic protocol for monitoring changes in pathological thyroid morphology.
We present a framework for automated US image slice localization within a 3D shape representation.
arXiv Detail & Related papers (2023-09-01T10:10:46Z)
- Pathology Synthesis of 3D-Consistent Cardiac MR Images using 2D VAEs and GANs [0.5039813366558306]
We propose a method for generating labeled data for the application of supervised deep-learning (DL) training.
The image synthesis consists of label deformation and label-to-image translation tasks.
We demonstrate that such an approach could provide a solution to diversify and enrich an available database of cardiac MR images.
arXiv Detail & Related papers (2022-09-09T10:17:49Z)
- Revisiting 3D Context Modeling with Supervised Pre-training for Universal Lesion Detection in CT Slices [48.85784310158493]
We propose a Modified Pseudo-3D Feature Pyramid Network (MP3D FPN) to efficiently extract 3D context enhanced 2D features for universal lesion detection in CT slices.
With the novel pre-training method, the proposed MP3D FPN achieves state-of-the-art detection performance on the DeepLesion dataset.
The proposed 3D pre-trained weights can potentially be used to boost the performance of other 3D medical image analysis tasks.
arXiv Detail & Related papers (2020-12-16T07:11:16Z)
This list is automatically generated from the titles and abstracts of the papers on this site.