POWDR: Pathology-preserving Outpainting with Wavelet Diffusion for 3D MRI
- URL: http://arxiv.org/abs/2601.09044v1
- Date: Wed, 14 Jan 2026 00:20:13 GMT
- Title: POWDR: Pathology-preserving Outpainting with Wavelet Diffusion for 3D MRI
- Authors: Fei Tan, Ashok Vardhan Addala, Bruno Astuto Arouche Nunes, Xucheng Zhu, Ravi Soni,
- Abstract summary: POWDR is a pathology-preserving outpainting framework for 3D MRI based on a conditioned wavelet diffusion model.<n>Our approach leverages wavelet-domain conditioning to enhance high-frequency detail and blurring common in latent diffusion models.
- Score: 7.901173149711112
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Medical imaging datasets often suffer from class imbalance and limited availability of pathology-rich cases, which constrains the performance of machine learning models for segmentation, classification, and vision-language tasks. To address this challenge, we propose POWDR, a pathology-preserving outpainting framework for 3D MRI based on a conditioned wavelet diffusion model. Unlike conventional augmentation or unconditional synthesis, POWDR retains real pathological regions while generating anatomically plausible surrounding tissue, enabling diversity without fabricating lesions. Our approach leverages wavelet-domain conditioning to enhance high-frequency detail and mitigate blurring common in latent diffusion models. We introduce a random connected mask training strategy to overcome conditioning-induced collapse and improve diversity outside the lesion. POWDR is evaluated on brain MRI using BraTS datasets and extended to knee MRI to demonstrate tissue-agnostic applicability. Quantitative metrics (FID, SSIM, LPIPS) confirm image realism, while diversity analysis shows significant improvement with random-mask training (cosine similarity reduced from 0.9947 to 0.9580; KL divergence increased from 0.00026 to 0.01494). Clinically relevant assessments reveal gains in tumor segmentation performance using nnU-Net, with Dice scores improving from 0.6992 to 0.7137 when adding 50 synthetic cases. Tissue volume analysis indicates no significant differences for CSF and GM compared to real images. These findings highlight POWDR as a practical solution for addressing data scarcity and class imbalance in medical imaging. The method is extensible to multiple anatomies and offers a controllable framework for generating diverse, pathology-preserving synthetic data to support robust model development.
Related papers
- Anatomically Guided Latent Diffusion for Brain MRI Progression Modeling [10.62087466710015]
Anatomically Guided Latent Diffusion Model (AG-LDM) is a segmentation-guided framework that enforces anatomically consistent progression.<n>A lightweight 3D tissue segmentation model (WarpSeg) provides explicit anatomical supervision during both autoencoder fine-tuning and diffusion model training.
arXiv Detail & Related papers (2026-01-21T01:45:36Z) - A Semantically Enhanced Generative Foundation Model Improves Pathological Image Synthesis [82.01597026329158]
We introduce a Correlation-Regulated Alignment Framework for Tissue Synthesis (CRAFTS) for pathology-specific text-to-image synthesis.<n>CRAFTS incorporates a novel alignment mechanism that suppresses semantic drift to ensure biological accuracy.<n>This model generates diverse pathological images spanning 30 cancer types, with quality rigorously validated by objective metrics and pathologist evaluations.
arXiv Detail & Related papers (2025-12-15T10:22:43Z) - Scanner-Agnostic MRI Harmonization via SSIM-Guided Disentanglement [0.0]
We present an image-based harmonization framework for 3D T1-weighted brain MRI.<n>The model incorporates a differentiable loss based on the Structural Similarity Index (SSIM) to preserve biologically meaningful features.<n>Visual comparisons, voxel intensity distributions, and SSIM-based metrics demonstrated that harmonized images achieved strong alignment across acquisition settings.
arXiv Detail & Related papers (2025-10-24T23:19:02Z) - Integrating Anatomical Priors into a Causal Diffusion Model [14.471851828800055]
3D brain MRI studies often examine subtle morphometric differences that are hard to detect visually.<n>Counterfactual models struggle to produce plausible MRIs due to the lack of explicit inductive biases to preserve fine-grained anatomical details.<n>We propose to explicitly integrate anatomical constraints on a voxel-level as prior into a generative diffusion framework.
arXiv Detail & Related papers (2025-09-10T23:22:05Z) - Latent Diffusion Autoencoders: Toward Efficient and Meaningful Unsupervised Representation Learning in Medical Imaging [41.446379453352534]
Latent Diffusion Autoencoder (LDAE) is a novel encoder-decoder diffusion-based framework for efficient and meaningful unsupervised learning in medical imaging.<n>This study focuses on Alzheimer disease (AD) using brain MR from the ADNI database as a case study.
arXiv Detail & Related papers (2025-04-11T15:37:46Z) - BrainMRDiff: A Diffusion Model for Anatomically Consistent Brain MRI Synthesis [11.73947657846282]
BrainMRDiff is a novel topology-preserving, anatomy-guided diffusion model for brain MRI.<n>To achieve this, we introduce two key modules: Tumor+Structure Aggregation (TSA) and Topology-Guided Anatomy Preservation (TGAP)<n>TSA integrates diverse anatomical structures with tumor information, forming a comprehensive conditioning mechanism for the diffusion process.<n>Results demonstrate that BrainMRDiff surpasses existing baselines, achieving performance improvements of 23.33% on the BraTS-AG dataset and 33.33% on the BraTS-Met dataset.
arXiv Detail & Related papers (2025-04-06T16:16:50Z) - Latent Drifting in Diffusion Models for Counterfactual Medical Image Synthesis [55.959002385347645]
Latent Drifting enables diffusion models to be conditioned for medical images fitted for the complex task of counterfactual image generation.<n>We evaluate our method on three public longitudinal benchmark datasets of brain MRI and chest X-rays for counterfactual image generation.
arXiv Detail & Related papers (2024-12-30T01:59:34Z) - Inpainting Pathology in Lumbar Spine MRI with Latent Diffusion [4.410798232767917]
We propose an efficient method for inpainting pathological features onto healthy anatomy in MRI.
We evaluate the method's ability to insert disc herniation and central canal stenosis in lumbar spine sagittal T2 MRI.
arXiv Detail & Related papers (2024-06-04T16:47:47Z) - LeFusion: Controllable Pathology Synthesis via Lesion-Focused Diffusion Models [42.922303491557244]
Patient data from real-world clinical practice often suffers from data scarcity and long-tail imbalances.
This study addresses these challenges by generating lesion-containing image-segmentation pairs from lesion-free images.
LeFusion-generated data significantly improves the performance of state-of-the-art segmentation models.
arXiv Detail & Related papers (2024-03-21T01:25:39Z) - Diffusion Models for Counterfactual Generation and Anomaly Detection in Brain Images [39.94162291765236]
We present a weakly supervised method to generate a healthy version of a diseased image and then use it to obtain a pixel-wise anomaly map.
We employ a diffusion model trained on healthy samples and combine Denoising Diffusion Probabilistic Model (DDPM) and Denoising Implicit Model (DDIM) at each step of the sampling process.
arXiv Detail & Related papers (2023-08-03T21:56:50Z) - Brain Imaging-to-Graph Generation using Adversarial Hierarchical Diffusion Models for MCI Causality Analysis [44.45598796591008]
Brain imaging-to-graph generation (BIGG) framework is proposed to map functional magnetic resonance imaging (fMRI) into effective connectivity for mild cognitive impairment analysis.
The hierarchical transformers in the generator are designed to estimate the noise at multiple scales.
Evaluations of the ADNI dataset demonstrate the feasibility and efficacy of the proposed model.
arXiv Detail & Related papers (2023-05-18T06:54:56Z) - Scale-Space Autoencoders for Unsupervised Anomaly Segmentation in Brain
MRI [47.26574993639482]
We show improved anomaly segmentation performance and the general capability to obtain much more crisp reconstructions of input data at native resolution.
The modeling of the laplacian pyramid further enables the delineation and aggregation of lesions at multiple scales.
arXiv Detail & Related papers (2020-06-23T09:20:42Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.