SP-DiffDose: A Conditional Diffusion Model for Radiation Dose Prediction
Based on Multi-Scale Fusion of Anatomical Structures, Guided by
SwinTransformer and Projector
- URL: http://arxiv.org/abs/2312.06187v1
- Date: Mon, 11 Dec 2023 08:07:41 GMT
- Title: SP-DiffDose: A Conditional Diffusion Model for Radiation Dose Prediction
Based on Multi-Scale Fusion of Anatomical Structures, Guided by
SwinTransformer and Projector
- Authors: Linjie Fu, Xia Li, Xiuding Cai, Yingkai Wang, Xueyao Wang, Yu Yao,
Yali Shen
- Abstract summary: We propose a dose prediction diffusion model based on SwinTransformer and a projector, SP-DiffDose.
To capture the direct correlation between anatomical structure and dose distribution maps, SP-DiffDose uses a structural encoder to extract features from anatomical images.
To enhance the dose prediction distribution for organs at risk, SP-DiffDose utilizes SwinTransformer in the deeper layers of the network to capture features at different scales in the image.
- Score: 14.18016609082685
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Radiation therapy serves as an effective and standard method for cancer
treatment. Excellent radiation therapy plans always rely on high-quality dose
distribution maps obtained through repeated trial and error by experienced
experts. However, due to individual differences and complex clinical
situations, even seasoned expert teams may need help to achieve the best
treatment plan every time quickly. Many automatic dose distribution prediction
methods have been proposed recently to accelerate the radiation therapy
planning process and have achieved good results. However, these results suffer
from over-smoothing issues, with the obtained dose distribution maps needing
more high-frequency details, limiting their clinical application. To address
these limitations, we propose a dose prediction diffusion model based on
SwinTransformer and a projector, SP-DiffDose. To capture the direct correlation
between anatomical structure and dose distribution maps, SP-DiffDose uses a
structural encoder to extract features from anatomical images, then employs a
conditional diffusion process to blend noise and anatomical images at multiple
scales and gradually map them to dose distribution maps. To enhance the dose
prediction distribution for organs at risk, SP-DiffDose utilizes
SwinTransformer in the deeper layers of the network to capture features at
different scales in the image. To learn good representations from the fused
features, SP-DiffDose passes the fused features through a designed projector,
improving dose prediction accuracy. Finally, we evaluate SP-DiffDose on an
internal dataset. The results show that SP-DiffDose outperforms existing
methods on multiple evaluation metrics, demonstrating the superiority and
generalizability of our method.
Related papers
- Latent Spaces Enable Transformer-Based Dose Prediction in Complex Radiotherapy Plans [0.11249583407496219]
Multi-lesion lung SABR plans are complex and require significant resources to create.
We propose a novel two-stage latent transformer framework (LDFormer) for dose prediction of lung SABR plans with varying numbers of lesions.
arXiv Detail & Related papers (2024-07-11T16:28:44Z) - MD-Dose: A Diffusion Model based on the Mamba for Radiotherapy Dose
Prediction [14.18016609082685]
We introduce a novel diffusion model, MD-Dose, for predicting radiation therapy dose distribution in thoracic cancer patients.
In the forward process, MD-Dose adds Gaussian noise to dose distribution maps to obtain pure noise images.
In the backward process, MD-Dose utilizes a noise predictor based on the Mamba to predict the noise, ultimately outputting the dose distribution maps.
arXiv Detail & Related papers (2024-03-13T12:46:36Z) - Triplet-constraint Transformer with Multi-scale Refinement for Dose
Prediction in Radiotherapy [10.232397630125886]
CNNs have automated the radiotherapy plan-making by predicting the dose maps.
Current CNN-based methods ignore the remarkable dose difference in the dose map.
We propose a triplet-constraint transformer (TCtrans) with multi-scale refinement to predict the high-quality dose distribution.
arXiv Detail & Related papers (2024-02-07T04:05:29Z) - ArSDM: Colonoscopy Images Synthesis with Adaptive Refinement Semantic
Diffusion Models [69.9178140563928]
Colonoscopy analysis is essential for assisting clinical diagnosis and treatment.
The scarcity of annotated data limits the effectiveness and generalization of existing methods.
We propose an Adaptive Refinement Semantic Diffusion Model (ArSDM) to generate colonoscopy images that benefit the downstream tasks.
arXiv Detail & Related papers (2023-09-03T07:55:46Z) - DiffDP: Radiotherapy Dose Prediction via a Diffusion Model [13.44191425264393]
We introduce a diffusion-based dose prediction (DiffDP) model for predicting the radiotherapy dose distribution of cancer patients.
In the forward process, DiffDP gradually transforms dose maps into Gaussian noise by adding small noise and trains a noise predictor to predict the noise added in each timestep.
In the reverse process, it removes the noise from the original Gaussian noise in multiple steps with the well-trained noise predictor and finally outputs the predicted dose distribution map.
arXiv Detail & Related papers (2023-07-19T07:25:33Z) - Learning Structure-Guided Diffusion Model for 2D Human Pose Estimation [71.24808323646167]
We propose textbfDiffusionPose, a new scheme for learning keypoints heatmaps by a neural network.
During training, the keypoints are diffused to random distribution by adding noises and the diffusion model learns to recover ground-truth heatmaps from noised heatmaps.
Experiments show the prowess of our scheme with improvements of 1.6, 1.2, and 1.2 mAP on widely-used COCO, CrowdPose, and AI Challenge datasets.
arXiv Detail & Related papers (2023-06-29T16:24:32Z) - DoseDiff: Distance-aware Diffusion Model for Dose Prediction in Radiotherapy [7.934475806787889]
We propose a distance-aware diffusion model (DoseDiff) for precise prediction of dose distribution.
The results demonstrate that our DoseDiff method outperforms state-of-the-art dose prediction methods in terms of both quantitative performance and visual quality.
arXiv Detail & Related papers (2023-06-28T15:58:53Z) - Ambiguous Medical Image Segmentation using Diffusion Models [60.378180265885945]
We introduce a single diffusion model-based approach that produces multiple plausible outputs by learning a distribution over group insights.
Our proposed model generates a distribution of segmentation masks by leveraging the inherent sampling process of diffusion.
Comprehensive results show that our proposed approach outperforms existing state-of-the-art ambiguous segmentation networks.
arXiv Detail & Related papers (2023-04-10T17:58:22Z) - Improving Classification Model Performance on Chest X-Rays through Lung
Segmentation [63.45024974079371]
We propose a deep learning approach to enhance abnormal chest x-ray (CXR) identification performance through segmentations.
Our approach is designed in a cascaded manner and incorporates two modules: a deep neural network with criss-cross attention modules (XLSor) for localizing lung region in CXR images and a CXR classification model with a backbone of a self-supervised momentum contrast (MoCo) model pre-trained on large-scale CXR data sets.
arXiv Detail & Related papers (2022-02-22T15:24:06Z) - Many-to-One Distribution Learning and K-Nearest Neighbor Smoothing for
Thoracic Disease Identification [83.6017225363714]
deep learning has become the most powerful computer-aided diagnosis technology for improving disease identification performance.
For chest X-ray imaging, annotating large-scale data requires professional domain knowledge and is time-consuming.
In this paper, we propose many-to-one distribution learning (MODL) and K-nearest neighbor smoothing (KNNS) methods to improve a single model's disease identification performance.
arXiv Detail & Related papers (2021-02-26T02:29:30Z) - Simultaneous Estimation of X-ray Back-Scatter and Forward-Scatter using
Multi-Task Learning [59.17383024536595]
Back-scatter significantly contributes to patient (skin) dose during complicated interventions.
Forward-scattered radiation reduces contrast in projection images and introduces artifacts in 3-D reconstructions.
We propose a novel approach combining conventional techniques with learning-based methods to simultaneously estimate the forward-scatter reaching the detector.
arXiv Detail & Related papers (2020-07-08T10:47:37Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.