A quality assurance framework for real-time monitoring of deep learning
segmentation models in radiotherapy
- URL: http://arxiv.org/abs/2305.11715v1
- Date: Fri, 19 May 2023 14:51:05 GMT
- Title: A quality assurance framework for real-time monitoring of deep learning
segmentation models in radiotherapy
- Authors: Xiyao Jin, Yao Hao, Jessica Hilliard, Zhehao Zhang, Maria A. Thomas,
Hua Li, Abhinav K. Jha, Geoffrey D. Hugo
- Abstract summary: This work uses cardiac substructure segmentation as an example task to establish a quality assurance framework.
A benchmark dataset consisting of Computed Tomography (CT) images along with manual cardiac delineations of 241 patients was collected.
An image domain shift detector was developed by utilizing a trained Denoising autoencoder (DAE) and two hand-engineered features.
A regression model was trained to predict the per-patient segmentation accuracy, measured by Dice similarity coefficient (DSC)
- Score: 3.5752677591512487
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: To safely deploy deep learning models in the clinic, a quality assurance
framework is needed for routine or continuous monitoring of input-domain shift
and the models' performance without ground truth contours. In this work,
cardiac substructure segmentation was used as an example task to establish a QA
framework. A benchmark dataset consisting of Computed Tomography (CT) images
along with manual cardiac delineations of 241 patients were collected,
including one 'common' image domain and five 'uncommon' domains. Segmentation
models were tested on the benchmark dataset for an initial evaluation of model
capacity and limitations. An image domain shift detector was developed by
utilizing a trained Denoising autoencoder (DAE) and two hand-engineered
features. Another Variational Autoencoder (VAE) was also trained to estimate
the shape quality of the auto-segmentation results. Using the extracted
features from the image/segmentation pair as inputs, a regression model was
trained to predict the per-patient segmentation accuracy, measured by Dice
coefficient similarity (DSC). The framework was tested across 19 segmentation
models to evaluate the generalizability of the entire framework.
As results, the predicted DSC of regression models achieved a mean absolute
error (MAE) ranging from 0.036 to 0.046 with an averaged MAE of 0.041. When
tested on the benchmark dataset, the performances of all segmentation models
were not significantly affected by scanning parameters: FOV, slice thickness
and reconstructions kernels. For input images with Poisson noise, CNN-based
segmentation models demonstrated a decreased DSC ranging from 0.07 to 0.41,
while the transformer-based model was not significantly affected.
Related papers
- Towards Ground-truth-free Evaluation of Any Segmentation in Medical Images [22.36128130052757]
We build a ground-truth-free evaluation model to assess the quality of segmentations generated by the Segment Anything Model (SAM) and its variants in medical imaging.
This evaluation model estimates segmentation quality scores by analyzing the coherence and consistency between the input images and their corresponding segmentation predictions.
arXiv Detail & Related papers (2024-09-23T10:12:08Z) - Handling Geometric Domain Shifts in Semantic Segmentation of Surgical RGB and Hyperspectral Images [67.66644395272075]
We present first analysis of state-of-the-art semantic segmentation models when faced with geometric out-of-distribution data.
We propose an augmentation technique called "Organ Transplantation" to enhance generalizability.
Our augmentation technique improves SOA model performance by up to 67 % for RGB data and 90 % for HSI data, achieving performance at the level of in-distribution performance on real OOD test data.
arXiv Detail & Related papers (2024-08-27T19:13:15Z) - Dimensionality Reduction and Nearest Neighbors for Improving Out-of-Distribution Detection in Medical Image Segmentation [1.2873975765521795]
This work applied the Mahalanobis distance (MD) post hoc to the bottleneck features of four Swin UNETR and nnU-net models that segmented the liver.
Images the models failed on were detected with high performance and minimal computational load.
arXiv Detail & Related papers (2024-08-05T18:24:48Z) - TotalSegmentator MRI: Sequence-Independent Segmentation of 59 Anatomical Structures in MR images [62.53931644063323]
In this study we extended the capabilities of TotalSegmentator to MR images.
We trained an nnU-Net segmentation algorithm on this dataset and calculated similarity coefficients (Dice) to evaluate the model's performance.
The model significantly outperformed two other publicly available segmentation models (Dice score 0.824 versus 0.762; p0.001 and 0.762 versus 0.542; p)
arXiv Detail & Related papers (2024-05-29T20:15:54Z) - Towards Better Certified Segmentation via Diffusion Models [62.21617614504225]
segmentation models can be vulnerable to adversarial perturbations, which hinders their use in critical-decision systems like healthcare or autonomous driving.
Recently, randomized smoothing has been proposed to certify segmentation predictions by adding Gaussian noise to the input to obtain theoretical guarantees.
In this paper, we address the problem of certifying segmentation prediction using a combination of randomized smoothing and diffusion models.
arXiv Detail & Related papers (2023-06-16T16:30:39Z) - Application of the nnU-Net for automatic segmentation of lung lesion on
CT images, and implication on radiomic models [1.8231394717039833]
A deep-learning automatic segmentation method was applied on computed tomography images of non-small-cell lung cancer patients.
The use of manual vs automatic segmentation in the performance of survival radiomic models was assessed, as well.
arXiv Detail & Related papers (2022-09-24T15:04:23Z) - An Efficient End-to-End Deep Neural Network for Interstitial Lung
Disease Recognition and Classification [0.5424799109837065]
This paper introduces an end-to-end deep convolution neural network (CNN) for classifying ILDs patterns.
The proposed model comprises four convolutional layers with different kernel sizes and Rectified Linear Unit (ReLU) activation function.
A dataset consisting of 21328 image patches of 128 CT scans with five classes is taken to train and assess the proposed model.
arXiv Detail & Related papers (2022-04-21T06:36:10Z) - Improving Classification Model Performance on Chest X-Rays through Lung
Segmentation [63.45024974079371]
We propose a deep learning approach to enhance abnormal chest x-ray (CXR) identification performance through segmentations.
Our approach is designed in a cascaded manner and incorporates two modules: a deep neural network with criss-cross attention modules (XLSor) for localizing lung region in CXR images and a CXR classification model with a backbone of a self-supervised momentum contrast (MoCo) model pre-trained on large-scale CXR data sets.
arXiv Detail & Related papers (2022-02-22T15:24:06Z) - Automatic size and pose homogenization with spatial transformer network
to improve and accelerate pediatric segmentation [51.916106055115755]
We propose a new CNN architecture that is pose and scale invariant thanks to the use of Spatial Transformer Network (STN)
Our architecture is composed of three sequential modules that are estimated together during training.
We test the proposed method in kidney and renal tumor segmentation on abdominal pediatric CT scanners.
arXiv Detail & Related papers (2021-07-06T14:50:03Z) - Collaborative Boundary-aware Context Encoding Networks for Error Map
Prediction [65.44752447868626]
We propose collaborative boundaryaware context encoding networks called AEP-Net for error prediction task.
Specifically, we propose a collaborative feature transformation branch for better feature fusion between images and masks, and precise localization of error regions.
The AEP-Net achieves an average DSC of 0.8358, 0.8164 for error prediction task, and shows a high Pearson correlation coefficient of 0.9873.
arXiv Detail & Related papers (2020-06-25T12:42:01Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.