Calibrating the Dice loss to handle neural network overconfidence for
biomedical image segmentation
- URL: http://arxiv.org/abs/2111.00528v1
- Date: Sun, 31 Oct 2021 16:02:02 GMT
- Title: Calibrating the Dice loss to handle neural network overconfidence for
biomedical image segmentation
- Authors: Michael Yeung, Leonardo Rundo, Yang Nan, Evis Sala, Carola-Bibiane
Sch\"onlieb, Guang Yang
- Abstract summary: The Dice similarity coefficient (DSC) is a widely used metric and loss function for biomedical image segmentation.
In this study, we identify poor calibration as an emerging challenge of deep learning based biomedical image segmentation.
We provide a simple yet effective extension of the DSC loss, named the DSC++ loss, that selectively modulates the penalty associated with overconfident, incorrect predictions.
- Score: 2.6465053740712157
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The Dice similarity coefficient (DSC) is both a widely used metric and loss
function for biomedical image segmentation due to its robustness to class
imbalance. However, it is well known that the DSC loss is poorly calibrated,
resulting in overconfident predictions that cannot be usefully interpreted in
biomedical and clinical practice. Performance is often the only metric used to
evaluate segmentations produced by deep neural networks, and calibration is
often neglected. However, calibration is important for translation into
biomedical and clinical practice, providing crucial contextual information to
model predictions for interpretation by scientists and clinicians. In this
study, we identify poor calibration as an emerging challenge of deep learning
based biomedical image segmentation. We provide a simple yet effective
extension of the DSC loss, named the DSC++ loss, that selectively modulates the
penalty associated with overconfident, incorrect predictions. As a standalone
loss function, the DSC++ loss achieves significantly improved calibration over
the conventional DSC loss across five well-validated open-source biomedical
imaging datasets. Similarly, we observe significantly improved when integrating
the DSC++ loss into four DSC-based loss functions. Finally, we use softmax
thresholding to illustrate that well calibrated outputs enable tailoring of
precision-recall bias, an important post-processing technique to adapt the
model predictions to suit the biomedical or clinical task. The DSC++ loss
overcomes the major limitation of the DSC, providing a suitable loss function
for training deep learning segmentation models for use in biomedical and
clinical practice.
Related papers
- We Care Each Pixel: Calibrating on Medical Segmentation Model [15.826029150910566]
pixel-wise Expected Error (pECE) is a novel metric that measures miscalibration at the pixel level.
We also introduce a morphological adaptation strategy that applies morphological operations to ground-truth masks before computing calibration losses.
Our method not only enhances segmentation performance but also improves calibration quality, yielding more trustworthy confidence estimates.
arXiv Detail & Related papers (2025-03-07T03:06:03Z) - SEM-CLIP: Precise Few-Shot Learning for Nanoscale Defect Detection in Scanning Electron Microscope Image [27.17821452832534]
We propose a novel few-shot learning approach, SEM-CLIP, for accurate defect classification and segmentation.
We employ text prompts enriched with domain knowledge as prior information to assist in precise analysis.
SEM-CLIP requires little annotated data, substantially reducing labor demands in the semiconductor industry.
arXiv Detail & Related papers (2025-02-15T14:06:48Z) - Probabilistic 3D Correspondence Prediction from Sparse Unsegmented Images [1.2179682412409507]
We propose SPI-CorrNet, a unified model that predicts 3D correspondences from sparse imaging data.
Experiments on the LGE MRI left atrium dataset and Abdomen CT-1K liver datasets demonstrate that our technique enhances the accuracy and robustness of sparse image-driven SSM.
arXiv Detail & Related papers (2024-07-02T03:56:20Z) - Unveiling Incomplete Modality Brain Tumor Segmentation: Leveraging Masked Predicted Auto-Encoder and Divergence Learning [6.44069573245889]
Brain tumor segmentation remains a significant challenge, particularly in the context of multi-modal magnetic resonance imaging (MRI)
We propose a novel strategy, which is called masked predicted pre-training, enabling robust feature learning from incomplete modality data.
In the fine-tuning phase, we utilize a knowledge distillation technique to align features between complete and missing modality data, simultaneously enhancing model robustness.
arXiv Detail & Related papers (2024-06-12T20:35:16Z) - Towards Reliable Medical Image Segmentation by utilizing Evidential Calibrated Uncertainty [52.03490691733464]
We introduce DEviS, an easily implementable foundational model that seamlessly integrates into various medical image segmentation networks.
By leveraging subjective logic theory, we explicitly model probability and uncertainty for the problem of medical image segmentation.
DeviS incorporates an uncertainty-aware filtering module, which utilizes the metric of uncertainty-calibrated error to filter reliable data.
arXiv Detail & Related papers (2023-01-01T05:02:46Z) - Reliable Joint Segmentation of Retinal Edema Lesions in OCT Images [55.83984261827332]
In this paper, we propose a novel reliable multi-scale wavelet-enhanced transformer network.
We develop a novel segmentation backbone that integrates a wavelet-enhanced feature extractor network and a multi-scale transformer module.
Our proposed method achieves better segmentation accuracy with a high degree of reliability as compared to other state-of-the-art segmentation approaches.
arXiv Detail & Related papers (2022-12-01T07:32:56Z) - DOMINO: Domain-aware Model Calibration in Medical Image Segmentation [51.346121016559024]
Modern deep neural networks are poorly calibrated, compromising trustworthiness and reliability.
We propose DOMINO, a domain-aware model calibration method that leverages the semantic confusability and hierarchical similarity between class labels.
Our results show that DOMINO-calibrated deep neural networks outperform non-calibrated models and state-of-the-art morphometric methods in head image segmentation.
arXiv Detail & Related papers (2022-09-13T15:31:52Z) - BSM loss: A superior way in modeling aleatory uncertainty of
fine_grained classification [0.0]
We propose a modified Bootstrapping loss(BS loss) function with Mixup data augmentation strategy.
Our experiments indicated that BS loss with Mixup(BSM) model can halve the Expected Error(ECE) compared to standard data augmentation.
BSM model is able to perceive the semantic distance of out-of-domain data, demonstrating high potential in real-world clinical practice.
arXiv Detail & Related papers (2022-06-09T13:06:51Z) - Maximum Entropy on Erroneous Predictions (MEEP): Improving model
calibration for medical image segmentation [10.159176702917788]
We introduce MEEP, a training strategy for segmentation networks which selectively penalizes overconfident predictions, focusing only on misclassified pixels.
We benchmark the proposed strategy in two challenging segmentation tasks: white matter hyperintensity lesions in magnetic resonance images (MRI) of the brain, and atrial segmentation in cardiac MRI.
arXiv Detail & Related papers (2021-12-22T20:34:20Z) - An Uncertainty-Driven GCN Refinement Strategy for Organ Segmentation [53.425900196763756]
We propose a segmentation refinement method based on uncertainty analysis and graph convolutional networks.
We employ the uncertainty levels of the convolutional network in a particular input volume to formulate a semi-supervised graph learning problem.
We show that our method outperforms the state-of-the-art CRF refinement method by improving the dice score by 1% for the pancreas and 2% for spleen.
arXiv Detail & Related papers (2020-12-06T18:55:07Z) - Implanting Synthetic Lesions for Improving Liver Lesion Segmentation in
CT Exams [0.0]
We present a method for implanting realistic lesions in CT slices to provide a rich and controllable set of training samples.
We conclude that increasing the variability of lesions synthetically in terms of size, density, shape, and position seems to improve the performance of segmentation models for liver lesion segmentation in CT slices.
arXiv Detail & Related papers (2020-08-11T13:23:04Z) - Collaborative Boundary-aware Context Encoding Networks for Error Map
Prediction [65.44752447868626]
We propose collaborative boundaryaware context encoding networks called AEP-Net for error prediction task.
Specifically, we propose a collaborative feature transformation branch for better feature fusion between images and masks, and precise localization of error regions.
The AEP-Net achieves an average DSC of 0.8358, 0.8164 for error prediction task, and shows a high Pearson correlation coefficient of 0.9873.
arXiv Detail & Related papers (2020-06-25T12:42:01Z) - ECG-DelNet: Delineation of Ambulatory Electrocardiograms with Mixed
Quality Labeling Using Neural Networks [69.25956542388653]
Deep learning (DL) algorithms are gaining weight in academic and industrial settings.
We demonstrate DL can be successfully applied to low interpretative tasks by embedding ECG detection and delineation onto a segmentation framework.
The model was trained using PhysioNet's QT database, comprised of 105 ambulatory ECG recordings.
arXiv Detail & Related papers (2020-05-11T16:29:12Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.