Devil is in the Queries: Advancing Mask Transformers for Real-world
Medical Image Segmentation and Out-of-Distribution Localization
- URL: http://arxiv.org/abs/2304.00212v1
- Date: Sat, 1 Apr 2023 03:24:03 GMT
- Title: Devil is in the Queries: Advancing Mask Transformers for Real-world
Medical Image Segmentation and Out-of-Distribution Localization
- Authors: Mingze Yuan, Yingda Xia, Hexin Dong, Zifan Chen, Jiawen Yao, Mingyan
Qiu, Ke Yan, Xiaoli Yin, Yu Shi, Xin Chen, Zaiyi Liu, Bin Dong, Jingren Zhou,
Le Lu, Ling Zhang, Li Zhang
- Abstract summary: A trustworthy medical AI algorithm should demonstrate its effectiveness on tail conditions to avoid clinically dangerous damage.
We adopt the concept of object queries in Mask Transformers to formulate semantic segmentation as a soft cluster assignment.
Our framework is tested on two real-world segmentation tasks, i.e., segmentation of pancreatic and liver tumors.
- Score: 40.013449382899566
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Real-world medical image segmentation has tremendous long-tailed complexity
of objects, among which tail conditions correlate with relatively rare diseases
and are clinically significant. A trustworthy medical AI algorithm should
demonstrate its effectiveness on tail conditions to avoid clinically dangerous
damage in these out-of-distribution (OOD) cases. In this paper, we adopt the
concept of object queries in Mask Transformers to formulate semantic
segmentation as a soft cluster assignment. The queries fit the feature-level
cluster centers of inliers during training. Therefore, when performing
inference on a medical image in real-world scenarios, the similarity between
pixels and the queries detects and localizes OOD regions. We term this OOD
localization as MaxQuery. Furthermore, the foregrounds of real-world medical
images, whether OOD objects or inliers, are lesions. The difference between
them is less than that between the foreground and background, possibly
misleading the object queries to focus redundantly on the background. Thus, we
propose a query-distribution (QD) loss to enforce clear boundaries between
segmentation targets and other regions at the query level, improving the inlier
segmentation and OOD indication. Our proposed framework is tested on two
real-world segmentation tasks, i.e., segmentation of pancreatic and liver
tumors, outperforming previous state-of-the-art algorithms by an average of
7.39% on AUROC, 14.69% on AUPR, and 13.79% on FPR95 for OOD localization. On
the other hand, our framework improves the performance of inlier segmentation
by an average of 5.27% DSC when compared with the leading baseline nnUNet.
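The abstract's idea of scoring OOD regions by pixel-to-query similarity can be illustrated with a minimal sketch. This is not the authors' implementation: the cosine similarity measure, the function names, the `temperature` parameter, and the toy 2-D features are all assumptions made for illustration. The sketch treats each query as a cluster center, computes a soft assignment of a pixel feature to the queries, and uses the negated maximal query response as an anomaly score, so that a pixel no query responds to receives a high score.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length feature vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def soft_assignment(pixel, queries, temperature=1.0):
    """Softmax over pixel-query similarities (a soft cluster assignment).

    `temperature` is a hypothetical knob, not a parameter from the paper.
    """
    logits = [cosine(pixel, q) / temperature for q in queries]
    peak = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(value - peak) for value in logits]
    total = sum(exps)
    return [e / total for e in exps]


def ood_score(pixel, queries):
    """Negated maximal query response: higher means more anomalous."""
    return -max(cosine(pixel, q) for q in queries)


# Hypothetical cluster centers and pixel features, for illustration only.
queries = [[1.0, 0.0], [0.0, 1.0]]
inlier_pixel = [0.9, 0.1]     # close to the first query
outlier_pixel = [-0.7, -0.7]  # far from every query

print(soft_assignment(inlier_pixel, queries))
print(ood_score(inlier_pixel, queries), ood_score(outlier_pixel, queries))
```

In this toy setup the outlier pixel receives a higher score than the inlier pixel, because neither query is similar to it. A real model would compute these responses densely over a feature map rather than one pixel at a time.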
Related papers
- OOD-SEG: Out-Of-Distribution detection for image SEGmentation with sparse multi-class positive-only annotations [4.9547168429120205]
Deep neural networks in medical and surgical imaging face several challenges, two of which we aim to address in this work.
First, acquiring complete pixel-level segmentation labels for medical images is time-consuming and requires domain expertise.
Second, typical segmentation pipelines cannot detect out-of-distribution pixels, leaving them prone to spurious outputs during deployment.
arXiv Detail & Related papers (2024-11-14T16:06:30Z)
- Self-Supervised Correction Learning for Semi-Supervised Biomedical Image Segmentation [84.58210297703714]
We propose a self-supervised correction learning paradigm for semi-supervised biomedical image segmentation.
We design a dual-task network, including a shared encoder and two independent decoders for segmentation and lesion region inpainting.
Experiments on three medical image segmentation datasets for different tasks demonstrate the outstanding performance of our method.
arXiv Detail & Related papers (2023-01-12T08:19:46Z)
- Reliable Joint Segmentation of Retinal Edema Lesions in OCT Images [55.83984261827332]
In this paper, we propose a novel reliable multi-scale wavelet-enhanced transformer network.
We develop a novel segmentation backbone that integrates a wavelet-enhanced feature extractor network and a multi-scale transformer module.
Our proposed method achieves better segmentation accuracy with a high degree of reliability as compared to other state-of-the-art segmentation approaches.
arXiv Detail & Related papers (2022-12-01T07:32:56Z)
- Generative Adversarial Networks for Weakly Supervised Generation and Evaluation of Brain Tumor Segmentations on MR Images [0.0]
This work presents a weakly supervised approach to segment anomalies in 2D magnetic resonance images.
We train a generative adversarial network (GAN) that converts cancerous images to healthy variants.
Non-cancerous variants can also be used to evaluate the segmentations in a weakly supervised fashion.
arXiv Detail & Related papers (2022-11-10T00:04:46Z)
- Cross-level Contrastive Learning and Consistency Constraint for Semi-supervised Medical Image Segmentation [46.678279106837294]
We propose a cross-level contrastive learning scheme to enhance representation capacity for local features in semi-supervised medical image segmentation.
With the help of the cross-level contrastive learning and consistency constraint, the unlabelled data can be effectively explored to improve segmentation performance.
arXiv Detail & Related papers (2022-02-08T15:12:11Z)
- Detect-and-Segment: a Deep Learning Approach to Automate Wound Image Segmentation [8.354517822940783]
We present a deep learning approach to produce wound segmentation maps with high generalization capabilities.
In our approach, dedicated deep neural networks detected the wound position, isolated the wound from the uninformative background, and computed the wound segmentation map.
arXiv Detail & Related papers (2021-11-02T13:39:13Z)
- Triggering Failures: Out-Of-Distribution detection by learning from local adversarial attacks in Semantic Segmentation [76.2621758731288]
We tackle the detection of out-of-distribution (OOD) objects in semantic segmentation.
Our main contribution is a new OOD detection architecture called ObsNet, associated with a dedicated training scheme based on Local Adversarial Attacks (LAA).
We show it obtains top performances both in speed and accuracy when compared to ten recent methods of the literature on three different datasets.
arXiv Detail & Related papers (2021-08-03T17:09:56Z)
- Collaborative Boundary-aware Context Encoding Networks for Error Map Prediction [65.44752447868626]
We propose collaborative boundary-aware context encoding networks, called AEP-Net, for the error prediction task.
Specifically, we propose a collaborative feature transformation branch for better feature fusion between images and masks, and precise localization of error regions.
AEP-Net achieves average DSC values of 0.8358 and 0.8164 on the error prediction task, and shows a high Pearson correlation coefficient of 0.9873.
arXiv Detail & Related papers (2020-06-25T12:42:01Z)