Non-Iterative Scribble-Supervised Learning with Pacing Pseudo-Masks for
Medical Image Segmentation
- URL: http://arxiv.org/abs/2210.10956v2
- Date: Thu, 28 Sep 2023 20:09:59 GMT
- Title: Non-Iterative Scribble-Supervised Learning with Pacing Pseudo-Masks for
Medical Image Segmentation
- Authors: Zefan Yang, Di Lin, Dong Ni, and Yi Wang
- Abstract summary: Scribble-supervised medical image segmentation tackles the limitation of sparse masks.
We propose a non-iterative method where a stream of varying (pacing) pseudo-masks teach a network via consistency training, named PacingPseudo.
The efficacy of the proposed PacingPseudo is validated on three public medical image datasets.
- Score: 13.940364677162968
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Scribble-supervised medical image segmentation tackles the limitation of
sparse masks. Conventional approaches alternate between: labeling pseudo-masks
and optimizing network parameters. However, such iterative two-stage paradigm
is unwieldy and could be trapped in poor local optima since the networks
undesirably regress to the erroneous pseudo-masks. To address these issues, we
propose a non-iterative method where a stream of varying (pacing) pseudo-masks
teach a network via consistency training, named PacingPseudo. Our motivation
lies first in a non-iterative process. Interestingly, it can be achieved
gracefully by a siamese architecture, wherein a stream of pseudo-masks
naturally assimilate a stream of predicted masks during training. Second, we
make the consistency training effective with two necessary designs: (i) entropy
regularization to obtain high-confidence pseudo-masks for effective teaching;
and (ii) distorted augmentations to create discrepancy between the pseudo-mask
and predicted-mask streams for consistency regularization. Third, we devise a
new memory bank mechanism that provides an extra source of ensemble features to
complement scarce labeled pixels. The efficacy of the proposed PacingPseudo is
validated on three public medical image datasets, including the segmentation
tasks of abdominal multi-organs, cardiac structures, and myocardium. Extensive
experiments demonstrate our PacingPseudo improves the baseline by large margins
and consistently outcompetes several previous methods. In some cases, our
PacingPseudo achieves comparable performance with its fully-supervised
counterparts, showing the feasibility of our method for the challenging
scribble-supervised segmentation applications. The code and scribble
annotations will be publicly available.
Related papers
- Variance-insensitive and Target-preserving Mask Refinement for
Interactive Image Segmentation [68.16510297109872]
Point-based interactive image segmentation can ease the burden of mask annotation in applications such as semantic segmentation and image editing.
We introduce a novel method, Variance-Insensitive and Target-Preserving Mask Refinement to enhance segmentation quality with fewer user inputs.
Experiments on GrabCut, Berkeley, SBD, and DAVIS datasets demonstrate our method's state-of-the-art performance in interactive image segmentation.
arXiv Detail & Related papers (2023-12-22T02:31:31Z) - Masking Improves Contrastive Self-Supervised Learning for ConvNets, and Saliency Tells You Where [63.61248884015162]
We aim to alleviate the burden of including masking operation into the contrastive-learning framework for convolutional neural networks.
We propose to explicitly take the saliency constraint into consideration in which the masked regions are more evenly distributed among the foreground and background.
arXiv Detail & Related papers (2023-09-22T09:58:38Z) - Unmasking Anomalies in Road-Scene Segmentation [18.253109627901566]
Anomaly segmentation is a critical task for driving applications.
We propose a paradigm change by shifting from a per-pixel classification to a mask classification.
Mask2Anomaly demonstrates the feasibility of integrating an anomaly detection method in a mask-classification architecture.
arXiv Detail & Related papers (2023-07-25T08:23:10Z) - Learning to Mask and Permute Visual Tokens for Vision Transformer
Pre-Training [59.923672191632065]
We propose a new self-supervised pre-training approach, named Masked and Permuted Vision Transformer (MaPeT)
MaPeT employs autoregressive and permuted predictions to capture intra-patch dependencies.
Our results demonstrate that MaPeT achieves competitive performance on ImageNet.
arXiv Detail & Related papers (2023-06-12T18:12:19Z) - MaskCLIP: Masked Self-Distillation Advances Contrastive Language-Image
Pretraining [138.86293836634323]
MaskCLIP incorporates a newly proposed masked self-distillation into contrastive language-image pretraining.
MaskCLIP achieves superior results in linear probing, finetuning, and zero-shot performance with the guidance of the language encoder.
arXiv Detail & Related papers (2022-08-25T17:59:58Z) - What You See is What You Classify: Black Box Attributions [61.998683569022006]
We train a deep network, the Explainer, to predict attributions for a pre-trained black-box classifier, the Explanandum.
Unlike most existing approaches, ours is capable of directly generating very distinct class-specific masks.
We show that our attributions are superior to established methods both visually and quantitatively.
arXiv Detail & Related papers (2022-05-23T12:30:04Z) - Partial Attack Supervision and Regional Weighted Inference for Masked
Face Presentation Attack Detection [5.71864964818217]
Wearing a mask has proven to be one of the most effective ways to prevent the transmission of SARS-CoV-2 coronavirus.
The main issues facing the mask face PAD are the wrongly classified bona fide masked faces and the wrongly classified partial attacks.
This work proposes a method that considers partial attack labels to supervise the PAD model training, as well as regional weighted inference to further improve the PAD performance.
arXiv Detail & Related papers (2021-11-08T08:53:46Z) - Mask or Non-Mask? Robust Face Mask Detector via Triplet-Consistency
Representation Learning [23.062034116854875]
In the absence of vaccines or medicines to stop COVID-19, one of the effective methods to slow the spread of the coronavirus is to wear a face mask.
To mandate the use of face masks or coverings in public areas, additional human resources are required, which is tedious and attention-intensive.
We propose a face mask detection framework that uses the context attention module to enable the effective attention of the feed-forward convolution neural network.
arXiv Detail & Related papers (2021-10-01T16:44:06Z) - Pseudo-mask Matters inWeakly-supervised Semantic Segmentation [24.73662587701187]
We find some matters related to the pseudo-masks, including high quality pseudo-masks generation from class activation maps (CAMs) and training with noisy pseudo-mask supervision.
We propose the following designs to push the performance to new state-of-art: (i) Coefficient of Variation Smoothing to smooth the CAMs adaptively; (ii) Proportional Pseudo-mask Generation to project the expanded CAMs to pseudo-mask based on a new metric indicating the importance of each class on each location; (iii) Pretended Under-Fitting strategy to suppress the influence of noise in pseudo
arXiv Detail & Related papers (2021-08-30T05:35:28Z) - Image Inpainting by End-to-End Cascaded Refinement with Mask Awareness [66.55719330810547]
Inpainting arbitrary missing regions is challenging because learning valid features for various masked regions is nontrivial.
We propose a novel mask-aware inpainting solution that learns multi-scale features for missing regions in the encoding phase.
Our framework is validated both quantitatively and qualitatively via extensive experiments on three public datasets.
arXiv Detail & Related papers (2021-04-28T13:17:47Z) - Post-DAE: Anatomically Plausible Segmentation via Post-Processing with
Denoising Autoencoders [19.361024564220454]
Post-DAE is a post-processing method based on denoising autoencoders (DAE)
We show how erroneous and noisy segmentation masks can be improved using Post-DAE.
arXiv Detail & Related papers (2020-06-24T15:05:03Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.