Cross-head mutual Mean-Teaching for semi-supervised medical image
segmentation
- URL: http://arxiv.org/abs/2310.05082v2
- Date: Mon, 16 Oct 2023 04:48:21 GMT
- Title: Cross-head mutual Mean-Teaching for semi-supervised medical image
segmentation
- Authors: Wei Li, Ruifeng Bian, Wenyi Zhao, Weijin Xu, Huihua Yang
- Abstract summary: Semi-supervised medical image segmentation (SSMIS) has witnessed substantial advancements by leveraging limited labeled data and abundant unlabeled data.
Existing state-of-the-art (SOTA) methods encounter challenges in accurately predicting labels for the unlabeled data.
We propose a novel Cross-head mutual mean-teaching Network (CMMT-Net) incorporated strong-weak data augmentation.
- Score: 6.738522094694818
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Semi-supervised medical image segmentation (SSMIS) has witnessed substantial
advancements by leveraging limited labeled data and abundant unlabeled data.
Nevertheless, existing state-of-the-art (SOTA) methods encounter challenges in
accurately predicting labels for the unlabeled data, giving rise to disruptive
noise during training and susceptibility to erroneous information overfitting.
Moreover, applying perturbations to inaccurate predictions further reduces
consistent learning. To address these concerns, we propose a novel Cross-head
mutual mean-teaching Network (CMMT-Net) incorporated strong-weak data
augmentation, thereby benefitting both self-training and consistency learning.
Specifically, our CMMT-Net consists of both teacher-student peer networks with
a share encoder and dual slightly different decoders, and the pseudo labels
generated by one mean teacher head are adopted to supervise the other student
branch to achieve a mutual consistency. Furthermore, we propose mutual virtual
adversarial training (MVAT) to smooth the decision boundary and enhance feature
representations. To diversify the consistency training samples, we employ
Cross-Set CutMix strategy, which also helps address distribution mismatch
issues. Notably, CMMT-Net simultaneously implements data, feature, and network
perturbations, amplifying model diversity and generalization performance.
Experimental results on three publicly available datasets indicate that our
approach yields remarkable improvements over previous SOTA methods across
various semi-supervised scenarios. Code and logs will be available at
https://github.com/Leesoon1984/CMMT-Net.
Related papers
- Leveraging Unlabeled Data for 3D Medical Image Segmentation through
Self-Supervised Contrastive Learning [3.7395287262521717]
Current 3D semi-supervised segmentation methods face significant challenges such as limited consideration of contextual information.
We introduce two distinctworks designed to explore and exploit the discrepancies between them, ultimately correcting the erroneous prediction results.
We employ a self-supervised contrastive learning paradigm to distinguish between reliable and unreliable predictions.
arXiv Detail & Related papers (2023-11-21T14:03:16Z) - JointMatch: A Unified Approach for Diverse and Collaborative
Pseudo-Labeling to Semi-Supervised Text Classification [65.268245109828]
Semi-supervised text classification (SSTC) has gained increasing attention due to its ability to leverage unlabeled data.
Existing approaches based on pseudo-labeling suffer from the issues of pseudo-label bias and error accumulation.
We propose JointMatch, a holistic approach for SSTC that addresses these challenges by unifying ideas from recent semi-supervised learning.
arXiv Detail & Related papers (2023-10-23T05:43:35Z) - Co-training with High-Confidence Pseudo Labels for Semi-supervised
Medical Image Segmentation [27.833321555267116]
We propose an Uncertainty-guided Collaborative Mean-Teacher (UCMT) for semi-supervised semantic segmentation with the high-confidence pseudo labels.
UCMT consists of two main components: 1) collaborative mean-teacher (CMT) for encouraging model disagreement and performing co-training between the sub-networks, and 2) uncertainty-guided region mix (UMIX) for manipulating the input images according to the uncertainty maps of CMT and facilitating CMT to produce high-confidence pseudo labels.
arXiv Detail & Related papers (2023-01-11T13:48:37Z) - Interpolation-based Correlation Reduction Network for Semi-Supervised
Graph Learning [49.94816548023729]
We propose a novel graph contrastive learning method, termed Interpolation-based Correlation Reduction Network (ICRN)
In our method, we improve the discriminative capability of the latent feature by enlarging the margin of decision boundaries.
By combining the two settings, we extract rich supervision information from both the abundant unlabeled nodes and the rare yet valuable labeled nodes for discnative representation learning.
arXiv Detail & Related papers (2022-06-06T14:26:34Z) - Adversarial Dual-Student with Differentiable Spatial Warping for
Semi-Supervised Semantic Segmentation [70.2166826794421]
We propose a differentiable geometric warping to conduct unsupervised data augmentation.
We also propose a novel adversarial dual-student framework to improve the Mean-Teacher.
Our solution significantly improves the performance and state-of-the-art results are achieved on both datasets.
arXiv Detail & Related papers (2022-03-05T17:36:17Z) - Semi-supervised Domain Adaptive Structure Learning [72.01544419893628]
Semi-supervised domain adaptation (SSDA) is a challenging problem requiring methods to overcome both 1) overfitting towards poorly annotated data and 2) distribution shift across domains.
We introduce an adaptive structure learning method to regularize the cooperation of SSL and DA.
arXiv Detail & Related papers (2021-12-12T06:11:16Z) - Semi-supervised Left Atrium Segmentation with Mutual Consistency
Training [60.59108570938163]
We propose a novel Mutual Consistency Network (MC-Net) for semi-supervised left atrium segmentation from 3D MR images.
Our MC-Net consists of one encoder and two slightly different decoders, and the prediction discrepancies of two decoders are transformed as an unsupervised loss.
We evaluate our MC-Net on the public Left Atrium (LA) database and it obtains impressive performance gains by exploiting the unlabeled data effectively.
arXiv Detail & Related papers (2021-03-04T09:34:32Z) - A Transductive Multi-Head Model for Cross-Domain Few-Shot Learning [72.30054522048553]
We present a new method, Transductive Multi-Head Few-Shot learning (TMHFS), to address the Cross-Domain Few-Shot Learning challenge.
The proposed methods greatly outperform the strong baseline, fine-tuning, on four different target domains.
arXiv Detail & Related papers (2020-06-08T02:39:59Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.