Revisiting Weak-to-Strong Consistency in Semi-Supervised Semantic
Segmentation
- URL: http://arxiv.org/abs/2208.09910v2
- Date: Sun, 26 Mar 2023 07:10:13 GMT
- Title: Revisiting Weak-to-Strong Consistency in Semi-Supervised Semantic
Segmentation
- Authors: Lihe Yang, Lei Qi, Litong Feng, Wayne Zhang, Yinghuan Shi
- Abstract summary: We revisit the weak-to-strong consistency framework popularized by FixMatch from semi-supervised classification.
We propose an auxiliary feature perturbation stream as a supplement, leading to an expanded perturbation space.
Our overall Unified Dual-Stream Perturbations approach (UniMatch) surpasses all existing methods significantly across all evaluation protocols.
- Score: 27.831267434546024
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In this work, we revisit the weak-to-strong consistency framework,
popularized by FixMatch from semi-supervised classification, where the
prediction of a weakly perturbed image serves as supervision for its strongly
perturbed version. Intriguingly, we observe that such a simple pipeline already
achieves competitive results against recent advanced works, when transferred to
our segmentation scenario. Its success heavily relies on the manual design of
strong data augmentations, however, which may be limited and inadequate to
explore a broader perturbation space. Motivated by this, we propose an
auxiliary feature perturbation stream as a supplement, leading to an expanded
perturbation space. On the other, to sufficiently probe original image-level
augmentations, we present a dual-stream perturbation technique, enabling two
strong views to be simultaneously guided by a common weak view. Consequently,
our overall Unified Dual-Stream Perturbations approach (UniMatch) surpasses all
existing methods significantly across all evaluation protocols on the Pascal,
Cityscapes, and COCO benchmarks. Its superiority is also demonstrated in remote
sensing interpretation and medical image analysis. We hope our reproduced
FixMatch and our results can inspire more future works. Code and logs are
available at https://github.com/LiheYoung/UniMatch.
Related papers
- Visual Prompt Tuning in Null Space for Continual Learning [51.96411454304625]
Existing prompt-tuning methods have demonstrated impressive performances in continual learning (CL)
This paper aims to learn each task by tuning the prompts in the direction orthogonal to the subspace spanned by previous tasks' features.
In practice, an effective null-space-based approximation solution has been proposed to implement the prompt gradient projection.
arXiv Detail & Related papers (2024-06-09T05:57:40Z) - DuPL: Dual Student with Trustworthy Progressive Learning for Robust Weakly Supervised Semantic Segmentation [6.775785126617824]
We propose a dual student framework with trustworthy progressive learning (DuPL)
Experiment results demonstrate the superiority of the proposed DuPL over the recent state-of-the-art alternatives on PASCAL VOC 2012 and MS datasets.
arXiv Detail & Related papers (2024-03-17T12:14:34Z) - Unified Domain Adaptive Semantic Segmentation [96.74199626935294]
Unsupervised Adaptive Domain Semantic (UDA-SS) aims to transfer the supervision from a labeled source domain to an unlabeled target domain.
We propose a Quad-directional Mixup (QuadMix) method, characterized by tackling distinct point attributes and feature inconsistencies.
Our method outperforms the state-of-the-art works by large margins on four challenging UDA-SS benchmarks.
arXiv Detail & Related papers (2023-11-22T09:18:49Z) - Prompting Diffusion Representations for Cross-Domain Semantic
Segmentation [101.04326113360342]
diffusion-pretraining achieves extraordinary domain generalization results for semantic segmentation.
We introduce a scene prompt and a prompt randomization strategy to help further disentangle the domain-invariant information when training the segmentation head.
arXiv Detail & Related papers (2023-07-05T09:28:25Z) - ConMatch: Semi-Supervised Learning with Confidence-Guided Consistency
Regularization [26.542718087103665]
We present a novel semi-supervised learning framework that intelligently leverages the consistency regularization between the model's predictions from two strongly-augmented views of an image, weighted by a confidence of pseudo-label, dubbed ConMatch.
We conduct experiments to demonstrate the effectiveness of our ConMatch over the latest methods and provide extensive ablation studies.
arXiv Detail & Related papers (2022-08-18T04:37:50Z) - Consistency Regularization for Deep Face Anti-Spoofing [69.70647782777051]
Face anti-spoofing (FAS) plays a crucial role in securing face recognition systems.
Motivated by this exciting observation, we conjecture that encouraging feature consistency of different views may be a promising way to boost FAS models.
We enhance both Embedding-level and Prediction-level Consistency Regularization (EPCR) in FAS.
arXiv Detail & Related papers (2021-11-24T08:03:48Z) - Mixed-supervised segmentation: Confidence maximization helps knowledge
distillation [24.892332859630518]
In this work, we propose a dual-branch architecture for deep neural networks.
The upper branch (teacher) receives strong annotations, while the bottom one (student) is driven by limited supervision and guided by the upper branch.
We show that the synergy between the entropy and KL divergence yields substantial improvements in performance.
arXiv Detail & Related papers (2021-09-21T20:06:13Z) - Semi-supervised Semantic Segmentation with Directional Context-aware
Consistency [66.49995436833667]
We focus on the semi-supervised segmentation problem where only a small set of labeled data is provided with a much larger collection of totally unlabeled images.
A preferred high-level representation should capture the contextual information while not losing self-awareness.
We present the Directional Contrastive Loss (DC Loss) to accomplish the consistency in a pixel-to-pixel manner.
arXiv Detail & Related papers (2021-06-27T03:42:40Z) - WSSOD: A New Pipeline for Weakly- and Semi-Supervised Object Detection [75.80075054706079]
We propose a weakly- and semi-supervised object detection framework (WSSOD)
An agent detector is first trained on a joint dataset and then used to predict pseudo bounding boxes on weakly-annotated images.
The proposed framework demonstrates remarkable performance on PASCAL-VOC and MSCOCO benchmark, achieving a high performance comparable to those obtained in fully-supervised settings.
arXiv Detail & Related papers (2021-05-21T11:58:50Z) - Teach me to segment with mixed supervision: Confident students become
masters [27.976487552313113]
Deep segmentation neural networks require large training datasets with pixel-wise segmentations, which are expensive to obtain in practice.
We propose a dual-branch architecture, where the upper branch (teacher) receives strong annotations, while the bottom one (student) is driven by limited supervision and guided by the upper branch.
We demonstrate that our method significantly outperforms other strategies to tackle semantic segmentation within a mixed-supervision framework.
arXiv Detail & Related papers (2020-12-15T02:51:36Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.