Threshold Matters in WSSS: Manipulating the Activation for the Robust
and Accurate Segmentation Model Against Thresholds
- URL: http://arxiv.org/abs/2203.16045v1
- Date: Wed, 30 Mar 2022 04:26:14 GMT
- Title: Threshold Matters in WSSS: Manipulating the Activation for the Robust
and Accurate Segmentation Model Against Thresholds
- Authors: Minhyun Lee, Dongseob Kim, Hyunjung Shim
- Abstract summary: Weakly-supervised semantic segmentation (WSSS) has recently gained much attention for its promise to train segmentation models only with image-level labels.
Existing WSSS methods commonly argue that the sparse coverage of CAM incurs the performance bottleneck of WSSS.
This paper provides analytical and empirical evidence that the actual bottleneck may not be sparse coverage but a global thresholding scheme applied after CAM.
- Score: 16.6833745997519
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Weakly-supervised semantic segmentation (WSSS) has recently gained much
attention for its promise to train segmentation models only with image-level
labels. Existing WSSS methods commonly argue that the sparse coverage of CAM
incurs the performance bottleneck of WSSS. This paper provides analytical and
empirical evidence that the actual bottleneck may not be sparse coverage but a
global thresholding scheme applied after CAM. Then, we show that this issue can
be mitigated by satisfying two conditions; 1) reducing the imbalance in the
foreground activation and 2) increasing the gap between the foreground and the
background activation. Based on these findings, we propose a novel activation
manipulation network with a per-pixel classification loss and a label
conditioning module. Per-pixel classification naturally induces two-level
activation in activation maps, which can penalize the most discriminative
parts, promote the less discriminative parts, and deactivate the background
regions. Label conditioning imposes that the output label of pseudo-masks
should be any of true image-level labels; it penalizes the wrong activation
assigned to non-target classes. Based on extensive analysis and evaluations, we
demonstrate that each component helps produce accurate pseudo-masks, achieving
the robustness against the choice of the global threshold. Finally, our model
achieves state-of-the-art records on both PASCAL VOC 2012 and MS COCO 2014
datasets.
Related papers
- Enhancing Weakly Supervised Semantic Segmentation with Multi-modal Foundation Models: An End-to-End Approach [7.012760526318993]
Weakly-Supervised Semantic (WSSS) offers a cost-efficient workaround to extensive labeling.
Existing WSSS methods have difficulties in learning the boundaries of objects leading to poor segmentation results.
We propose a novel and effective framework that addresses these issues by leveraging visual foundation models inside the bounding box.
arXiv Detail & Related papers (2024-05-10T16:42:25Z) - Progressive Feature Self-reinforcement for Weakly Supervised Semantic
Segmentation [55.69128107473125]
We propose a single-stage approach for Weakly Supervised Semantic (WSSS) with image-level labels.
We adaptively partition the image content into deterministic regions (e.g., confident foreground and background) and uncertain regions (e.g., object boundaries and misclassified categories) for separate processing.
Building upon this, we introduce a complementary self-enhancement method that constrains the semantic consistency between these confident regions and an augmented image with the same class labels.
arXiv Detail & Related papers (2023-12-14T13:21:52Z) - Background Activation Suppression for Weakly Supervised Object
Localization and Semantic Segmentation [84.62067728093358]
Weakly supervised object localization and semantic segmentation aim to localize objects using only image-level labels.
New paradigm has emerged by generating a foreground prediction map to achieve pixel-level localization.
This paper presents two astonishing experimental observations on the object localization learning process.
arXiv Detail & Related papers (2023-09-22T15:44:10Z) - High-fidelity Pseudo-labels for Boosting Weakly-Supervised Segmentation [17.804090651425955]
Image-level weakly-supervised segmentation (WSSS) reduces the usually vast data annotation cost by surrogate segmentation masks during training.
Our work is based on two techniques for improving CAMs; importance sampling, which is a substitute for GAP, and the feature similarity loss.
We reformulate both techniques based on binomial posteriors of multiple independent binary problems.
This has two benefits; their performance is improved and they become more general, resulting in an add-on method that can boost virtually any WSSS method.
arXiv Detail & Related papers (2023-04-05T17:43:57Z) - Ambiguity-Resistant Semi-Supervised Learning for Dense Object Detection [98.66771688028426]
We propose a Ambiguity-Resistant Semi-supervised Learning (ARSL) for one-stage detectors.
Joint-Confidence Estimation (JCE) is proposed to quantifies the classification and localization quality of pseudo labels.
ARSL effectively mitigates the ambiguities and achieves state-of-the-art SSOD performance on MS COCO and PASCAL VOC.
arXiv Detail & Related papers (2023-03-27T07:46:58Z) - Transductive CLIP with Class-Conditional Contrastive Learning [68.51078382124331]
We propose Transductive CLIP, a novel framework for learning a classification network with noisy labels from scratch.
A class-conditional contrastive learning mechanism is proposed to mitigate the reliance on pseudo labels.
ensemble labels is adopted as a pseudo label updating strategy to stabilize the training of deep neural networks with noisy labels.
arXiv Detail & Related papers (2022-06-13T14:04:57Z) - Anti-Adversarially Manipulated Attributions for Weakly Supervised
Semantic Segmentation and Object Localization [31.69344455448125]
We present an attribution map of an image that is manipulated to increase the classification score produced by a classifier before the final softmax or sigmoid layer.
This manipulation is realized in an anti-adversarial manner, so that the original image is perturbed along pixel gradients in directions opposite to those used in an adversarial attack.
In addition, we introduce a new regularization procedure that inhibits the incorrect attribution of regions unrelated to the target object and the excessive concentration of attributions on a small region of the target object.
arXiv Detail & Related papers (2022-04-11T06:18:02Z) - Inferring the Class Conditional Response Map for Weakly Supervised
Semantic Segmentation [27.269847900950943]
We propose a class-conditional inference strategy and an activation aware mask refinement loss function to generate better pseudo labels.
Our method achieves superior WSSS results without requiring re-training of the classifier.
arXiv Detail & Related papers (2021-10-27T09:43:40Z) - Multi-Layer Pseudo-Supervision for Histopathology Tissue Semantic
Segmentation using Patch-level Classification Labels [26.349051136954195]
In this paper, we use only patch-level classification labels to achieve tissue semantic segmentation on histopathology images.
Several technical novelties have been proposed to reduce the information gap between pixel-level and patch-level annotations.
Our proposed model outperforms two state-of-the-art WSSS approaches.
arXiv Detail & Related papers (2021-10-14T08:02:07Z) - Rethinking Pseudo Labels for Semi-Supervised Object Detection [84.697097472401]
We introduce certainty-aware pseudo labels tailored for object detection.
We dynamically adjust the thresholds used to generate pseudo labels and reweight loss functions for each category to alleviate the class imbalance problem.
Our approach improves supervised baselines by up to 10% AP using only 1-10% labeled data from COCO.
arXiv Detail & Related papers (2021-06-01T01:32:03Z) - Causal Intervention for Weakly-Supervised Semantic Segmentation [122.1846968696862]
We aim to generate better pixel-level pseudo-masks by using only image-level labels.
We propose a structural causal model to analyze the causalities among images, contexts, and class labels.
Based on it, we develop a new method: Context Adjustment (CONTA), to remove the confounding bias in image-level classification.
arXiv Detail & Related papers (2020-09-26T09:26:29Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.