Causal Intervention for Weakly-Supervised Semantic Segmentation
- URL: http://arxiv.org/abs/2009.12547v2
- Date: Wed, 7 Oct 2020 04:20:09 GMT
- Title: Causal Intervention for Weakly-Supervised Semantic Segmentation
- Authors: Dong Zhang, Hanwang Zhang, Jinhui Tang, Xiansheng Hua, Qianru Sun
- Abstract summary: We aim to generate better pixel-level pseudo-masks by using only image-level labels.
We propose a structural causal model to analyze the causalities among images, contexts, and class labels.
Based on it, we develop a new method: Context Adjustment (CONTA), to remove the confounding bias in image-level classification.
- Score: 122.1846968696862
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: We present a causal inference framework to improve Weakly-Supervised Semantic
Segmentation (WSSS). Specifically, we aim to generate better pixel-level
pseudo-masks by using only image-level labels -- the most crucial step in WSSS.
We attribute the cause of the ambiguous boundaries of pseudo-masks to the
confounding context, e.g., the correct image-level classification of "horse"
and "person" may be not only due to the recognition of each instance, but also
their co-occurrence context, making the model inspection (e.g., CAM) hard to
distinguish between the boundaries. Inspired by this, we propose a structural
causal model to analyze the causalities among images, contexts, and class
labels. Based on it, we develop a new method: Context Adjustment (CONTA), to
remove the confounding bias in image-level classification and thus provide
better pseudo-masks as ground-truth for the subsequent segmentation model. On
PASCAL VOC 2012 and MS-COCO, we show that CONTA boosts various popular WSSS
methods to new state-of-the-arts.
Related papers
- SemPLeS: Semantic Prompt Learning for Weakly-Supervised Semantic
Segmentation [36.41778553250247]
Weakly-Supervised Semantic (WSSS) aims to train segmentation models using image data with only image-level supervision.
We propose a Semantic Prompt Learning for WSSS (SemPLeS) framework, which learns to effectively prompt the CLIP latent space.
SemPLeS can perform better semantic alignment between object regions and the associated class labels.
arXiv Detail & Related papers (2024-01-22T09:41:05Z) - Progressive Feature Self-reinforcement for Weakly Supervised Semantic
Segmentation [55.69128107473125]
We propose a single-stage approach for Weakly Supervised Semantic (WSSS) with image-level labels.
We adaptively partition the image content into deterministic regions (e.g., confident foreground and background) and uncertain regions (e.g., object boundaries and misclassified categories) for separate processing.
Building upon this, we introduce a complementary self-enhancement method that constrains the semantic consistency between these confident regions and an augmented image with the same class labels.
arXiv Detail & Related papers (2023-12-14T13:21:52Z) - Semantic Connectivity-Driven Pseudo-labeling for Cross-domain
Segmentation [89.41179071022121]
Self-training is a prevailing approach in cross-domain semantic segmentation.
We propose a novel approach called Semantic Connectivity-driven pseudo-labeling.
This approach formulates pseudo-labels at the connectivity level and thus can facilitate learning structured and low-noise semantics.
arXiv Detail & Related papers (2023-12-11T12:29:51Z) - Generative Semantic Segmentation [40.57488730457299]
We present a generative learning approach for semantic segmentation.
Uniquely, we cast semantic segmentation as an image-conditioned mask generation problem.
Experiments show that our GSS can perform competitively to prior art alternatives in the standard semantic segmentation setting.
arXiv Detail & Related papers (2023-03-20T17:55:37Z) - Pointly-Supervised Panoptic Segmentation [106.68888377104886]
We propose a new approach to applying point-level annotations for weakly-supervised panoptic segmentation.
Instead of the dense pixel-level labels used by fully supervised methods, point-level labels only provide a single point for each target as supervision.
We formulate the problem in an end-to-end framework by simultaneously generating panoptic pseudo-masks from point-level labels and learning from them.
arXiv Detail & Related papers (2022-10-25T12:03:51Z) - A Simple Baseline for Zero-shot Semantic Segmentation with Pre-trained
Vision-language Model [61.58071099082296]
It is unclear how to make zero-shot recognition working well on broader vision problems, such as object detection and semantic segmentation.
In this paper, we target for zero-shot semantic segmentation, by building it on an off-the-shelf pre-trained vision-language model, i.e., CLIP.
Our experimental results show that this simple framework surpasses previous state-of-the-arts by a large margin.
arXiv Detail & Related papers (2021-12-29T18:56:18Z) - POPCORN: Progressive Pseudo-labeling with Consistency Regularization and
Neighboring [3.4253416336476246]
Semi-supervised learning (SSL) uses unlabeled data to compensate for the scarcity of images and the lack of method generalization to unseen domains.
We propose POPCORN, a novel method combining consistency regularization and pseudo-labeling designed for image segmentation.
arXiv Detail & Related papers (2021-09-13T23:36:36Z) - Per-Pixel Classification is Not All You Need for Semantic Segmentation [184.2905747595058]
Mask classification is sufficiently general to solve both semantic- and instance-level segmentation tasks.
We propose MaskFormer, a simple mask classification model which predicts a set of binary masks.
Our method outperforms both current state-of-the-art semantic (55.6 mIoU on ADE20K) and panoptic segmentation (52.7 PQ on COCO) models.
arXiv Detail & Related papers (2021-07-13T17:59:50Z) - Zero-Shot Recognition through Image-Guided Semantic Classification [9.291055558504588]
We present a new embedding-based framework for zero-shot learning (ZSL)
Motivated by the binary relevance method for multi-label classification, we propose to inversely learn the mapping between an image and a semantic classifier.
IGSC is conceptually simple and can be realized by a slight enhancement of an existing deep architecture for classification.
arXiv Detail & Related papers (2020-07-23T06:22:40Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.