SFC: Shared Feature Calibration in Weakly Supervised Semantic
Segmentation
- URL: http://arxiv.org/abs/2401.11719v1
- Date: Mon, 22 Jan 2024 06:43:13 GMT
- Title: SFC: Shared Feature Calibration in Weakly Supervised Semantic
Segmentation
- Authors: Xinqiao Zhao, Feilong Tang, Xiaoyang Wang, Jimin Xiao
- Abstract summary: Image-level weakly supervised semantic segmentation has received increasing attention due to its low annotation cost.
Existing methods mainly rely on Class Mapping (CAM) to obtain pseudo-labels for training semantic segmentation models.
In this work, we are the first to demonstrate that long-tailed distribution in training data can cause the CAM calculated through weights over-activated for head classes and under-activated for tail classes due to the shared features among head- and tail- classes.
- Score: 28.846513129022803
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Image-level weakly supervised semantic segmentation has received increasing
attention due to its low annotation cost. Existing methods mainly rely on Class
Activation Mapping (CAM) to obtain pseudo-labels for training semantic
segmentation models. In this work, we are the first to demonstrate that
long-tailed distribution in training data can cause the CAM calculated through
classifier weights over-activated for head classes and under-activated for tail
classes due to the shared features among head- and tail- classes. This degrades
pseudo-label quality and further influences final semantic segmentation
performance. To address this issue, we propose a Shared Feature Calibration
(SFC) method for CAM generation. Specifically, we leverage the class prototypes
that carry positive shared features and propose a Multi-Scaled
Distribution-Weighted (MSDW) consistency loss for narrowing the gap between the
CAMs generated through classifier weights and class prototypes during training.
The MSDW loss counterbalances over-activation and under-activation by
calibrating the shared features in head-/tail-class classifier weights.
Experimental results show that our SFC significantly improves CAM boundaries
and achieves new state-of-the-art performances. The project is available at
https://github.com/Barrett-python/SFC.
Related papers
- Memory-guided Network with Uncertainty-based Feature Augmentation for Few-shot Semantic Segmentation [12.653336728447654]
We propose a class-shared memory (CSM) module consisting of a set of learnable memory vectors.
These memory vectors learn elemental object patterns from base classes during training whilst re-encoding query features during both training and inference.
We integrate CSM and UFA into representative FSS works, with experimental results on the widely-used PASCAL-5$i$ and COCO-20$i$ datasets.
arXiv Detail & Related papers (2024-06-01T19:53:25Z) - Rethinking Few-shot 3D Point Cloud Semantic Segmentation [62.80639841429669]
This paper revisits few-shot 3D point cloud semantic segmentation (FS-PCS)
We focus on two significant issues in the state-of-the-art: foreground leakage and sparse point distribution.
To address these issues, we introduce a standardized FS-PCS setting, upon which a new benchmark is built.
arXiv Detail & Related papers (2024-03-01T15:14:47Z) - BroadCAM: Outcome-agnostic Class Activation Mapping for Small-scale
Weakly Supervised Applications [69.22739434619531]
We propose an outcome-agnostic CAM approach, called BroadCAM, for small-scale weakly supervised applications.
By evaluating BroadCAM on VOC2012 and BCSS-WSSS for WSSS and OpenImages30k for WSOL, BroadCAM demonstrates superior performance.
arXiv Detail & Related papers (2023-09-07T06:45:43Z) - Prediction Calibration for Generalized Few-shot Semantic Segmentation [101.69940565204816]
Generalized Few-shot Semantic (GFSS) aims to segment each image pixel into either base classes with abundant training examples or novel classes with only a handful of (e.g., 1-5) training images per class.
We build a cross-attention module that guides the classifier's final prediction using the fused multi-level features.
Our PCN outperforms the state-the-art alternatives by large margins.
arXiv Detail & Related papers (2022-10-15T13:30:12Z) - Towards Calibrated Hyper-Sphere Representation via Distribution Overlap
Coefficient for Long-tailed Learning [8.208237033120492]
Long-tailed learning aims to tackle the challenge that head classes dominate the training procedure under severe class imbalance in real-world scenarios.
Motivated by this, we generalize the cosine-based classifiers to a von Mises-Fisher (vMF) mixture model.
We measure representation quality upon the hyper-sphere space via calculating distribution overlap coefficient.
arXiv Detail & Related papers (2022-08-22T03:53:29Z) - Saliency Guided Inter- and Intra-Class Relation Constraints for Weakly
Supervised Semantic Segmentation [66.87777732230884]
We propose a saliency guided Inter- and Intra-Class Relation Constrained (I$2$CRC) framework to assist the expansion of the activated object regions.
We also introduce an object guided label refinement module to take a full use of both the segmentation prediction and the initial labels for obtaining superior pseudo-labels.
arXiv Detail & Related papers (2022-06-20T03:40:56Z) - CAD: Co-Adapting Discriminative Features for Improved Few-Shot
Classification [11.894289991529496]
Few-shot classification is a challenging problem that aims to learn a model that can adapt to unseen classes given a few labeled samples.
Recent approaches pre-train a feature extractor, and then fine-tune for episodic meta-learning.
We propose a strategy to cross-attend and re-weight discriminative features for few-shot classification.
arXiv Detail & Related papers (2022-03-25T06:14:51Z) - Dual Prototypical Contrastive Learning for Few-shot Semantic
Segmentation [55.339405417090084]
We propose a dual prototypical contrastive learning approach tailored to the few-shot semantic segmentation (FSS) task.
The main idea is to encourage the prototypes more discriminative by increasing inter-class distance while reducing intra-class distance in prototype feature space.
We demonstrate that the proposed dual contrastive learning approach outperforms state-of-the-art FSS methods on PASCAL-5i and COCO-20i datasets.
arXiv Detail & Related papers (2021-11-09T08:14:50Z) - Weakly supervised segmentation with cross-modality equivariant
constraints [7.757293476741071]
Weakly supervised learning has emerged as an appealing alternative to alleviate the need for large labeled datasets in semantic segmentation.
We present a novel learning strategy that leverages self-supervision in a multi-modal image scenario to significantly enhance original CAMs.
Our approach outperforms relevant recent literature under the same learning conditions.
arXiv Detail & Related papers (2021-04-06T13:14:20Z) - Fine-Grained Visual Classification with Efficient End-to-end
Localization [49.9887676289364]
We present an efficient localization module that can be fused with a classification network in an end-to-end setup.
We evaluate the new model on the three benchmark datasets CUB200-2011, Stanford Cars and FGVC-Aircraft.
arXiv Detail & Related papers (2020-05-11T14:07:06Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.