RecurSeed and EdgePredictMix: Pseudo-Label Refinement Learning for
Weakly Supervised Semantic Segmentation across Single- and Multi-Stage
Frameworks
- URL: http://arxiv.org/abs/2204.06754v4
- Date: Fri, 15 Dec 2023 20:47:11 GMT
- Title: RecurSeed and EdgePredictMix: Pseudo-Label Refinement Learning for
Weakly Supervised Semantic Segmentation across Single- and Multi-Stage
Frameworks
- Authors: Sanghyun Jo, In-Jae Yu, Kyungsu Kim
- Abstract summary: weakly supervised semantic segmentation using only image-level labels (WSSS-IL) is potentially useful.
But its low performance and implementation complexity still limit its application.
We propose RecurSeed, which alternately reduces non- and false detections.
We also propose a novel data augmentation (DA) approach called EdgePredictMix, which further expresses an object's edge.
- Score: 3.3240906432768482
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: Although weakly supervised semantic segmentation using only image-level
labels (WSSS-IL) is potentially useful, its low performance and implementation
complexity still limit its application. The main causes are (a) non-detection
and (b) false-detection phenomena: (a) The class activation maps refined from
existing WSSS-IL methods still only represent partial regions for large-scale
objects, and (b) for small-scale objects, over-activation causes them to
deviate from the object edges. We propose RecurSeed, which alternately reduces
non- and false detections through recursive iterations, thereby implicitly
finding an optimal junction that minimizes both errors. We also propose a novel
data augmentation (DA) approach called EdgePredictMix, which further expresses
an object's edge by utilizing the probability difference information between
adjacent pixels in combining the segmentation results, thereby compensating for
the shortcomings when applying the existing DA methods to WSSS. We achieved new
state-of-the-art performances on both the PASCAL VOC 2012 and MS COCO 2014
benchmarks (VOC val: 74.4%, COCO val: 46.4%). The code is available at
https://github.com/shjo-april/RecurSeed_and_EdgePredictMix.
Related papers
- Learning to Detour: Shortcut Mitigating Augmentation for Weakly Supervised Semantic Segmentation [7.5856806269316825]
Weakly supervised semantic segmentation (WSSS) employing weak forms of labels has been actively studied to alleviate the annotation cost of acquiring pixel-level labels.
We propose shortcut mitigating augmentation (SMA) for WSSS, which generates synthetic representations of object-background combinations not seen in the training data to reduce the use of shortcut features.
arXiv Detail & Related papers (2024-05-28T13:07:35Z) - PDiscoNet: Semantically consistent part discovery for fine-grained
recognition [62.12602920807109]
We propose PDiscoNet to discover object parts by using only image-level class labels along with priors encouraging the parts to be.
Our results on CUB, CelebA, and PartImageNet show that the proposed method provides substantially better part discovery performance than previous methods.
arXiv Detail & Related papers (2023-09-06T17:19:29Z) - Small Object Detection via Coarse-to-fine Proposal Generation and
Imitation Learning [52.06176253457522]
We propose a two-stage framework tailored for small object detection based on the Coarse-to-fine pipeline and Feature Imitation learning.
CFINet achieves state-of-the-art performance on the large-scale small object detection benchmarks, SODA-D and SODA-A.
arXiv Detail & Related papers (2023-08-18T13:13:09Z) - Threshold Matters in WSSS: Manipulating the Activation for the Robust
and Accurate Segmentation Model Against Thresholds [16.6833745997519]
Weakly-supervised semantic segmentation (WSSS) has recently gained much attention for its promise to train segmentation models only with image-level labels.
Existing WSSS methods commonly argue that the sparse coverage of CAM incurs the performance bottleneck of WSSS.
This paper provides analytical and empirical evidence that the actual bottleneck may not be sparse coverage but a global thresholding scheme applied after CAM.
arXiv Detail & Related papers (2022-03-30T04:26:14Z) - SparseDet: Improving Sparsely Annotated Object Detection with
Pseudo-positive Mining [76.95808270536318]
We propose an end-to-end system that learns to separate proposals into labeled and unlabeled regions using Pseudo-positive mining.
While the labeled regions are processed as usual, self-supervised learning is used to process the unlabeled regions.
We conduct exhaustive experiments on five splits on the PASCAL-VOC and COCO datasets achieving state-of-the-art performance.
arXiv Detail & Related papers (2022-01-12T18:57:04Z) - Anti-aliasing Semantic Reconstruction for Few-Shot Semantic Segmentation [66.85202434812942]
We reformulate few-shot segmentation as a semantic reconstruction problem.
We convert base class features into a series of basis vectors which span a class-level semantic space for novel class reconstruction.
Our proposed approach, referred to as anti-aliasing semantic reconstruction (ASR), provides a systematic yet interpretable solution for few-shot learning problems.
arXiv Detail & Related papers (2021-06-01T02:17:36Z) - Multi-scale Interactive Network for Salient Object Detection [91.43066633305662]
We propose the aggregate interaction modules to integrate the features from adjacent levels.
To obtain more efficient multi-scale features, the self-interaction modules are embedded in each decoder unit.
Experimental results on five benchmark datasets demonstrate that the proposed method without any post-processing performs favorably against 23 state-of-the-art approaches.
arXiv Detail & Related papers (2020-07-17T15:41:37Z) - CAFENet: Class-Agnostic Few-Shot Edge Detection Network [19.01453512012934]
We tackle a novel few-shot learning challenge, which we call few-shot semantic edge detection.
We also present a Class-Agnostic Few-shot Edge detection Network (CAFENet) based on meta-learning strategy.
arXiv Detail & Related papers (2020-03-18T14:18:59Z) - Frustratingly Simple Few-Shot Object Detection [98.42824677627581]
We find that fine-tuning only the last layer of existing detectors on rare classes is crucial to the few-shot object detection task.
Such a simple approach outperforms the meta-learning methods by roughly 220 points on current benchmarks.
arXiv Detail & Related papers (2020-03-16T00:29:14Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.