Learning Class-Agnostic Pseudo Mask Generation for Box-Supervised
Semantic Segmentation
- URL: http://arxiv.org/abs/2103.05463v1
- Date: Tue, 9 Mar 2021 14:54:54 GMT
- Title: Learning Class-Agnostic Pseudo Mask Generation for Box-Supervised
Semantic Segmentation
- Authors: Chaohao Xie, Dongwei Ren, Lei Wang, Qinghua Hu, Liang Lin, Wangmeng
Zuo
- Abstract summary: We seek for a more accurate learning-based class-agnostic pseudo mask generator tailored to box-supervised semantic segmentation.
Our method can further close the performance gap between box-supervised and fully-supervised models.
- Score: 156.9155100983315
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Recently, several weakly supervised learning methods have been devoted to
utilize bounding box supervision for training deep semantic segmentation
models. Most existing methods usually leverage the generic proposal generators
(\eg, dense CRF and MCG) to produce enhanced segmentation masks for further
training segmentation models. These proposal generators, however, are generic
and not specifically designed for box-supervised semantic segmentation, thereby
leaving some leeway for improving segmentation performance. In this paper, we
aim at seeking for a more accurate learning-based class-agnostic pseudo mask
generator tailored to box-supervised semantic segmentation. To this end, we
resort to a pixel-level annotated auxiliary dataset where the class labels are
non-overlapped with those of the box-annotated dataset. For learning pseudo
mask generator from the auxiliary dataset, we present a bi-level optimization
formulation. In particular, the lower subproblem is used to learn
box-supervised semantic segmentation, while the upper subproblem is used to
learn an optimal class-agnostic pseudo mask generator. The learned pseudo
segmentation mask generator can then be deployed to the box-annotated dataset
for improving weakly supervised semantic segmentation. Experiments on PASCAL
VOC 2012 dataset show that the learned pseudo mask generator is effective in
boosting segmentation performance, and our method can further close the
performance gap between box-supervised and fully-supervised models. Our code
will be made publicly available at
https://github.com/Vious/LPG_BBox_Segmentation .
Related papers
- Bridge the Points: Graph-based Few-shot Segment Anything Semantically [79.1519244940518]
Recent advancements in pre-training techniques have enhanced the capabilities of vision foundation models.
Recent studies extend the SAM to Few-shot Semantic segmentation (FSS)
We propose a simple yet effective approach based on graph analysis.
arXiv Detail & Related papers (2024-10-09T15:02:28Z) - MaskUno: Switch-Split Block For Enhancing Instance Segmentation [0.0]
We propose replacing mask prediction with a Switch-Split block that processes refined ROIs, classifies them, and assigns them to specialized mask predictors.
An increase in the mean Average Precision (mAP) of 2.03% was observed for the high-performing DetectoRS when trained on 80 classes.
arXiv Detail & Related papers (2024-07-31T10:12:14Z) - Extreme Point Supervised Instance Segmentation [28.191795758445352]
This paper introduces a novel approach to learning instance segmentation using extreme points, i.e., the topmost, leftmost, bottommost, and rightmost points, of each object.
These points are readily available in the modern bounding box annotation process while offering strong clues for precise segmentation.
Our model generates high-quality masks when a target object is separated into multiple parts, where previous box-supervised methods often fail.
arXiv Detail & Related papers (2024-05-31T09:37:39Z) - SegRefiner: Towards Model-Agnostic Segmentation Refinement with Discrete
Diffusion Process [102.18226145874007]
We propose a model-agnostic solution called SegRefiner to enhance the quality of object masks produced by different segmentation models.
SegRefiner takes coarse masks as inputs and refines them using a discrete diffusion process.
It consistently improves both the segmentation metrics and boundary metrics across different types of coarse masks.
arXiv Detail & Related papers (2023-12-19T18:53:47Z) - Segment Anything is A Good Pseudo-label Generator for Weakly Supervised
Semantic Segmentation [8.59909129321984]
This report explores the potential of 'prompt to masks' from the powerful class-agnostic large segmentation model, segment-anything.
Different weak labels are used as prompts to the segment-anything model, generating precise class masks.
Experiments demonstrate that segment-anything can serve as a good pseudo-label generator.
arXiv Detail & Related papers (2023-05-02T09:22:38Z) - MaskRange: A Mask-classification Model for Range-view based LiDAR
Segmentation [34.04740351544143]
We propose a unified mask-classification model, MaskRange, for the range-view based LiDAR semantic and panoptic segmentation.
Our MaskRange achieves state-of-the-art performance with $66.10$ mIoU on semantic segmentation and promising results with $53.10$ PQ on panoptic segmentation with high efficiency.
arXiv Detail & Related papers (2022-06-24T04:39:49Z) - Discovering Object Masks with Transformers for Unsupervised Semantic
Segmentation [75.00151934315967]
MaskDistill is a novel framework for unsupervised semantic segmentation.
Our framework does not latch onto low-level image cues and is not limited to object-centric datasets.
arXiv Detail & Related papers (2022-06-13T17:59:43Z) - Box-Adapt: Domain-Adaptive Medical Image Segmentation using Bounding
BoxSupervision [52.45336255472669]
We propose a weakly supervised do-main adaptation setting for deep learning.
Box-Adapt fully explores the fine-grained segmenta-tion mask in the source domain and the weak bounding box in the target domain.
We demonstrate the effectiveness of our method in the liver segmentation task.
arXiv Detail & Related papers (2021-08-19T01:51:04Z) - Per-Pixel Classification is Not All You Need for Semantic Segmentation [184.2905747595058]
Mask classification is sufficiently general to solve both semantic- and instance-level segmentation tasks.
We propose MaskFormer, a simple mask classification model which predicts a set of binary masks.
Our method outperforms both current state-of-the-art semantic (55.6 mIoU on ADE20K) and panoptic segmentation (52.7 PQ on COCO) models.
arXiv Detail & Related papers (2021-07-13T17:59:50Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.