SIM: Semantic-aware Instance Mask Generation for Box-Supervised Instance
Segmentation
- URL: http://arxiv.org/abs/2303.08578v1
- Date: Tue, 14 Mar 2023 05:59:25 GMT
- Title: SIM: Semantic-aware Instance Mask Generation for Box-Supervised Instance
Segmentation
- Authors: Ruihuang Li, Chenhang He, Yabin Zhang, Shuai Li, Liyi Chen, Lei Zhang
- Abstract summary: We propose a new box-supervised instance segmentation approach by developing a Semantic-aware Instance Mask (SIM) generation paradigm.
Considering that the semantic-aware prototypes cannot distinguish different instances of the same semantics, we propose a self-correction mechanism.
Extensive experimental results demonstrate the superiority of our proposed SIM approach over other state-of-the-art methods.
- Score: 22.930296667684125
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Weakly supervised instance segmentation using only bounding box annotations
has recently attracted much research attention. Most of the current efforts
leverage low-level image features as extra supervision without explicitly
exploiting the high-level semantic information of the objects, which will
become ineffective when the foreground objects have similar appearances to the
background or other objects nearby. We propose a new box-supervised instance
segmentation approach by developing a Semantic-aware Instance Mask (SIM)
generation paradigm. Instead of heavily relying on local pair-wise affinities
among neighboring pixels, we construct a group of category-wise feature
centroids as prototypes to identify foreground objects and assign them
semantic-level pseudo labels. Considering that the semantic-aware prototypes
cannot distinguish different instances of the same semantics, we propose a
self-correction mechanism to rectify the falsely activated regions while
enhancing the correct ones. Furthermore, to handle the occlusions between
objects, we tailor the Copy-Paste operation for the weakly-supervised instance
segmentation task to augment challenging training data. Extensive experimental
results demonstrate the superiority of our proposed SIM approach over other
state-of-the-art methods. The source code: https://github.com/lslrh/SIM.
Related papers
- Semantics Meets Temporal Correspondence: Self-supervised Object-centric Learning in Videos [63.94040814459116]
Self-supervised methods have shown remarkable progress in learning high-level semantics and low-level temporal correspondence.
We propose a novel semantic-aware masked slot attention on top of the fused semantic features and correspondence maps.
We adopt semantic- and instance-level temporal consistency as self-supervision to encourage temporally coherent object-centric representations.
arXiv Detail & Related papers (2023-08-19T09:12:13Z) - Synthetic Instance Segmentation from Semantic Image Segmentation Masks [15.477053085267404]
We propose a novel paradigm called Synthetic Instance (SISeg)
SISeg instance segmentation results by leveraging image masks generated by existing semantic segmentation models.
In other words, the proposed model does not need extra manpower or higher computational expenses.
arXiv Detail & Related papers (2023-08-02T05:13:02Z) - Perceive, Excavate and Purify: A Novel Object Mining Framework for
Instance Segmentation [4.375012768093524]
We propose a novel object mining framework for instance segmentation.
We first introduce the semantics perceiving subnetwork to capture pixels that may belong to an obvious instance from the bottom up.
In the mechanism, preliminary perceived semantics are regarded as original instances with classifications and locations, and then indistinguishable objects around these original instances are mined.
arXiv Detail & Related papers (2023-04-18T08:47:03Z) - Saliency Guided Inter- and Intra-Class Relation Constraints for Weakly
Supervised Semantic Segmentation [66.87777732230884]
We propose a saliency guided Inter- and Intra-Class Relation Constrained (I$2$CRC) framework to assist the expansion of the activated object regions.
We also introduce an object guided label refinement module to take a full use of both the segmentation prediction and the initial labels for obtaining superior pseudo-labels.
arXiv Detail & Related papers (2022-06-20T03:40:56Z) - Sparse Instance Activation for Real-Time Instance Segmentation [72.23597664935684]
We propose a conceptually novel, efficient, and fully convolutional framework for real-time instance segmentation.
SparseInst has extremely fast inference speed and achieves 40 FPS and 37.9 AP on the COCO benchmark.
arXiv Detail & Related papers (2022-03-24T03:15:39Z) - Learning to Detect Instance-level Salient Objects Using Complementary
Image Labels [55.049347205603304]
We present the first weakly-supervised approach to the salient instance detection problem.
We propose a novel weakly-supervised network with three branches: a Saliency Detection Branch leveraging class consistency information to locate candidate objects; a Boundary Detection Branch exploiting class discrepancy information to delineate object boundaries; and a Centroid Detection Branch using subitizing information to detect salient instance centroids.
arXiv Detail & Related papers (2021-11-19T10:15:22Z) - SOLO: A Simple Framework for Instance Segmentation [84.00519148562606]
"instance categories" assigns categories to each pixel within an instance according to the instance's location.
"SOLO" is a simple, direct, and fast framework for instance segmentation with strong performance.
Our approach achieves state-of-the-art results for instance segmentation in terms of both speed and accuracy.
arXiv Detail & Related papers (2021-06-30T09:56:54Z) - Unsupervised Semantic Segmentation by Contrasting Object Mask Proposals [78.12377360145078]
We introduce a novel two-step framework that adopts a predetermined prior in a contrastive optimization objective to learn pixel embeddings.
This marks a large deviation from existing works that relied on proxy tasks or end-to-end clustering.
In particular, when fine-tuning the learned representations using just 1% of labeled examples on PASCAL, we outperform supervised ImageNet pre-training by 7.1% mIoU.
arXiv Detail & Related papers (2021-02-11T18:54:47Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.