P2Seg: Pointly-supervised Segmentation via Mutual Distillation
- URL: http://arxiv.org/abs/2401.09709v1
- Date: Thu, 18 Jan 2024 03:41:38 GMT
- Title: P2Seg: Pointly-supervised Segmentation via Mutual Distillation
- Authors: Zipeng Wang, Xuehui Yu, Xumeng Han, Wenwen Yu, Zhixun Huang, Jianbin
Jiao, Zhenjun Han
- Abstract summary: We develop a Mutual Distillation Module (MDM) to leverage the complementary strengths of both instance position and semantic information.
Our method achieves 55.7 mAP$_50$ and 17.6 mAP on the PASCAL VOC and MS COCO datasets.
- Score: 23.979786026101024
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Point-level Supervised Instance Segmentation (PSIS) aims to enhance the
applicability and scalability of instance segmentation by utilizing low-cost
yet instance-informative annotations. Existing PSIS methods usually rely on
positional information to distinguish objects, but predicting precise
boundaries remains challenging due to the lack of contour annotations.
Nevertheless, weakly supervised semantic segmentation methods are proficient in
utilizing intra-class feature consistency to capture the boundary contours of
the same semantic regions. In this paper, we design a Mutual Distillation
Module (MDM) to leverage the complementary strengths of both instance position
and semantic information and achieve accurate instance-level object perception.
The MDM consists of Semantic to Instance (S2I) and Instance to Semantic (I2S).
S2I is guided by the precise boundaries of semantic regions to learn the
association between annotated points and instance contours. I2S leverages
discriminative relationships between instances to facilitate the
differentiation of various objects within the semantic map. Extensive
experiments substantiate the efficacy of MDM in fostering the synergy between
instance and semantic information, consequently improving the quality of
instance-level object representations. Our method achieves 55.7 mAP$_{50}$ and
17.6 mAP on the PASCAL VOC and MS COCO datasets, significantly outperforming
recent PSIS methods and several box-supervised instance segmentation
competitors.
Related papers
- Localization and Expansion: A Decoupled Framework for Point Cloud Few-shot Semantic Segmentation [39.7657197805346]
Point cloud few-shot semantic segmentation (PC-FSS) aims to segment targets of novel categories in a given query point cloud with only a few annotated support samples.
We propose a simple yet effective framework in the spirit of Decoupled Localization and Expansion (DLE)
DLE, including a structural localization module (SLM) and a self-expansion module (SEM), enjoys several merits.
arXiv Detail & Related papers (2024-08-25T07:34:32Z) - BAISeg: Boundary Assisted Weakly Supervised Instance Segmentation [9.6046915661065]
How to extract instance-level masks without instance-level supervision is the main challenge of weakly supervised instance segmentation (WSIS)
Popular WSIS methods estimate a displacement field (DF) via learning inter-pixel relations and perform clustering to identify instances.
We propose Boundary-Assisted Instance (BAISeg), which is a novel paradigm for WSIS that realizes instance segmentation with pixel-level annotations.
arXiv Detail & Related papers (2024-05-27T15:14:09Z) - Semantic Connectivity-Driven Pseudo-labeling for Cross-domain
Segmentation [89.41179071022121]
Self-training is a prevailing approach in cross-domain semantic segmentation.
We propose a novel approach called Semantic Connectivity-driven pseudo-labeling.
This approach formulates pseudo-labels at the connectivity level and thus can facilitate learning structured and low-noise semantics.
arXiv Detail & Related papers (2023-12-11T12:29:51Z) - Semantics Meets Temporal Correspondence: Self-supervised Object-centric Learning in Videos [63.94040814459116]
Self-supervised methods have shown remarkable progress in learning high-level semantics and low-level temporal correspondence.
We propose a novel semantic-aware masked slot attention on top of the fused semantic features and correspondence maps.
We adopt semantic- and instance-level temporal consistency as self-supervision to encourage temporally coherent object-centric representations.
arXiv Detail & Related papers (2023-08-19T09:12:13Z) - Advancing Incremental Few-shot Semantic Segmentation via Semantic-guided
Relation Alignment and Adaptation [98.51938442785179]
Incremental few-shot semantic segmentation aims to incrementally extend a semantic segmentation model to novel classes.
This task faces a severe semantic-aliasing issue between base and novel classes due to data imbalance.
We propose the Semantic-guided Relation Alignment and Adaptation (SRAA) method that fully considers the guidance of prior semantic information.
arXiv Detail & Related papers (2023-05-18T10:40:52Z) - Scaling up Multi-domain Semantic Segmentation with Sentence Embeddings [81.09026586111811]
We propose an approach to semantic segmentation that achieves state-of-the-art supervised performance when applied in a zero-shot setting.
This is achieved by replacing each class label with a vector-valued embedding of a short paragraph that describes the class.
The resulting merged semantic segmentation dataset of over 2 Million images enables training a model that achieves performance equal to that of state-of-the-art supervised methods on 7 benchmark datasets.
arXiv Detail & Related papers (2022-02-04T07:19:09Z) - Semi-supervised Domain Adaptation for Semantic Segmentation [3.946367634483361]
We propose a novel two-step semi-supervised dual-domain adaptation (SSDDA) approach to address both cross- and intra-domain gaps in semantic segmentation.
We demonstrate that the proposed approach outperforms state-of-the-art methods on two common synthetic-to-real semantic segmentation benchmarks.
arXiv Detail & Related papers (2021-10-20T16:13:00Z) - Improving Semi-Supervised and Domain-Adaptive Semantic Segmentation with
Self-Supervised Depth Estimation [94.16816278191477]
We present a framework for semi-adaptive and domain-supervised semantic segmentation.
It is enhanced by self-supervised monocular depth estimation trained only on unlabeled image sequences.
We validate the proposed model on the Cityscapes dataset.
arXiv Detail & Related papers (2021-08-28T01:33:38Z) - Semantic Attention and Scale Complementary Network for Instance
Segmentation in Remote Sensing Images [54.08240004593062]
We propose an end-to-end multi-category instance segmentation model, which consists of a Semantic Attention (SEA) module and a Scale Complementary Mask Branch (SCMB)
SEA module contains a simple fully convolutional semantic segmentation branch with extra supervision to strengthen the activation of interest instances on the feature map.
SCMB extends the original single mask branch to trident mask branches and introduces complementary mask supervision at different scales.
arXiv Detail & Related papers (2021-07-25T08:53:59Z) - SASO: Joint 3D Semantic-Instance Segmentation via Multi-scale Semantic
Association and Salient Point Clustering Optimization [8.519716460338518]
We propose a novel 3D point cloud segmentation framework named SASO, which jointly performs semantic and instance segmentation tasks.
For semantic segmentation task, inspired by the inherent correlation among objects in spatial context, we propose a Multi-scale Semantic Association (MSA) module.
For instance segmentation task, different from previous works that utilize clustering only in inference procedure, we propose a Salient Point Clustering Optimization (SPCO) module.
arXiv Detail & Related papers (2020-06-25T08:55:25Z) - A Benchmark for LiDAR-based Panoptic Segmentation based on KITTI [44.79849028988664]
We present an extension of Semantic KITTI for training and evaluation of laser-based panoptic segmentation.
We provide the data and discuss the processing steps needed to enrich a given semantic annotation with temporally consistent instance information.
We present two strong baselines that combine state-of-the-art LiDAR-based semantic segmentation approaches with a state-of-the-art detector.
arXiv Detail & Related papers (2020-03-04T23:44:40Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.