Advancing Incremental Few-shot Semantic Segmentation via Semantic-guided
Relation Alignment and Adaptation
- URL: http://arxiv.org/abs/2305.10868v1
- Date: Thu, 18 May 2023 10:40:52 GMT
- Title: Advancing Incremental Few-shot Semantic Segmentation via Semantic-guided
Relation Alignment and Adaptation
- Authors: Yuan Zhou, Xin Chen, Yanrong Guo, Shijie Hao, Richang Hong, Qi Tian
- Abstract summary: Incremental few-shot semantic segmentation aims to incrementally extend a semantic segmentation model to novel classes.
This task faces a severe semantic-aliasing issue between base and novel classes due to data imbalance.
We propose the Semantic-guided Relation Alignment and Adaptation (SRAA) method that fully considers the guidance of prior semantic information.
- Score: 98.51938442785179
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Incremental few-shot semantic segmentation (IFSS) aims to incrementally
extend a semantic segmentation model to novel classes according to only a few
pixel-level annotated data, while preserving its segmentation capability on
previously learned base categories. This task faces a severe semantic-aliasing
issue between base and novel classes due to data imbalance, which makes
segmentation results unsatisfactory. To alleviate this issue, we propose the
Semantic-guided Relation Alignment and Adaptation (SRAA) method that fully
considers the guidance of prior semantic information. Specifically, we first
conduct Semantic Relation Alignment (SRA) in the base step, so as to
semantically align base class representations to their semantics. As a result,
the embeddings of base classes are constrained to have relatively low semantic
correlations to categories that are different from them. Afterwards, based on
the semantically aligned base categories, Semantic-Guided Adaptation (SGA) is
employed during the incremental learning stage. It aims to ensure affinities
between visual and semantic embeddings of encountered novel categories, thereby
making the feature representations be consistent with their semantic
information. In this way, the semantic-aliasing issue can be suppressed. We
evaluate our model on the PASCAL VOC 2012 and the COCO dataset. The
experimental results on both these two datasets exhibit its competitive
performance, which demonstrates the superiority of our method.
Related papers
- Learning to Detour: Shortcut Mitigating Augmentation for Weakly Supervised Semantic Segmentation [7.5856806269316825]
Weakly supervised semantic segmentation (WSSS) employing weak forms of labels has been actively studied to alleviate the annotation cost of acquiring pixel-level labels.
We propose shortcut mitigating augmentation (SMA) for WSSS, which generates synthetic representations of object-background combinations not seen in the training data to reduce the use of shortcut features.
arXiv Detail & Related papers (2024-05-28T13:07:35Z) - Semantic Connectivity-Driven Pseudo-labeling for Cross-domain
Segmentation [89.41179071022121]
Self-training is a prevailing approach in cross-domain semantic segmentation.
We propose a novel approach called Semantic Connectivity-driven pseudo-labeling.
This approach formulates pseudo-labels at the connectivity level and thus can facilitate learning structured and low-noise semantics.
arXiv Detail & Related papers (2023-12-11T12:29:51Z) - Open-Vocabulary Segmentation with Semantic-Assisted Calibration [73.39366775301382]
We study open-vocabulary segmentation (OVS) through calibrating in-vocabulary and domain-biased embedding space with contextual prior of CLIP.
We present a Semantic-assisted CAlibration Network (SCAN) to achieve state-of-the-art performance on open-vocabulary segmentation benchmarks.
arXiv Detail & Related papers (2023-12-07T07:00:09Z) - Primitive Generation and Semantic-related Alignment for Universal
Zero-Shot Segmentation [13.001629605405954]
We study universal zero-shot segmentation in this work to achieve panoptic, instance, and semantic segmentation for novel categories without any training samples.
We introduce a generative model to synthesize features for unseen categories, which links semantic and visual spaces.
The proposed approach achieves impressively state-of-the-art performance on zero-shot panoptic segmentation, instance segmentation, and semantic segmentation.
arXiv Detail & Related papers (2023-06-19T17:59:16Z) - Class Enhancement Losses with Pseudo Labels for Zero-shot Semantic
Segmentation [40.09476732999614]
Mask proposal models have significantly improved the performance of zero-shot semantic segmentation.
The use of a background' embedding during training in these methods is problematic as the resulting model tends to over-learn and assign all unseen classes as the background class instead of their correct labels.
This paper proposes novel class enhancement losses to bypass the use of the background embbedding during training, and simultaneously exploit the semantic relationship between text embeddings and mask proposals by ranking the similarity scores.
arXiv Detail & Related papers (2023-01-18T06:55:02Z) - Graph Adaptive Semantic Transfer for Cross-domain Sentiment
Classification [68.06496970320595]
Cross-domain sentiment classification (CDSC) aims to use the transferable semantics learned from the source domain to predict the sentiment of reviews in the unlabeled target domain.
We present Graph Adaptive Semantic Transfer (GAST) model, an adaptive syntactic graph embedding method that is able to learn domain-invariant semantics from both word sequences and syntactic graphs.
arXiv Detail & Related papers (2022-05-18T07:47:01Z) - Scaling up Multi-domain Semantic Segmentation with Sentence Embeddings [81.09026586111811]
We propose an approach to semantic segmentation that achieves state-of-the-art supervised performance when applied in a zero-shot setting.
This is achieved by replacing each class label with a vector-valued embedding of a short paragraph that describes the class.
The resulting merged semantic segmentation dataset of over 2 Million images enables training a model that achieves performance equal to that of state-of-the-art supervised methods on 7 benchmark datasets.
arXiv Detail & Related papers (2022-02-04T07:19:09Z) - Weak-shot Semantic Segmentation by Transferring Semantic Affinity and
Boundary [23.331708585468814]
We show that existing fully-annotated base categories can help segment objects of novel categories with only image-level labels.
We propose a method under the WSSS framework to transfer semantic affinity and boundary from base categories to novel ones.
arXiv Detail & Related papers (2021-10-04T15:37:25Z) - Anti-aliasing Semantic Reconstruction for Few-Shot Semantic Segmentation [66.85202434812942]
We reformulate few-shot segmentation as a semantic reconstruction problem.
We convert base class features into a series of basis vectors which span a class-level semantic space for novel class reconstruction.
Our proposed approach, referred to as anti-aliasing semantic reconstruction (ASR), provides a systematic yet interpretable solution for few-shot learning problems.
arXiv Detail & Related papers (2021-06-01T02:17:36Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.