Relevant Intrinsic Feature Enhancement Network for Few-Shot Semantic
Segmentation
- URL: http://arxiv.org/abs/2312.06474v1
- Date: Mon, 11 Dec 2023 16:02:57 GMT
- Title: Relevant Intrinsic Feature Enhancement Network for Few-Shot Semantic
Segmentation
- Authors: Xiaoyi Bao, Jie Qin, Siyang Sun, Yun Zheng, Xingang Wang
- Abstract summary: We propose the Relevant Intrinsic Feature Enhancement Network (RiFeNet) to improve semantic consistency of foreground instances.
RiFeNet surpasses the state-of-the-art methods on PASCAL-5i and COCO benchmarks.
- Score: 34.257289290796315
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: For few-shot semantic segmentation, the primary task is to extract
class-specific intrinsic information from limited labeled data. However, the
semantic ambiguity and inter-class similarity of previous methods limit the
accuracy of pixel-level foreground-background classification. To alleviate
these issues, we propose the Relevant Intrinsic Feature Enhancement Network
(RiFeNet). To improve the semantic consistency of foreground instances, we
propose an unlabeled branch as an efficient data utilization method, which
teaches the model how to extract intrinsic features robust to intra-class
differences. Notably, during testing, the proposed unlabeled branch is excluded
without extra unlabeled data and computation. Furthermore, we extend the
inter-class variability between foreground and background by proposing a novel
multi-level prototype generation and interaction module. The different-grained
complementarity between global and local prototypes allows for better
distinction between similar categories. The qualitative and quantitative
performance of RiFeNet surpasses the state-of-the-art methods on PASCAL-5i and
COCO benchmarks.
Related papers
- Collaborative Feature-Logits Contrastive Learning for Open-Set Semi-Supervised Object Detection [75.02249869573994]
In open-set scenarios, the unlabeled dataset contains both in-distribution (ID) classes and out-of-distribution (OOD) classes.
Applying semi-supervised detectors in such settings can lead to misclassifying OOD class as ID classes.
We propose a simple yet effective method, termed Collaborative Feature-Logits Detector (CFL-Detector)
arXiv Detail & Related papers (2024-11-20T02:57:35Z) - SSA-Seg: Semantic and Spatial Adaptive Pixel-level Classifier for Semantic Segmentation [11.176993272867396]
In this paper, we propose a novel Semantic and Spatial Adaptive (SSA-Seg) to address the challenges of semantic segmentation.
Specifically, we employ the coarse masks obtained from the fixed prototypes as a guide to adjust the fixed prototype towards the center of the semantic and spatial domains in the test image.
Results show that the proposed SSA-Seg significantly improves the segmentation performance of the baseline models with only a minimal increase in computational cost.
arXiv Detail & Related papers (2024-05-10T15:14:23Z) - Learning Context-aware Classifier for Semantic Segmentation [88.88198210948426]
In this paper, contextual hints are exploited via learning a context-aware classifier.
Our method is model-agnostic and can be easily applied to generic segmentation models.
With only negligible additional parameters and +2% inference time, decent performance gain has been achieved on both small and large models.
arXiv Detail & Related papers (2023-03-21T07:00:35Z) - An Upper Bound for the Distribution Overlap Index and Its Applications [18.481370450591317]
This paper proposes an easy-to-compute upper bound for the overlap index between two probability distributions.
The proposed bound shows its value in one-class classification and domain shift analysis.
Our work shows significant promise toward broadening the applications of overlap-based metrics.
arXiv Detail & Related papers (2022-12-16T20:02:03Z) - Interclass Prototype Relation for Few-Shot Segmentation [0.0]
With few-shot segmentation, the target class data distribution in the feature space is sparse and has low coverage because of the slight variations in the sample data.
This study proposes the Interclass Prototype Relation Network (IPRNet) which improves the separation performance by reducing the similarity between other classes.
arXiv Detail & Related papers (2022-11-16T05:27:52Z) - A Task-aware Dual Similarity Network for Fine-grained Few-shot Learning [19.90385022248391]
Task-aware Dual Similarity Network( TDSNet) proposed to explore global invariant features and discriminative local details.
TDSNet achieves competitive performance by comparing with other state-of-the-art algorithms.
arXiv Detail & Related papers (2022-10-22T04:24:55Z) - Multi-dataset Pretraining: A Unified Model for Semantic Segmentation [97.61605021985062]
We propose a unified framework, termed as Multi-Dataset Pretraining, to take full advantage of the fragmented annotations of different datasets.
This is achieved by first pretraining the network via the proposed pixel-to-prototype contrastive loss over multiple datasets.
In order to better model the relationship among images and classes from different datasets, we extend the pixel level embeddings via cross dataset mixing.
arXiv Detail & Related papers (2021-06-08T06:13:11Z) - Semi-Supervised Semantic Segmentation with Pixel-Level Contrastive
Learning from a Class-wise Memory Bank [5.967279020820772]
We propose a novel representation learning module based on contrastive learning.
This module enforces the segmentation network to yield similar pixel-level feature representations for same-class samples.
In an end-to-end training, the features from both labeled and unlabeled data are optimized to be similar to same-class samples from the memory bank.
arXiv Detail & Related papers (2021-04-27T18:19:33Z) - Minimax Active Learning [61.729667575374606]
Active learning aims to develop label-efficient algorithms by querying the most representative samples to be labeled by a human annotator.
Current active learning techniques either rely on model uncertainty to select the most uncertain samples or use clustering or reconstruction to choose the most diverse set of unlabeled examples.
We develop a semi-supervised minimax entropy-based active learning algorithm that leverages both uncertainty and diversity in an adversarial manner.
arXiv Detail & Related papers (2020-12-18T19:03:40Z) - ReMarNet: Conjoint Relation and Margin Learning for Small-Sample Image
Classification [49.87503122462432]
We introduce a novel neural network termed Relation-and-Margin learning Network (ReMarNet)
Our method assembles two networks of different backbones so as to learn the features that can perform excellently in both of the aforementioned two classification mechanisms.
Experiments on four image datasets demonstrate that our approach is effective in learning discriminative features from a small set of labeled samples.
arXiv Detail & Related papers (2020-06-27T13:50:20Z) - Few-shot 3D Point Cloud Semantic Segmentation [138.80825169240302]
We propose a novel attention-aware multi-prototype transductive few-shot point cloud semantic segmentation method.
Our proposed method shows significant and consistent improvements compared to baselines in different few-shot point cloud semantic segmentation settings.
arXiv Detail & Related papers (2020-06-22T08:05:25Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.