PFENet++: Boosting Few-shot Semantic Segmentation with the
Noise-filtered Context-aware Prior Mask
- URL: http://arxiv.org/abs/2109.13788v2
- Date: Tue, 21 Nov 2023 09:08:50 GMT
- Title: PFENet++: Boosting Few-shot Semantic Segmentation with the
Noise-filtered Context-aware Prior Mask
- Authors: Xiaoliu Luo, Zhuotao Tian, Taiping Zhang, Bei Yu, Yuan Yan Tang, Jiaya
Jia
- Abstract summary: We revisit the prior mask guidance proposed in Guided Feature Enrichment Network for Few-Shot''
We propose the Context-aware Prior Mask (CAPM) that leverages additional nearby semantic cues for better locating the objects in query images.
We take one step further by incorporating a lightweight Noise Suppression Module (NSM) to screen out the unnecessary responses.
- Score: 62.37727055343632
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In this work, we revisit the prior mask guidance proposed in ``Prior Guided
Feature Enrichment Network for Few-Shot Segmentation''. The prior mask serves
as an indicator that highlights the region of interests of unseen categories,
and it is effective in achieving better performance on different frameworks of
recent studies. However, the current method directly takes the maximum
element-to-element correspondence between the query and support features to
indicate the probability of belonging to the target class, thus the broader
contextual information is seldom exploited during the prior mask generation. To
address this issue, first, we propose the Context-aware Prior Mask (CAPM) that
leverages additional nearby semantic cues for better locating the objects in
query images. Second, since the maximum correlation value is vulnerable to
noisy features, we take one step further by incorporating a lightweight Noise
Suppression Module (NSM) to screen out the unnecessary responses, yielding
high-quality masks for providing the prior knowledge. Both two contributions
are experimentally shown to have substantial practical merit, and the new model
named PFENet++ significantly outperforms the baseline PFENet as well as all
other competitors on three challenging benchmarks PASCAL-5$^i$, COCO-20$^i$ and
FSS-1000. The new state-of-the-art performance is achieved without compromising
the efficiency, manifesting the potential for being a new strong baseline in
few-shot semantic segmentation. Our code will be available at
https://github.com/luoxiaoliu/PFENet2Plus.
Related papers
- Bridge the Points: Graph-based Few-shot Segment Anything Semantically [79.1519244940518]
Recent advancements in pre-training techniques have enhanced the capabilities of vision foundation models.
Recent studies extend the SAM to Few-shot Semantic segmentation (FSS)
We propose a simple yet effective approach based on graph analysis.
arXiv Detail & Related papers (2024-10-09T15:02:28Z) - Semantic Refocused Tuning for Open-Vocabulary Panoptic Segmentation [42.020470627552136]
Open-vocabulary panoptic segmentation is an emerging task aiming to accurately segment the image into semantically meaningful masks.
mask classification is the main performance bottleneck for open-vocab panoptic segmentation.
We propose Semantic Refocused Tuning, a novel framework that greatly enhances open-vocab panoptic segmentation.
arXiv Detail & Related papers (2024-09-24T17:50:28Z) - ColorMAE: Exploring data-independent masking strategies in Masked AutoEncoders [53.3185750528969]
Masked AutoEncoders (MAE) have emerged as a robust self-supervised framework.
We introduce a data-independent method, termed ColorMAE, which generates different binary mask patterns by filtering random noise.
We demonstrate our strategy's superiority in downstream tasks compared to random masking.
arXiv Detail & Related papers (2024-07-17T22:04:00Z) - Variance-insensitive and Target-preserving Mask Refinement for
Interactive Image Segmentation [68.16510297109872]
Point-based interactive image segmentation can ease the burden of mask annotation in applications such as semantic segmentation and image editing.
We introduce a novel method, Variance-Insensitive and Target-Preserving Mask Refinement to enhance segmentation quality with fewer user inputs.
Experiments on GrabCut, Berkeley, SBD, and DAVIS datasets demonstrate our method's state-of-the-art performance in interactive image segmentation.
arXiv Detail & Related papers (2023-12-22T02:31:31Z) - Temporal-aware Hierarchical Mask Classification for Video Semantic
Segmentation [62.275143240798236]
Video semantic segmentation dataset has limited categories per video.
Less than 10% of queries could be matched to receive meaningful gradient updates during VSS training.
Our method achieves state-of-the-art performance on the latest challenging VSS benchmark VSPW without bells and whistles.
arXiv Detail & Related papers (2023-09-14T20:31:06Z) - MaskRange: A Mask-classification Model for Range-view based LiDAR
Segmentation [34.04740351544143]
We propose a unified mask-classification model, MaskRange, for the range-view based LiDAR semantic and panoptic segmentation.
Our MaskRange achieves state-of-the-art performance with $66.10$ mIoU on semantic segmentation and promising results with $53.10$ PQ on panoptic segmentation with high efficiency.
arXiv Detail & Related papers (2022-06-24T04:39:49Z) - HMFS: Hybrid Masking for Few-Shot Segmentation [27.49000348046462]
We develop a simple, effective, and efficient approach to enhance feature masking (FM)
We compensate for the loss of fine-grained spatial details in FM technique by investigating and leveraging a complementary basic input masking method.
Experimental results on three publicly available benchmarks reveal that HMFS outperforms the current state-of-the-art methods by visible margins.
arXiv Detail & Related papers (2022-03-24T03:07:20Z) - SE-PSNet: Silhouette-based Enhancement Feature for Panoptic Segmentation
Network [5.353718408751182]
We propose a solution to tackle the panoptic segmentation task.
The structure combines the bottom-up method and the top-down method.
The network mainly pays attention to the quality of the mask.
arXiv Detail & Related papers (2021-07-11T17:20:32Z) - Prior Guided Feature Enrichment Network for Few-Shot Segmentation [64.91560451900125]
State-of-the-art semantic segmentation methods require sufficient labeled data to achieve good results.
Few-shot segmentation is proposed to tackle this problem by learning a model that quickly adapts to new classes with a few labeled support samples.
Theses frameworks still face the challenge of generalization ability reduction on unseen classes due to inappropriate use of high-level semantic information.
arXiv Detail & Related papers (2020-08-04T10:41:32Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.