Attentional Prototype Inference for Few-Shot Segmentation
- URL: http://arxiv.org/abs/2105.06668v3
- Date: Tue, 30 May 2023 01:28:07 GMT
- Title: Attentional Prototype Inference for Few-Shot Segmentation
- Authors: Haoliang Sun, Xiankai Lu, Haochen Wang, Yilong Yin, Xiantong Zhen,
Cees G. M. Snoek, and Ling Shao
- Abstract summary: We propose attentional prototype inference (API), a probabilistic latent variable framework for few-shot segmentation.
We define a global latent variable to represent the prototype of each object category, which we model as a probabilistic distribution.
We conduct extensive experiments on four benchmarks, where our proposal obtains at least competitive and often better performance than state-of-the-art prototype-based methods.
- Score: 128.45753577331422
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: This paper aims to address few-shot segmentation. While existing
prototype-based methods have achieved considerable success, they suffer from
uncertainty and ambiguity caused by limited labeled examples. In this work, we
propose attentional prototype inference (API), a probabilistic latent variable
framework for few-shot segmentation. We define a global latent variable to
represent the prototype of each object category, which we model as a
probabilistic distribution. The probabilistic modeling of the prototype
enhances the model's generalization ability by handling the inherent
uncertainty caused by limited data and intra-class variations of objects. To
further enhance the model, we introduce a local latent variable to represent
the attention map of each query image, which enables the model to attend to
foreground objects while suppressing the background. The optimization of the
proposed model is formulated as a variational Bayesian inference problem, which
is established by amortized inference networks. We conduct extensive
experiments on four benchmarks, where our proposal obtains at least competitive
and often better performance than state-of-the-art prototype-based methods. We
also provide comprehensive analyses and ablation studies to gain insight into
the effectiveness of our method for few-shot segmentation.
Related papers
- Unleashing the Potential of the Diffusion Model in Few-shot Semantic Segmentation [56.87049651707208]
Few-shot Semantic has evolved into In-context tasks, morphing into a crucial element in assessing generalist segmentation models.
Our initial focus lies in understanding how to facilitate interaction between the query image and the support image, resulting in the proposal of a KV fusion method within the self-attention framework.
Based on our analysis, we establish a simple and effective framework named DiffewS, maximally retaining the original Latent Diffusion Model's generative framework.
arXiv Detail & Related papers (2024-10-03T10:33:49Z) - Multi-Scale Grouped Prototypes for Interpretable Semantic Segmentation [7.372346036256517]
Prototypical part learning is emerging as a promising approach for making semantic segmentation interpretable.
We propose a method for interpretable semantic segmentation that leverages multi-scale image representation for prototypical part learning.
Experiments conducted on Pascal VOC, Cityscapes, and ADE20K demonstrate that the proposed method increases model sparsity, improves interpretability over existing prototype-based methods, and narrows the performance gap with the non-interpretable counterpart models.
arXiv Detail & Related papers (2024-09-14T17:52:59Z) - Out-of-Distribution Detection via Deep Multi-Comprehension Ensemble [11.542472900306745]
Multi-Comprehension (MC) Ensemble is proposed as a strategy to augment the Out-of-Distribution (OOD) feature representation field.
Our experimental results demonstrate the superior performance of the MC Ensemble strategy in OOD detection.
This underscores the effectiveness of our proposed approach in enhancing the model's capability to detect instances outside its training distribution.
arXiv Detail & Related papers (2024-03-24T18:43:04Z) - An Additive Instance-Wise Approach to Multi-class Model Interpretation [53.87578024052922]
Interpretable machine learning offers insights into what factors drive a certain prediction of a black-box system.
Existing methods mainly focus on selecting explanatory input features, which follow either locally additive or instance-wise approaches.
This work exploits the strengths of both methods and proposes a global framework for learning local explanations simultaneously for multiple target classes.
arXiv Detail & Related papers (2022-07-07T06:50:27Z) - A General Divergence Modeling Strategy for Salient Object Detection [32.53501439077824]
Salient object detection is subjective in nature, which implies that multiple estimations should be related to the same input image.
Most existing salient object detection models are deterministic following a point to point estimation, making them incapable to estimate the predictive distribution.
Although latent variable model based prediction network exists to model the prediction variants, the latent space based on the single clean saliency is less reliable in exploring the subjective nature of saliency.
arXiv Detail & Related papers (2021-11-23T12:47:51Z) - A Prototype-Oriented Framework for Unsupervised Domain Adaptation [52.25537670028037]
We provide a memory and computation-efficient probabilistic framework to extract class prototypes and align the target features with them.
We demonstrate the general applicability of our method on a wide range of scenarios, including single-source, multi-source, class-imbalance, and source-private domain adaptation.
arXiv Detail & Related papers (2021-10-22T19:23:22Z) - SCNet: Enhancing Few-Shot Semantic Segmentation by Self-Contrastive
Background Prototypes [56.387647750094466]
Few-shot semantic segmentation aims to segment novel-class objects in a query image with only a few annotated examples.
Most of advanced solutions exploit a metric learning framework that performs segmentation through matching each pixel to a learned foreground prototype.
This framework suffers from biased classification due to incomplete construction of sample pairs with the foreground prototype only.
arXiv Detail & Related papers (2021-04-19T11:21:47Z) - Part-aware Prototype Network for Few-shot Semantic Segmentation [50.581647306020095]
We propose a novel few-shot semantic segmentation framework based on the prototype representation.
Our key idea is to decompose the holistic class representation into a set of part-aware prototypes.
We develop a novel graph neural network model to generate and enhance the proposed part-aware prototypes.
arXiv Detail & Related papers (2020-07-13T11:03:09Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.