Few-shot Image Classification with Multi-Facet Prototypes
- URL: http://arxiv.org/abs/2102.00801v1
- Date: Mon, 1 Feb 2021 12:43:03 GMT
- Title: Few-shot Image Classification with Multi-Facet Prototypes
- Authors: Kun Yan, Zied Bouraoui, Ping Wang, Shoaib Jameel, Steven Schockaert
- Abstract summary: We organize visual features into facets, which intuitively group features of the same kind.
It is possible to predict facet importance from a pre-trained embedding of the category names.
In particular, we propose an adaptive similarity measure, relying on predicted facet importance weights for a given set of categories.
- Score: 48.583388368897126
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The aim of few-shot learning (FSL) is to learn how to recognize image
categories from a small number of training examples. A central challenge is
that the available training examples are normally insufficient to determine
which visual features are most characteristic of the considered categories. To
address this challenge, we organize these visual features into facets, which
intuitively group features of the same kind (e.g. features that are relevant to
shape, color, or texture). This is motivated from the assumption that (i) the
importance of each facet differs from category to category and (ii) it is
possible to predict facet importance from a pre-trained embedding of the
category names. In particular, we propose an adaptive similarity measure,
relying on predicted facet importance weights for a given set of categories.
This measure can be used in combination with a wide array of existing
metric-based methods. Experiments on miniImageNet and CUB show that our
approach improves the state-of-the-art in metric-based FSL.
Related papers
- Learning to Adapt Category Consistent Meta-Feature of CLIP for Few-Shot Classification [1.6385815610837167]
Recent CLIP-based methods have shown promising zero-shot and few-shot performance on image classification tasks.
We propose the Meta-Feature Adaption method (MF-Adapter) that combines the complementary strengths of both LRs and high-level semantic representations.
Our proposed method is superior to the state-of-the-art CLIP downstream few-shot classification methods, even showing stronger performance on a set of challenging visual classification tasks.
arXiv Detail & Related papers (2024-07-08T06:18:04Z) - Self-similarity Driven Scale-invariant Learning for Weakly Supervised
Person Search [66.95134080902717]
We propose a novel one-step framework, named Self-similarity driven Scale-invariant Learning (SSL)
We introduce a Multi-scale Exemplar Branch to guide the network in concentrating on the foreground and learning scale-invariant features.
Experiments on PRW and CUHK-SYSU databases demonstrate the effectiveness of our method.
arXiv Detail & Related papers (2023-02-25T04:48:11Z) - Semantic Cross Attention for Few-shot Learning [9.529264466445236]
We propose a multi-task learning approach to view semantic features of label text as an auxiliary task.
Our proposed model uses word-embedding representations as semantic features to help train the embedding network and a semantic cross-attention module to bridge the semantic features into the typical visual modal.
arXiv Detail & Related papers (2022-10-12T15:24:59Z) - Not All Instances Contribute Equally: Instance-adaptive Class
Representation Learning for Few-Shot Visual Recognition [94.04041301504567]
Few-shot visual recognition refers to recognize novel visual concepts from a few labeled instances.
We propose a novel metric-based meta-learning framework termed instance-adaptive class representation learning network (ICRL-Net) for few-shot visual recognition.
arXiv Detail & Related papers (2022-09-07T10:00:18Z) - Learning Primitive-aware Discriminative Representations for Few-shot
Learning [28.17404445820028]
Few-shot learning aims to learn a classifier that can be easily adapted to recognize novel classes with only a few labeled examples.
We propose a Primitive Mining and Reasoning Network (PMRN) to learn primitive-aware representations.
Our method achieves state-of-the-art results on six standard benchmarks.
arXiv Detail & Related papers (2022-08-20T16:22:22Z) - Semantic Representation and Dependency Learning for Multi-Label Image
Recognition [76.52120002993728]
We propose a novel and effective semantic representation and dependency learning (SRDL) framework to learn category-specific semantic representation for each category.
Specifically, we design a category-specific attentional regions (CAR) module to generate channel/spatial-wise attention matrices to guide model.
We also design an object erasing (OE) module to implicitly learn semantic dependency among categories by erasing semantic-aware regions.
arXiv Detail & Related papers (2022-04-08T00:55:15Z) - Discovering Relationships between Object Categories via Universal
Canonical Maps [80.07703460198198]
We tackle the problem of learning the geometry of multiple categories of deformable objects jointly.
Recent work has shown that it is possible to learn a unified dense pose predictor for several categories of related objects.
We show that improved correspondences can be learned automatically as a natural byproduct of learning category-specific dense pose predictors.
arXiv Detail & Related papers (2021-06-17T18:38:18Z) - Progressive Co-Attention Network for Fine-grained Visual Classification [20.838908090777885]
Fine-grained visual classification aims to recognize images belonging to multiple sub-categories within a same category.
Most existing methods only take individual image as input.
We propose an effective method called progressive co-attention network (PCA-Net) to tackle this problem.
arXiv Detail & Related papers (2021-01-21T10:19:02Z) - Match Them Up: Visually Explainable Few-shot Image Classification [27.867833878756553]
Few-shot learning is usually based on an assumption that the pre-trained knowledge can be obtained from base (seen) categories and can be well transferred to novel (unseen) categories.
In this paper, we reveal a new way to perform FSL for image classification, using visual representations from the backbone model and weights generated by a newly-emerged explainable classifier.
Experimental results prove that the proposed method can achieve both good accuracy and satisfactory explainability on three mainstream datasets.
arXiv Detail & Related papers (2020-11-25T05:47:35Z) - Background Splitting: Finding Rare Classes in a Sea of Background [55.03789745276442]
We focus on the real-world problem of training accurate deep models for image classification of a small number of rare categories.
In these scenarios, almost all images belong to the background category in the dataset (>95% of the dataset is background)
We demonstrate that both standard fine-tuning approaches and state-of-the-art approaches for training on imbalanced datasets do not produce accurate deep models in the presence of this extreme imbalance.
arXiv Detail & Related papers (2020-08-28T23:05:15Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.