Bi-directional Feature Reconstruction Network for Fine-Grained Few-Shot
Image Classification
- URL: http://arxiv.org/abs/2211.17161v1
- Date: Wed, 30 Nov 2022 16:55:14 GMT
- Title: Bi-directional Feature Reconstruction Network for Fine-Grained Few-Shot
Image Classification
- Authors: Jijie Wu, Dongliang Chang, Aneeshan Sain, Xiaoxu Li, Zhanyu Ma, Jie
Cao, Jun Guo, Yi-Zhe Song
- Abstract summary: We introduce a bi-reconstruction mechanism that can simultaneously accommodate for inter-class and intra-class variations.
This design effectively helps the model to explore more subtle and discriminative features.
Experimental results on three widely used fine-grained image classification datasets consistently show considerable improvements.
- Score: 61.411869453639845
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: The main challenge for fine-grained few-shot image classification is to learn
feature representations with higher inter-class and lower intra-class
variations, with a mere few labelled samples. Conventional few-shot learning
methods however cannot be naively adopted for this fine-grained setting -- a
quick pilot study reveals that they in fact push for the opposite (i.e., lower
inter-class variations and higher intra-class variations). To alleviate this
problem, prior works predominately use a support set to reconstruct the query
image and then utilize metric learning to determine its category. Upon careful
inspection, we further reveal that such unidirectional reconstruction methods
only help to increase inter-class variations and are not effective in tackling
intra-class variations. In this paper, we for the first time introduce a
bi-reconstruction mechanism that can simultaneously accommodate for inter-class
and intra-class variations. In addition to using the support set to reconstruct
the query set for increasing inter-class variations, we further use the query
set to reconstruct the support set for reducing intra-class variations. This
design effectively helps the model to explore more subtle and discriminative
features which is key for the fine-grained problem in hand. Furthermore, we
also construct a self-reconstruction module to work alongside the
bi-directional module to make the features even more discriminative.
Experimental results on three widely used fine-grained image classification
datasets consistently show considerable improvements compared with other
methods. Codes are available at: https://github.com/PRIS-CV/Bi-FRN.
Related papers
- PatchMix Augmentation to Identify Causal Features in Few-shot Learning [55.64873998196191]
Few-shot learning aims to transfer knowledge learned from base with sufficient categories labelled data to novel categories with scarce known information.
We propose a novel data augmentation strategy dubbed as PatchMix that can break this spurious dependency.
We show that such an augmentation mechanism, different from existing ones, is able to identify the causal features.
arXiv Detail & Related papers (2022-11-29T08:41:29Z) - CAD: Co-Adapting Discriminative Features for Improved Few-Shot
Classification [11.894289991529496]
Few-shot classification is a challenging problem that aims to learn a model that can adapt to unseen classes given a few labeled samples.
Recent approaches pre-train a feature extractor, and then fine-tune for episodic meta-learning.
We propose a strategy to cross-attend and re-weight discriminative features for few-shot classification.
arXiv Detail & Related papers (2022-03-25T06:14:51Z) - Learning What Not to Segment: A New Perspective on Few-Shot Segmentation [63.910211095033596]
Recently few-shot segmentation (FSS) has been extensively developed.
This paper proposes a fresh and straightforward insight to alleviate the problem.
In light of the unique nature of the proposed approach, we also extend it to a more realistic but challenging setting.
arXiv Detail & Related papers (2022-03-15T03:08:27Z) - AF$_2$: Adaptive Focus Framework for Aerial Imagery Segmentation [86.44683367028914]
Aerial imagery segmentation has some unique challenges, the most critical one among which lies in foreground-background imbalance.
We propose Adaptive Focus Framework (AF$), which adopts a hierarchical segmentation procedure and focuses on adaptively utilizing multi-scale representations.
AF$ has significantly improved the accuracy on three widely used aerial benchmarks, as fast as the mainstream method.
arXiv Detail & Related papers (2022-02-18T10:14:45Z) - Dual Path Structural Contrastive Embeddings for Learning Novel Objects [6.979491536753043]
Recent research shows that gaining information on a good feature space can be an effective solution to achieve favorable performance on few-shot tasks.
We propose a simple but effective paradigm that decouples the tasks of learning feature representations and classifiers.
Our method can still achieve promising results for both standard and generalized few-shot problems in either an inductive or transductive inference setting.
arXiv Detail & Related papers (2021-12-23T04:43:31Z) - Few-shot learning with improved local representations via bias rectify
module [13.230636224045137]
We propose a Deep Bias Rectify Network (DBRN) to fully exploit the spatial information that exists in the structure of the feature representations.
bias rectify module is able to focus on the features that are more discriminative for classification by given different weights.
To make full use of the training data, we design a prototype augment mechanism that can make the prototypes generated from the support set to be more representative.
arXiv Detail & Related papers (2021-11-01T08:08:00Z) - Ortho-Shot: Low Displacement Rank Regularization with Data Augmentation
for Few-Shot Learning [23.465747123791772]
In few-shot classification, the primary goal is to learn representations that generalize well for novel classes.
We propose an efficient low displacement rank (LDR) regularization strategy termed Ortho-Shot.
arXiv Detail & Related papers (2021-10-18T14:58:36Z) - A Little Robustness Goes a Long Way: Leveraging Universal Features for
Targeted Transfer Attacks [4.511923587827301]
We show that training the source classifier to be "slightly robust" substantially improves the transferability of targeted attacks.
We argue that this result supports a non-versa hypothesis.
arXiv Detail & Related papers (2021-06-03T19:53:46Z) - Revisiting Deep Local Descriptor for Improved Few-Shot Classification [56.74552164206737]
We show how one can improve the quality of embeddings by leveraging textbfDense textbfClassification and textbfAttentive textbfPooling.
We suggest to pool feature maps by applying attentive pooling instead of the widely used global average pooling (GAP) to prepare embeddings for few-shot classification.
arXiv Detail & Related papers (2021-03-30T00:48:28Z) - Attention Model Enhanced Network for Classification of Breast Cancer
Image [54.83246945407568]
AMEN is formulated in a multi-branch fashion with pixel-wised attention model and classification submodular.
To focus more on subtle detail information, the sample image is enhanced by the pixel-wised attention map generated from former branch.
Experiments conducted on three benchmark datasets demonstrate the superiority of the proposed method under various scenarios.
arXiv Detail & Related papers (2020-10-07T08:44:21Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.