Disentangled Feature Representation for Few-shot Image Classification
- URL: http://arxiv.org/abs/2109.12548v1
- Date: Sun, 26 Sep 2021 09:53:11 GMT
- Title: Disentangled Feature Representation for Few-shot Image Classification
- Authors: Hao Cheng, Yufei Wang, Haoliang Li, Alex C. Kot, Bihan Wen
- Abstract summary: We propose a novel Disentangled Feature Representation framework, dubbed DFR, for few-shot learning applications.
DFR can adaptively decouple the discriminative features that are modeled by the classification branch, from the class-irrelevant component of the variation branch.
In general, most of the popular deep few-shot learning methods can be plugged in as the classification branch, thus DFR can boost their performance on various few-shot tasks.
- Score: 64.40410801469106
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Learning the generalizable feature representation is critical for few-shot
image classification. While recent works exploited task-specific feature
embedding using meta-tasks for few-shot learning, they are limited in many
challenging tasks as being distracted by the excursive features such as the
background, domain and style of the image samples. In this work, we propose a
novel Disentangled Feature Representation framework, dubbed DFR, for few-shot
learning applications. DFR can adaptively decouple the discriminative features
that are modeled by the classification branch, from the class-irrelevant
component of the variation branch. In general, most of the popular deep
few-shot learning methods can be plugged in as the classification branch, thus
DFR can boost their performance on various few-shot tasks. Furthermore, we
propose a novel FS-DomainNet dataset based on DomainNet, for benchmarking the
few-shot domain generalization tasks. We conducted extensive experiments to
evaluate the proposed DFR on general and fine-grained few-shot classification,
as well as few-shot domain generalization, using the corresponding four
benchmarks, i.e., mini-ImageNet, tiered-ImageNet, CUB, as well as the proposed
FS-DomainNet. Thanks to the effective feature disentangling, the DFR-based
few-shot classifiers achieved the state-of-the-art results on all datasets.
Related papers
- Fine-grained Recognition with Learnable Semantic Data Augmentation [68.48892326854494]
Fine-grained image recognition is a longstanding computer vision challenge.
We propose diversifying the training data at the feature-level to alleviate the discriminative region loss problem.
Our method significantly improves the generalization performance on several popular classification networks.
arXiv Detail & Related papers (2023-09-01T11:15:50Z) - AF$_2$: Adaptive Focus Framework for Aerial Imagery Segmentation [86.44683367028914]
Aerial imagery segmentation has some unique challenges, the most critical one among which lies in foreground-background imbalance.
We propose Adaptive Focus Framework (AF$), which adopts a hierarchical segmentation procedure and focuses on adaptively utilizing multi-scale representations.
AF$ has significantly improved the accuracy on three widely used aerial benchmarks, as fast as the mainstream method.
arXiv Detail & Related papers (2022-02-18T10:14:45Z) - Few-shot Unsupervised Domain Adaptation with Image-to-class Sparse
Similarity Encoding [24.64900089320843]
This paper investigates a valuable setting called few-shot unsupervised domain adaptation (FS-UDA)
In this setting, the source domain data are labelled, but with few-shot per category, while the target domain data are unlabelled.
We develop a general UDA model to solve the few-shot labeled data per category and the domain adaptation between support and query sets.
arXiv Detail & Related papers (2021-08-06T06:15:02Z) - External-Memory Networks for Low-Shot Learning of Targets in
Forward-Looking-Sonar Imagery [8.767175335575386]
We propose a memory-based framework for real-time, data-efficient target analysis in forward-looking-sonar (FLS) imagery.
Our framework relies on first removing non-discriminative details from the imagery using a small-scale DenseNet-inspired network.
We then cascade the filtered imagery into a novel NeuralRAM-based convolutional matching network, NRMN, for low-shot target recognition.
arXiv Detail & Related papers (2021-07-22T07:50:44Z) - Region Comparison Network for Interpretable Few-shot Image
Classification [97.97902360117368]
Few-shot image classification has been proposed to effectively use only a limited number of labeled examples to train models for new classes.
We propose a metric learning based method named Region Comparison Network (RCN), which is able to reveal how few-shot learning works.
We also present a new way to generalize the interpretability from the level of tasks to categories.
arXiv Detail & Related papers (2020-09-08T07:29:05Z) - Group Based Deep Shared Feature Learning for Fine-grained Image
Classification [31.84610555517329]
We present a new deep network architecture that explicitly models shared features and removes their effect to achieve enhanced classification results.
We call this framework Group based deep Shared Feature Learning (GSFL) and the resulting learned network as GSFL-Net.
A key benefit of our specialized autoencoder is that it is versatile and can be combined with state-of-the-art fine-grained feature extraction models and trained together with them to improve their performance directly.
arXiv Detail & Related papers (2020-04-04T00:01:11Z) - CRNet: Cross-Reference Networks for Few-Shot Segmentation [59.85183776573642]
Few-shot segmentation aims to learn a segmentation model that can be generalized to novel classes with only a few training images.
With a cross-reference mechanism, our network can better find the co-occurrent objects in the two images.
Experiments on the PASCAL VOC 2012 dataset show that our network achieves state-of-the-art performance.
arXiv Detail & Related papers (2020-03-24T04:55:43Z) - Weakly-supervised Object Localization for Few-shot Learning and
Fine-grained Few-shot Learning [0.5156484100374058]
Few-shot learning aims to learn novel visual categories from very few samples.
We propose a Self-Attention Based Complementary Module (SAC Module) to fulfill the weakly-supervised object localization.
We also produce the activated masks for selecting discriminative deep descriptors for few-shot classification.
arXiv Detail & Related papers (2020-03-02T14:07:05Z) - Cross-Domain Few-Shot Classification via Learned Feature-Wise
Transformation [109.89213619785676]
Few-shot classification aims to recognize novel categories with only few labeled images in each class.
Existing metric-based few-shot classification algorithms predict categories by comparing the feature embeddings of query images with those from a few labeled images.
While promising performance has been demonstrated, these methods often fail to generalize to unseen domains.
arXiv Detail & Related papers (2020-01-23T18:55:43Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.