Adversarial Feature Hallucination Networks for Few-Shot Learning
- URL: http://arxiv.org/abs/2003.13193v2
- Date: Tue, 27 Oct 2020 19:16:50 GMT
- Title: Adversarial Feature Hallucination Networks for Few-Shot Learning
- Authors: Kai Li, Yulun Zhang, Kunpeng Li, Yun Fu
- Abstract summary: Adversarial Feature Hallucination Networks (AFHN) is based on conditional Wasserstein Generative Adversarial networks (cWGAN)
Two novel regularizers are incorporated into AFHN to encourage discriminability and diversity of the synthesized features.
- Score: 84.31660118264514
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The recent flourish of deep learning in various tasks is largely accredited
to the rich and accessible labeled data. Nonetheless, massive supervision
remains a luxury for many real applications, boosting great interest in
label-scarce techniques such as few-shot learning (FSL), which aims to learn
concept of new classes with a few labeled samples. A natural approach to FSL is
data augmentation and many recent works have proved the feasibility by
proposing various data synthesis models. However, these models fail to well
secure the discriminability and diversity of the synthesized data and thus
often produce undesirable results. In this paper, we propose Adversarial
Feature Hallucination Networks (AFHN) which is based on conditional Wasserstein
Generative Adversarial networks (cWGAN) and hallucinates diverse and
discriminative features conditioned on the few labeled samples. Two novel
regularizers, i.e., the classification regularizer and the anti-collapse
regularizer, are incorporated into AFHN to encourage discriminability and
diversity of the synthesized features, respectively. Ablation study verifies
the effectiveness of the proposed cWGAN based feature hallucination framework
and the proposed regularizers. Comparative results on three common benchmark
datasets substantiate the superiority of AFHN to existing data augmentation
based FSL approaches and other state-of-the-art ones.
Related papers
- SAU: A Dual-Branch Network to Enhance Long-Tailed Recognition via Generative Models [9.340077455871736]
Long-tailed distributions in image recognition pose a considerable challenge due to the severe imbalance between a few dominant classes.
Recently, the use of large generative models to create synthetic data for image classification has been realized.
We propose the use of synthetic data as a complement to long-tailed datasets to eliminate the impact of data imbalance.
arXiv Detail & Related papers (2024-08-29T05:33:59Z) - Exploring Data Efficiency in Zero-Shot Learning with Diffusion Models [38.36200871549062]
Zero-Shot Learning (ZSL) aims to enable classifiers to identify unseen classes by enhancing data efficiency at the class level.
This is achieved by generating image features from pre-defined semantics of unseen classes.
In this paper, we demonstrate that limited seen examples generally result in deteriorated performance of generative models.
This unified framework incorporates diffusion models to improve data efficiency at both the class and instance levels.
arXiv Detail & Related papers (2024-06-05T04:37:06Z) - Continual Learning with Pre-Trained Models: A Survey [61.97613090666247]
Continual Learning aims to overcome the catastrophic forgetting of former knowledge when learning new ones.
This paper presents a comprehensive survey of the latest advancements in PTM-based CL.
arXiv Detail & Related papers (2024-01-29T18:27:52Z) - Dual-View Data Hallucination with Semantic Relation Guidance for Few-Shot Image Recognition [49.26065739704278]
We propose a framework that exploits semantic relations to guide dual-view data hallucination for few-shot image recognition.
An instance-view data hallucination module hallucinates each sample of a novel class to generate new data.
A prototype-view data hallucination module exploits semantic-aware measure to estimate the prototype of a novel class.
arXiv Detail & Related papers (2024-01-13T12:32:29Z) - Detail Reinforcement Diffusion Model: Augmentation Fine-Grained Visual Categorization in Few-Shot Conditions [11.121652649243119]
Diffusion models have been widely adopted in data augmentation due to their outstanding diversity in data generation.
We propose a novel approach termed the detail reinforcement diffusion model(DRDM)
It leverages the rich knowledge of large models for fine-grained data augmentation and comprises two key components including discriminative semantic recombination (DSR) and spatial knowledge reference(SKR)
arXiv Detail & Related papers (2023-09-15T01:28:59Z) - No Fear of Heterogeneity: Classifier Calibration for Federated Learning
with Non-IID Data [78.69828864672978]
A central challenge in training classification models in the real-world federated system is learning with non-IID data.
We propose a novel and simple algorithm called Virtual Representations (CCVR), which adjusts the classifier using virtual representations sampled from an approximated ssian mixture model.
Experimental results demonstrate that CCVR state-of-the-art performance on popular federated learning benchmarks including CIFAR-10, CIFAR-100, and CINIC-10.
arXiv Detail & Related papers (2021-06-09T12:02:29Z) - DFS: A Diverse Feature Synthesis Model for Generalized Zero-Shot
Learning [12.856168667514947]
Generative based strategy has shown great potential in the Generalized Zero-Shot Learning task.
Generative based strategy has shown great potential in the Generalized Zero-Shot Learning task.
We propose to enhance the generalizability of GZSL models via improving feature diversity of unseen classes.
arXiv Detail & Related papers (2021-03-19T12:24:42Z) - Exploring Complementary Strengths of Invariant and Equivariant
Representations for Few-Shot Learning [96.75889543560497]
In many real-world problems, collecting a large number of labeled samples is infeasible.
Few-shot learning is the dominant approach to address this issue, where the objective is to quickly adapt to novel categories in presence of a limited number of samples.
We propose a novel training mechanism that simultaneously enforces equivariance and invariance to a general set of geometric transformations.
arXiv Detail & Related papers (2021-03-01T21:14:33Z) - Few-Shot Named Entity Recognition: A Comprehensive Study [92.40991050806544]
We investigate three schemes to improve the model generalization ability for few-shot settings.
We perform empirical comparisons on 10 public NER datasets with various proportions of labeled data.
We create new state-of-the-art results on both few-shot and training-free settings.
arXiv Detail & Related papers (2020-12-29T23:43:16Z) - Improving Deep Hyperspectral Image Classification Performance with
Spectral Unmixing [3.84448093764973]
We propose an abundance-based multi-HSI classification method.
We convert every HSI from the spectral domain to the abundance domain by a dataset-specific autoencoder.
Secondly, the abundance representations from multiple HSIs are collected to form an enlarged dataset.
arXiv Detail & Related papers (2020-04-01T17:14:05Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.