Mutual-Information Based Few-Shot Classification
- URL: http://arxiv.org/abs/2106.12252v1
- Date: Wed, 23 Jun 2021 09:17:23 GMT
- Title: Mutual-Information Based Few-Shot Classification
- Authors: Malik Boudiaf, Ziko Imtiaz Masud, Jérôme Rony, Jose Dolz, Ismail
Ben Ayed, Pablo Piantanida
- Abstract summary: We introduce Transductive Information Maximization (TIM) for few-shot learning.
Our method maximizes the mutual information between the query features and their label predictions for a given few-shot task.
We propose a new alternating-direction solver, which speeds up transductive inference over gradient-based optimization.
- Score: 34.95314059362982
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: We introduce Transductive Information Maximization (TIM) for few-shot
learning. Our method maximizes the mutual information between the query
features and their label predictions for a given few-shot task, in conjunction
with a supervision loss based on the support set. We motivate our transductive
loss by deriving a formal relation between the classification accuracy and
mutual-information maximization. Furthermore, we propose a new
alternating-direction solver, which substantially speeds up transductive
inference over gradient-based optimization, while yielding competitive
accuracy. We also provide a convergence analysis of our solver based on
Zangwill's theory and bound-optimization arguments. TIM inference is modular:
it can be used on top of any base-training feature extractor. Following
standard transductive few-shot settings, our comprehensive experiments
demonstrate that TIM outperforms state-of-the-art methods significantly across
various datasets and networks, even when used on top of a fixed feature extractor
trained with simple cross-entropy on the base classes, without resorting to
complex meta-learning schemes. It consistently brings between 2% and 5%
improvement in accuracy over the best-performing method, not only on all the
well-established few-shot benchmarks but also on more challenging scenarios,
with random tasks, domain shift and larger numbers of classes, as in the
recently introduced META-DATASET. Our code is publicly available at
https://github.com/mboudiaf/TIM. We also publicly release a standalone PyTorch
implementation of META-DATASET, along with additional benchmarking results, at
https://github.com/mboudiaf/pytorch-meta-dataset.
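Since the released code is PyTorch, the transductive objective is easy to sketch: a cross-entropy on the labelled support set, minus a weighted empirical mutual information over the query predictions, where the mutual information splits into a marginal-entropy term (encouraging balanced class usage across queries) and a conditional-entropy term (encouraging confident per-query predictions). The snippet below is a minimal illustration, not the authors' implementation: the function name `tim_loss`, the trade-off weight `lam`, and the clamping constant `eps` are assumptions made here for readability; see the linked repository for the real code.

```python
import torch
import torch.nn.functional as F

def tim_loss(support_logits: torch.Tensor,
             support_labels: torch.Tensor,
             query_logits: torch.Tensor,
             lam: float = 0.1,
             eps: float = 1e-12) -> torch.Tensor:
    """Sketch of a TIM-style objective: support cross-entropy minus
    a weighted empirical mutual information over the query set."""
    # Supervised term on the labelled support set.
    ce = F.cross_entropy(support_logits, support_labels)

    # Class posteriors for the unlabelled queries: (n_query, n_classes).
    probs = query_logits.softmax(dim=-1)

    # Conditional entropy H(Y|X): mean per-query prediction entropy.
    cond_ent = -(probs * (probs + eps).log()).sum(dim=-1).mean()

    # Marginal entropy H(Y): entropy of the average query prediction.
    marginal = probs.mean(dim=0)
    marg_ent = -(marginal * (marginal + eps).log()).sum()

    # Maximizing MI = H(Y) - H(Y|X) is done by minimizing its negation.
    return ce - lam * (marg_ent - cond_ent)
```

In the paper, an objective of this kind is minimized at inference time over a lightweight classifier on top of frozen base-trained features, either by gradient descent or by the proposed alternating-direction solver; the sketch above covers only the loss itself.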
Related papers
- SMaRt: Improving GANs with Score Matching Regularity [94.81046452865583]
Generative adversarial networks (GANs) usually struggle to learn from highly diverse data, whose underlying manifold is complex.
We show that score matching serves as a promising solution to this issue thanks to its capability of persistently pushing the generated data points towards the real data manifold.
We propose to improve the optimization of GANs with score matching regularity (SMaRt).
arXiv Detail & Related papers (2023-11-30T03:05:14Z)
- Okapi: Generalising Better by Making Statistical Matches Match [7.392460712829188]
Okapi is a simple, efficient, and general method for robust semi-supervised learning based on online statistical matching.
Our method uses a nearest-neighbours-based matching procedure to generate cross-domain views for a consistency loss.
We show that it is in fact possible to leverage additional unlabelled data to improve upon empirical risk minimisation.
arXiv Detail & Related papers (2022-11-07T12:41:17Z)
- Towards Accurate Knowledge Transfer via Target-awareness Representation Disentanglement [56.40587594647692]
We propose a novel transfer learning algorithm, introducing the idea of Target-awareness REpresentation Disentanglement (TRED).
TRED disentangles the knowledge relevant to the target task from the original source model and uses it as a regularizer when fine-tuning the target model.
Experiments on various real-world datasets show that our method consistently improves standard fine-tuning by more than 2% on average.
arXiv Detail & Related papers (2020-10-16T17:45:08Z)
- Fast Few-Shot Classification by Few-Iteration Meta-Learning [173.32497326674775]
We introduce a fast optimization-based meta-learning method for few-shot classification.
Our strategy enables important aspects of the base learner objective to be learned during meta-training.
We perform a comprehensive experimental analysis, demonstrating the speed and effectiveness of our approach.
arXiv Detail & Related papers (2020-10-01T15:59:31Z)
- Revisiting LSTM Networks for Semi-Supervised Text Classification via Mixed Objective Function [106.69643619725652]
We develop a training strategy that allows even a simple BiLSTM model, when trained with cross-entropy loss, to achieve competitive results.
We report state-of-the-art results for text classification on several benchmark datasets.
arXiv Detail & Related papers (2020-09-08T21:55:22Z)
- Transductive Information Maximization For Few-Shot Learning [41.461586994394565]
We introduce Transductive Information Maximization (TIM) for few-shot learning.
Our method maximizes the mutual information between the query features and their label predictions for a given few-shot task.
We propose a new alternating-direction solver for our mutual-information loss.
arXiv Detail & Related papers (2020-08-25T22:38:41Z)
- Laplacian Regularized Few-Shot Learning [35.381119443377195]
We propose a transductive Laplacian-regularized inference for few-shot tasks.
Our inference does not re-train the base model, and can be viewed as a graph clustering of the query set.
Our LaplacianShot consistently outperforms state-of-the-art methods by significant margins across different models.
arXiv Detail & Related papers (2020-06-28T02:17:52Z)
- Meta-Learned Confidence for Few-shot Learning [60.6086305523402]
A popular transductive inference technique for few-shot metric-based approaches is to update the prototype of each class with the mean of the most confident query examples.
We propose to meta-learn the confidence for each query sample, to assign optimal weights to unlabeled queries.
We validate our few-shot learning model with meta-learned confidence on four benchmark datasets.
arXiv Detail & Related papers (2020-02-27T10:22:17Z)
This list is automatically generated from the titles and abstracts of the papers on this site.