Constrained Few-shot Class-incremental Learning
- URL: http://arxiv.org/abs/2203.16588v1
- Date: Wed, 30 Mar 2022 18:19:36 GMT
- Title: Constrained Few-shot Class-incremental Learning
- Authors: Michael Hersche, Geethan Karunaratne, Giovanni Cherubini, Luca Benini,
Abu Sebastian, Abbas Rahimi
- Abstract summary: Continually learning new classes from fresh data without forgetting previous knowledge of old classes is a very challenging research problem.
We propose C-FSCIL, which is architecturally composed of a frozen meta-learned feature extractor, a trainable fixed-size fully connected layer, and a rewritable dynamically growing memory.
C-FSCIL provides three update modes that offer a trade-off between accuracy and compute-memory cost of learning novel classes.
- Score: 14.646083882851928
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Continually learning new classes from fresh data without forgetting previous
knowledge of old classes is a very challenging research problem. Moreover, it
is imperative that such learning must respect certain memory and computational
constraints such as (i) training samples are limited to only a few per class,
(ii) the computational cost of learning a novel class remains constant, and
(iii) the memory footprint of the model grows at most linearly with the number
of classes observed. To meet the above constraints, we propose C-FSCIL, which
is architecturally composed of a frozen meta-learned feature extractor, a
trainable fixed-size fully connected layer, and a rewritable dynamically
growing memory that stores as many vectors as the number of encountered
classes. C-FSCIL provides three update modes that offer a trade-off between
accuracy and compute-memory cost of learning novel classes. C-FSCIL exploits
hyperdimensional embedding, which makes it possible to continually express many
more classes than the fixed dimensions of the vector space with minimal
interference. The
quality of class vector representations is further improved by aligning them
quasi-orthogonally to each other by means of novel loss functions. Experiments
on the CIFAR100, miniImageNet, and Omniglot datasets show that C-FSCIL
outperforms the baselines with remarkable accuracy and compression. It also
scales up to the largest problem size ever tried in this few-shot setting by
learning 423 novel classes on top of 1200 base classes with less than 1.6%
accuracy drop. Our code is available at
https://github.com/IBM/constrained-FSCIL.
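The following is a minimal sketch of the pipeline the abstract describes: a frozen meta-learned feature extractor, a trainable fixed-size fully connected projection into a hyperdimensional space, and a rewritable memory that grows by one prototype vector per encountered class. It assumes a PyTorch-style implementation; the class name, dimensions, and cosine-similarity readout are illustrative assumptions rather than the authors' code, which is available at the linked repository.
```python
# Hypothetical sketch in the spirit of C-FSCIL (not the authors' implementation).
import torch
import torch.nn as nn
import torch.nn.functional as F


class PrototypeMemoryClassifier(nn.Module):
    """Nearest-prototype classifier over a growing hyperdimensional memory."""

    def __init__(self, backbone: nn.Module, feat_dim: int, hd_dim: int = 512):
        super().__init__()
        self.backbone = backbone  # frozen meta-learned feature extractor
        for p in self.backbone.parameters():
            p.requires_grad = False
        self.project = nn.Linear(feat_dim, hd_dim)  # trainable fixed-size FC layer
        # Rewritable memory: one vector per class seen so far, so the memory
        # footprint grows at most linearly with the number of classes.
        self.register_buffer("prototypes", torch.empty(0, hd_dim))

    def embed(self, x: torch.Tensor) -> torch.Tensor:
        with torch.no_grad():
            feats = self.backbone(x)
        return F.normalize(self.project(feats), dim=-1)

    @torch.no_grad()
    def add_classes(self, support_x: torch.Tensor, support_y: torch.Tensor) -> None:
        """Few-shot update: append one averaged prototype per novel class.

        The cost of adding a class depends only on its few support samples,
        not on how many classes were learned before.
        """
        emb = self.embed(support_x)
        for c in support_y.unique(sorted=True):
            proto = emb[support_y == c].mean(dim=0, keepdim=True)
            self.prototypes = torch.cat(
                [self.prototypes, F.normalize(proto, dim=-1)], dim=0
            )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # Cosine similarity of the query embedding against all class prototypes.
        return self.embed(x) @ self.prototypes.t()
```
In this sketch, add_classes corresponds roughly to the cheapest of the three update modes (prototype averaging); the costlier modes mentioned in the abstract would presumably refine the stored vectors further, e.g. by pushing them toward quasi-orthogonality with the novel loss functions. Because nearly orthogonal unit vectors are abundant in high-dimensional spaces, such a memory can hold many more class vectors than hd_dim dimensions with minimal interference.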
Related papers
- 12 mJ per Class On-Device Online Few-Shot Class-Incremental Learning [12.768324871562891]
Few-Shot Class-Incremental Learning (FSCIL) enables machine learning systems to expand their inference capabilities to new classes using only a few labeled examples.
O-FSCIL obtains an average accuracy of 68.62% on the FSCIL CIFAR100 benchmark, achieving state-of-the-art results.
Tailored for ultra-low-power platforms, we implement O-FSCIL on the 60 mW GAP9 microcontroller, demonstrating online learning capabilities within just 12 mJ per new class.
arXiv Detail & Related papers (2024-03-12T17:43:20Z)
- CEAT: Continual Expansion and Absorption Transformer for Non-Exemplar Class-Incremental Learning [34.59310641291726]
In real-world applications, dynamic scenarios require models to possess the capability to learn new tasks continuously without forgetting old knowledge.
We propose a new architecture, named the Continual Expansion and Absorption Transformer (CEAT).
The model learns novel knowledge by extending expanded-fusion layers in parallel with the frozen previous parameters.
To improve the learning ability of the model, we design a novel prototype contrastive loss to reduce the overlap between old and new classes in the feature space.
arXiv Detail & Related papers (2024-03-11T12:40:12Z)
- Complementary Learning Subnetworks for Parameter-Efficient Class-Incremental Learning [40.13416912075668]
We propose a rehearsal-free CIL approach that learns continually via the synergy between two Complementary Learning Subnetworks.
Our method achieves competitive results against state-of-the-art methods, especially in accuracy gain, memory cost, training efficiency, and task-order.
arXiv Detail & Related papers (2023-06-21T01:43:25Z)
- Class-Incremental Learning: A Survey [84.30083092434938]
Class-Incremental Learning (CIL) enables the learner to incorporate the knowledge of new classes incrementally.
CIL tends to catastrophically forget the characteristics of former classes, and its performance drastically degrades.
We provide a rigorous and unified evaluation of 17 methods on benchmark image classification tasks to identify the characteristics of different algorithms.
arXiv Detail & Related papers (2023-02-07T17:59:05Z)
- Neural Collapse Inspired Feature-Classifier Alignment for Few-Shot Class Incremental Learning [120.53458753007851]
Few-shot class-incremental learning (FSCIL) has been a challenging problem, as only a few training samples are accessible for each novel class in the new sessions.
We deal with this misalignment dilemma in FSCIL inspired by the recently discovered phenomenon named neural collapse.
We propose a neural collapse inspired framework for FSCIL. Experiments on the miniImageNet, CUB-200, and CIFAR-100 datasets demonstrate that our proposed framework outperforms state-of-the-art methods.
arXiv Detail & Related papers (2023-02-06T18:39:40Z)
- Few-Shot Class-Incremental Learning by Sampling Multi-Phase Tasks [59.12108527904171]
A model should recognize new classes and maintain discriminability over old classes.
The task of recognizing few-shot new classes without forgetting old classes is called few-shot class-incremental learning (FSCIL).
We propose a new paradigm for FSCIL based on meta-learning by LearnIng Multi-phase Incremental Tasks (LIMIT).
arXiv Detail & Related papers (2022-03-31T13:46:41Z)
- Self-Supervised Class Incremental Learning [51.62542103481908]
Existing Class Incremental Learning (CIL) methods are based on a supervised classification framework that is sensitive to data labels.
When updating them based on new class data, they suffer from catastrophic forgetting: the model cannot clearly discern old class data from the new.
In this paper, we explore the performance of Self-Supervised representation learning in Class Incremental Learning (SSCIL) for the first time.
arXiv Detail & Related papers (2021-11-18T06:58:19Z)
- Improving Calibration for Long-Tailed Recognition [68.32848696795519]
We propose two methods to improve calibration and performance in such scenarios.
For dataset bias due to different samplers, we propose shifted batch normalization.
Our proposed methods set new records on multiple popular long-tailed recognition benchmark datasets.
arXiv Detail & Related papers (2021-04-01T13:55:21Z)
- Learning Adaptive Embedding Considering Incremental Class [55.21855842960139]
Class-Incremental Learning (CIL) aims to train a reliable model with streaming data in which unknown classes emerge sequentially.
Unlike traditional closed-set learning, CIL has two main challenges: 1) novel class detection.
After novel classes are detected, the model needs to be updated without re-training on the entire previous data.
arXiv Detail & Related papers (2020-08-31T04:11:24Z)