Spatial Transformer Networks for Curriculum Learning
- URL: http://arxiv.org/abs/2108.09696v1
- Date: Sun, 22 Aug 2021 12:04:55 GMT
- Title: Spatial Transformer Networks for Curriculum Learning
- Authors: Fatemeh Azimi, Jean-Francois Jacques Nicolas Nies, Sebastian Palacio, Federico Raue, Jörn Hees, Andreas Dengel
- Abstract summary: We take inspiration from Spatial Transformer Networks (STNs) in order to form an easy-to-hard curriculum.
We perform various experiments on cluttered MNIST and Fashion-MNIST datasets.
- Score: 6.107943372244105
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Curriculum learning is a bio-inspired training technique widely
adopted in machine learning to improve the optimization of neural networks,
in terms of both convergence rate and final accuracy. The main
concept in curriculum learning is to start the training with simpler tasks and
gradually increase the level of difficulty. Therefore, a natural question is
how to determine or generate these simpler tasks. In this work, we take
inspiration from Spatial Transformer Networks (STNs) in order to form an
easy-to-hard curriculum. Since STNs have been shown to remove clutter from
input images and to improve accuracy in image classification, we hypothesize
that images processed by STNs can be treated as easier examples and used to
build the curriculum. To this
end, we study multiple strategies developed for shaping the training
curriculum, using the data generated by STNs. We perform various experiments on
cluttered MNIST and Fashion-MNIST datasets, where on the former, we obtain an
improvement of $3.8$pp in classification accuracy compared to the baseline.
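To make the easy-to-hard idea concrete, the following is a minimal PyTorch sketch, not the authors' code: a spatial transformer module that learns to undo clutter, and a batch function that mixes STN-processed ("easy") and original ("hard") images. The architecture, the single-channel input, and the mixing probability p_easy are illustrative assumptions.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class STN(nn.Module):
    """Spatial transformer: predict an affine transform, then resample the input."""
    def __init__(self):
        super().__init__()
        self.loc = nn.Sequential(                     # localization network
            nn.Conv2d(1, 8, kernel_size=7), nn.MaxPool2d(2), nn.ReLU(),
            nn.Conv2d(8, 10, kernel_size=5), nn.MaxPool2d(2), nn.ReLU(),
            nn.Flatten(),
            nn.LazyLinear(32), nn.ReLU(),
            nn.Linear(32, 6),                         # six affine parameters
        )
        # start from the identity transform so training is stable early on
        self.loc[-1].weight.data.zero_()
        self.loc[-1].bias.data.copy_(torch.tensor([1., 0., 0., 0., 1., 0.]))

    def forward(self, x):
        theta = self.loc(x).view(-1, 2, 3)
        grid = F.affine_grid(theta, x.size(), align_corners=False)
        return F.grid_sample(x, grid, align_corners=False)   # "de-cluttered" view

def curriculum_batch(x, stn, p_easy):
    """With probability p_easy, train on STN-processed ("easy") images;
    otherwise train on the original cluttered ("hard") images. Annealing
    p_easy from 1 to 0 gives an easy-to-hard curriculum."""
    return stn(x) if torch.rand(()).item() < p_easy else x
```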
Related papers
- EfficientTrain++: Generalized Curriculum Learning for Efficient Visual Backbone Training [79.96741042766524]
We reformulate the training curriculum as a soft-selection function.
We show that gradually exposing the contents of natural images can be achieved by modulating the intensity of data augmentation (sketched after this entry).
The resulting method, EfficientTrain++, is simple, general, yet surprisingly effective.
arXiv Detail & Related papers (2024-05-14T17:00:43Z)
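For EfficientTrain++ above, a hedged torchvision sketch of the augmentation-intensity idea: start with weak augmentation ("easier" views) and strengthen it over training. The linear schedule and RandAugment parameters are assumptions, not the paper's soft-selection function.

```python
from torchvision import transforms

def transform_for_epoch(epoch, total_epochs, max_magnitude=9):
    # weak augmentation early exposes less image variation; strong later
    magnitude = int(max_magnitude * epoch / max(total_epochs - 1, 1))
    return transforms.Compose([
        transforms.RandomResizedCrop(224),
        transforms.RandAugment(num_ops=2, magnitude=magnitude),
        transforms.ToTensor(),
    ])
```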
- EfficientTrain: Exploring Generalized Curriculum Learning for Training Visual Backbones [80.662250618795]
This paper presents a new curriculum learning approach for the efficient training of visual backbones (e.g., vision Transformers).
As an off-the-shelf method, it reduces the wall-time training cost of a wide variety of popular models by >1.5x on ImageNet-1K/22K without sacrificing accuracy.
arXiv Detail & Related papers (2022-11-17T17:38:55Z)
- Pre-Train Your Loss: Easy Bayesian Transfer Learning with Informative Priors [59.93972277761501]
We show that we can learn highly informative posteriors from the source task, through supervised or self-supervised approaches.
This simple modular approach enables significant performance gains and more data-efficient learning on a variety of downstream classification and segmentation tasks.
arXiv Detail & Related papers (2022-05-20T16:19:30Z)
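For Pre-Train Your Loss above, a simplified sketch of fine-tuning with a source-task posterior as the prior. The diagonal Gaussian form and the 1/N scaling are assumptions standing in for the paper's learned informative priors.

```python
import torch

def log_prior(model, prior_mean, prior_var):
    """Gaussian log-density (up to a constant) of the weights under the prior."""
    lp = 0.0
    for p, mu, var in zip(model.parameters(), prior_mean, prior_var):
        lp = lp - 0.5 * ((p - mu) ** 2 / var).sum()
    return lp

def loss_with_prior(task_loss, model, prior_mean, prior_var, n_data):
    # divide by dataset size so the prior's strength matches Bayesian scaling
    return task_loss - log_prior(model, prior_mean, prior_var) / n_data
```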
- Friendly Training: Neural Networks Can Adapt Data To Make Learning Easier [23.886422706697882]
We propose a novel training procedure named Friendly Training.
We show that Friendly Training yields improvements over both informed data sub-selection and random selection.
Results suggest that adapting the input data is a feasible way to stabilize learning and improve the generalization skills of the network (a rough sketch follows this entry).
arXiv Detail & Related papers (2021-06-21T10:50:34Z)
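For Friendly Training above, a rough PyTorch sketch of adapting the data: nudge each batch a few gradient steps toward lower loss before the usual parameter update. The step size, step count, and perturbation budget are illustrative assumptions, not the paper's schedule.

```python
import torch
import torch.nn.functional as F

def simplify_batch(model, x, y, steps=3, lr=0.1, budget=1.0):
    x_adv = x.clone().requires_grad_(True)
    for _ in range(steps):
        loss = F.cross_entropy(model(x_adv), y)
        (grad,) = torch.autograd.grad(loss, x_adv)
        with torch.no_grad():
            x_adv -= lr * grad                                   # easier inputs
            x_adv.copy_(x + (x_adv - x).clamp(-budget, budget))  # stay near data
    return x_adv.detach()

# Shrinking `budget` toward zero over training hands control back to the
# original (unmodified) data, as in an easy-to-hard curriculum.
```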
- Rectification-based Knowledge Retention for Continual Learning [49.1447478254131]
Deep learning models suffer from catastrophic forgetting when trained in an incremental learning setting.
We propose a novel approach to address the task incremental learning problem, which involves training a model on new tasks that arrive in an incremental manner.
Our approach can be used in both the zero-shot and non-zero-shot task incremental learning settings.
arXiv Detail & Related papers (2021-03-30T18:11:30Z)
- Statistical Measures For Defining Curriculum Scoring Function [5.328970912536596]
We show improvements in performance with convolutional and fully-connected neural networks on real image datasets.
Motivated by our insights from implicit curriculum ordering, we introduce a simple curriculum learning strategy.
We also propose and study the performance of a dynamic curriculum learning algorithm.
arXiv Detail & Related papers (2021-02-27T07:25:49Z)
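For the curriculum-scoring entry above, a hedged sketch that scores examples by per-example loss and grows the training pool from easy to hard; the loss-based score and linear pacing function are assumed proxies, not the statistical measures studied in the paper.

```python
import torch
import torch.nn.functional as F

@torch.no_grad()
def score_examples(model, loader, device="cpu"):
    """Lower loss = 'easier'. Returns one score per example, in loader order."""
    scores = []
    for x, y in loader:
        logits = model(x.to(device))
        scores.append(F.cross_entropy(logits, y.to(device), reduction="none").cpu())
    return torch.cat(scores)

def pacing_indices(scores, step, total_steps, start_frac=0.2):
    """Linearly grow the pool of easiest examples from start_frac to all data."""
    frac = min(1.0, start_frac + (1.0 - start_frac) * step / total_steps)
    k = max(1, int(frac * len(scores)))
    return torch.argsort(scores)[:k]               # indices of the k easiest
```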
- Expert Training: Task Hardness Aware Meta-Learning for Few-Shot Classification [62.10696018098057]
We propose an easy-to-hard expert meta-training strategy to arrange the training tasks properly.
A task hardness aware module is designed and integrated into the training procedure to estimate the hardness of a task.
Experimental results on the miniImageNet and tieredImageNetSketch datasets show that the meta-learners can obtain better results with our expert training strategy.
arXiv Detail & Related papers (2020-07-13T08:49:00Z)
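For Expert Training above, a hedged sketch of easy-to-hard episode ordering: estimate each few-shot task's hardness with a fixed scoring model and sort. Using query-set loss as the hardness estimate is an assumption, not the paper's task hardness aware module.

```python
import torch
import torch.nn.functional as F

@torch.no_grad()
def task_hardness(scorer, task):
    """Higher query-set loss under a fixed scoring model = harder task."""
    _support, (xq, yq) = task          # one episode: (support set, query set)
    return F.cross_entropy(scorer(xq), yq).item()

def order_tasks(scorer, tasks):
    """Arrange meta-training episodes from easiest to hardest."""
    return sorted(tasks, key=lambda t: task_hardness(scorer, t))
```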
- Few Is Enough: Task-Augmented Active Meta-Learning for Brain Cell Classification [8.998976678920236]
We propose a tAsk-auGmented actIve meta-LEarning (AGILE) method to efficiently adapt Deep Neural Networks to new tasks.
AGILE combines a meta-learning algorithm with a novel task augmentation technique which we use to generate an initial adaptive model.
We show that the proposed task-augmented meta-learning framework can learn to classify new cell types after a single gradient step.
arXiv Detail & Related papers (2020-07-09T18:03:12Z)
- Curriculum By Smoothing [52.08553521577014]
Convolutional Neural Networks (CNNs) have shown impressive performance in computer vision tasks such as image classification, detection, and segmentation.
We propose an elegant curriculum-based scheme that smooths the feature embeddings of a CNN using anti-aliasing or low-pass filters.
As the amount of information in the feature maps increases during training, the network is able to progressively learn better representations of the data.
arXiv Detail & Related papers (2020-03-03T07:27:44Z)
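For Curriculum By Smoothing above, a minimal sketch of the idea: depthwise-filter a CNN's feature maps with a Gaussian kernel whose standard deviation is annealed toward zero, so higher-frequency detail is exposed as training progresses. The kernel size and decay schedule are assumptions.

```python
import torch
import torch.nn.functional as F

def gaussian_kernel(sigma, size=5):
    coords = torch.arange(size, dtype=torch.float32) - size // 2
    g = torch.exp(-coords**2 / (2 * sigma**2))
    g = g / g.sum()
    return torch.outer(g, g)                       # separable 2-D Gaussian

def smooth_features(feats, sigma):
    """Depthwise low-pass filtering of feature maps (applied after conv blocks)."""
    if sigma <= 0:
        return feats                               # curriculum finished: no blur
    c = feats.shape[1]
    k = gaussian_kernel(sigma).to(feats).repeat(c, 1, 1, 1)   # (C, 1, 5, 5)
    return F.conv2d(feats, k, padding=2, groups=c)

# e.g. sigma_t = sigma_0 * decay**epoch, applied after each convolutional block
```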
This list is automatically generated from the titles and abstracts of the papers in this site.