Applying curriculum to meta-learning for few-shot classification

Last update: Oct 25, 2022

Overview

Curriculum Meta-Learning for Few-shot Classification

We propose an adaptation of the curriculum training framework that is applicable to state-of-the-art meta-learning techniques for few-shot classification. Curriculum-based training attempts to mimic human learning by progressively increasing the training complexity, which enables incremental concept learning. Because the meta-learner's goal is to learn how to learn from as few samples as possible, the number of those samples (i.e. the size of the support set) is a natural proxy for a given task's difficulty. We define a simple yet novel curriculum schedule that begins with a larger support size and progressively reduces it throughout training until it matches the desired shot size of the test setup. The proposed method improves both learning efficiency and generalization capability. Our experiments with the MAML algorithm on two few-shot image classification tasks show significant gains with the curriculum training framework. Ablation studies corroborate that the proposed method is independent of the model architecture and of the meta-learning hyperparameters.
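As a rough illustration of such a schedule, the sketch below shrinks the support size linearly from `shot * multiplier` at the start of training to `shot` at the end. The linear decay, the function name `support_size`, and its parameters are assumptions made for this example; the schedule actually implemented in `curriculum_meta_learning.py` may differ.

```python
def support_size(iteration: int, total_iterations: int, shot: int, multiplier: int) -> int:
    """Hypothetical linear curriculum: per-class support size at a given iteration.

    Starts at shot * multiplier (easier tasks) and ends at shot (the test setup).
    This is an illustrative assumption, not the repository's exact schedule.
    """
    if total_iterations < 2:
        # Nothing to interpolate over: use the target shot size directly.
        return shot
    start = shot * multiplier
    # Clamp the iteration into [0, total_iterations - 1] and map it to [0, 1].
    clamped = min(max(iteration, 0), total_iterations - 1)
    progress = clamped / (total_iterations - 1)
    # Interpolate from the enlarged support size down to the target shot size.
    return max(shot, round(start - progress * (start - shot)))


if __name__ == "__main__":
    # 5-shot target with multiplier 5 over five iterations.
    print([support_size(i, 5, shot=5, multiplier=5) for i in range(5)])
    # prints [25, 20, 15, 10, 5]
```

With `multiplier=1` the function returns `shot` at every iteration, which corresponds to training without a curriculum.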

How to reproduce

Our code is based on the learn2learn library. Specifically, we start from its MAML implementation and extend it with the ideas presented in our paper. Each of the results presented in the paper (including the ablation studies) can be reproduced by invoking the main script with the appropriate arguments.

Requirements

Install dependencies:

pip install torch
pip install learn2learn

Examples

5-way, 5-shot MiniImagenet using a convolutional neural network.

# Vanilla, achieves ~ 58% accuracy
python3 curriculum_meta_learning.py --dataset mini-imagenet --multiplier 1 --shot 5 --ways 5

# Ours, achieves ~ 66% accuracy
python3 curriculum_meta_learning.py --dataset mini-imagenet --multiplier 5 --shot 5 --ways 5

5-way, 1-shot OmniGlot using a fully connected neural network.

# Vanilla, achieves ~ 90% accuracy
python3 curriculum_meta_learning.py --dataset omniglot --multiplier 1 --shot 1 --ways 5 --fc

# Ours, achieves ~ 94% accuracy
python3 curriculum_meta_learning.py --dataset omniglot --multiplier 5 --shot 1 --ways 5 --fc

Ablation: disable LR annealing or query size adaptation during training.

python3 curriculum_meta_learning.py --multiplier 3 --freeze_lr

python3 curriculum_meta_learning.py --multiplier 3 --freeze_l

Ablation: use a statically larger support size instead of a curriculum.

python3 curriculum_meta_learning.py --dataset mini-imagenet --multiplier 5 --shot 5 --ways 5 --freeze_multiplier

Authors

Stergiadis Emmanouil (@steremma)
Priyanka Agrawal (@pagrawal-ml)
Oliver Squire (@ojsquire)

Owner

Stergiadis Manos
