Learning to Prompt for Continual Learning

Last update: Jan 06, 2023

Overview

Learning to Prompt for Continual Learning (L2P) Official Jax Implementation

L2P is a novel continual learning technique which learns to dynamically prompt a pre-trained model to learn tasks sequentially under different task transitions. Different from mainstream rehearsal-based or architecture-based methods, L2P requires neither a rehearsal buffer nor test-time task identity. L2P can be generalized to various continual learning settings including the most challenging and realistic task-agnostic setting. L2P consistently outperforms prior state-of-the-art methods. Surprisingly, L2P achieves competitive results against rehearsal-based methods even without a rehearsal buffer.
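To make "dynamically prompt" a little more concrete, here is a minimal sketch of the query-key prompt selection idea described in the L2P paper: each prompt in a pool has an associated key, and the prompts whose keys are most similar to the input's feature are selected. This is a simplified pure-Python illustration, not the repository's JAX code. The names `cosine_similarity`, `select_prompts`, `prompt_keys`, and `query_feature`, as well as the toy values, are hypothetical.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def select_prompts(query, keys, top_n):
    """Return the indices of the top_n keys most similar to the query."""
    scores = [cosine_similarity(query, key) for key in keys]
    ranked = sorted(range(len(keys)), key=lambda i: scores[i], reverse=True)
    return ranked[:top_n]


# Hypothetical toy pool: 4 prompt keys in a 3-dimensional feature space.
# In the real method the keys are learned and the query comes from a
# frozen pre-trained ViT; here both are hard-coded for illustration.
prompt_keys = [
    [1.0, 0.0, 0.0],
    [0.0, 1.0, 0.0],
    [0.0, 0.0, 1.0],
    [1.0, 1.0, 0.0],
]
query_feature = [0.9, 0.8, 0.1]

# Indices of the prompts that would be prepended to the input embeddings.
selected = select_prompts(query_feature, prompt_keys, top_n=2)
print(selected)
```

Because selection depends only on the input's own feature, no task identity is needed at test time, which is the property the overview highlights.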

Code is written by Zifeng Wang. Acknowledgement to https://github.com/google-research/nested-transformer.

This is not an officially supported Google product.

Environment setup

pip install -r requirements.txt

Getting pretrained ViT model

The ViT-B/16 model used in this paper can be downloaded here.

Instructions on running L2P

We provide configuration files in configs for training and evaluating L2P on multiple benchmarks.

To run our method on the Split CIFAR-100 dataset (class-incremental setting):

python -m main --my_config configs/cifar100_l2p.py --workdir=./cifar100_l2p --my_config.init_checkpoint=<ViT-saved-path/ViT-B_16.npz>

To run our method on the more complex Gaussian Scheduled CIFAR-100 dataset (task-agnostic setting):

python -m main --my_config configs/cifar100_gaussian_l2p.py --workdir=./cifar100_gaussian_l2p --my_config.init_checkpoint=<ViT-saved-path/ViT-B_16.npz>

Note: we ran our experiments on 8 V100 GPUs or 4 TPUs, with a per-device batch size of 16 specified in the config files. This corresponds to a total batch size of 128.
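If you run on different hardware, the relationship between the per-device and total batch size can be sketched as below. This is only an illustration of the arithmetic. The helper names are hypothetical, and how the repository actually counts devices is an assumption not confirmed here.

```python
def total_batch_size(per_device_batch_size, num_devices):
    """Total batch size when every device processes the same number of examples."""
    return per_device_batch_size * num_devices


def per_device_batch_size_for(target_total, num_devices):
    """Per-device batch size needed to reach a target total batch size."""
    if target_total % num_devices != 0:
        raise ValueError("target_total must be divisible by num_devices")
    return target_total // num_devices


# Setting from the note above: 8 devices with 16 examples each.
reference_total = total_batch_size(per_device_batch_size=16, num_devices=8)

# Hypothetical smaller machine with 2 devices: keeping the same total
# batch size would require a larger per-device batch size.
adjusted_per_device = per_device_batch_size_for(reference_total, num_devices=2)
print(reference_total, adjusted_per_device)
```

Whether a larger per-device batch fits in memory depends on your accelerator. If it does not, the total batch size changes and results may differ from the reported ones.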

Visualize results

We use TensorBoard to visualize the results. For example, if the working directory specified when running L2P is ./cifar100_l2p, the command to view the results is:

tensorboard --logdir ./cifar100_l2p

Here are the important metrics to keep track of, and their corresponding meanings:

Metric	Description
accuracy_n	Accuracy of the n-th task
forgetting	Average forgetting up until the current task
avg_acc	Average evaluation accuracy up until the current task
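As a rough guide to reading these metrics, the sketch below computes average accuracy and average forgetting from a table of per-task accuracies, using the definitions commonly used in the continual learning literature. It assumes `acc[t][j]` is the accuracy on task j measured after training on task t. The repository's exact computation may differ, and the function names and numbers here are hypothetical.

```python
def average_accuracy(acc, t):
    """Mean accuracy over tasks 0..t, measured after training on task t."""
    return sum(acc[t][: t + 1]) / (t + 1)


def average_forgetting(acc, t):
    """Mean drop from each earlier task's best past accuracy to its current one."""
    if t == 0:
        return 0.0  # nothing has been learned before the first task
    drops = []
    for j in range(t):
        # Best accuracy on task j at any earlier point (after tasks j..t-1).
        best_before = max(acc[l][j] for l in range(j, t))
        drops.append(best_before - acc[t][j])
    return sum(drops) / t


# Hypothetical accuracies for 3 sequential tasks (row = after training task t).
acc = [
    [0.90],
    [0.80, 0.85],
    [0.75, 0.80, 0.88],
]

final_avg_acc = average_accuracy(acc, 2)
final_forgetting = average_forgetting(acc, 2)
print(final_avg_acc, final_forgetting)
```

Higher average accuracy is better and lower forgetting is better. A method can score well on one and poorly on the other, so it helps to track both.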

Cite

@inproceedings{wang2021learning,
  title={Learning to Prompt for Continual Learning},
  author={Zifeng Wang and Zizhao Zhang and Chen-Yu Lee and Han Zhang and Ruoxi Sun and Xiaoqi Ren and Guolong Su and Vincent Perot and Jennifer Dy and Tomas Pfister},
  booktitle={arXiv preprint arXiv:2112.08654},
  year={2021}
}

Owner

Google Research
