Repo for the paper Extrapolating from a Single Image to a Thousand Classes using Distillation

Last update: Nov 04, 2022

Overview

Extrapolating from a Single Image to a Thousand Classes using Distillation

by Yuki M. Asano* and Aaqib Saeed* (*Equal Contribution)

Extrapolating from one image. Strongly augmented patches from a single image are used to train a student (S) to distinguish semantic classes, such as those in ImageNet. The student neural network is initialized randomly and learns from a pretrained teacher (T) via KL-divergence. Although almost none of target categories are present in the image, we find student performances of > 59% for classifying ImageNet's 1000 classes. In this paper, we develop this single datum learning framework and investigate it across datasets and domains.

Key contributions

A minimal framework for training neural networks with a single datum from scratch using distillation.
Extensive ablations of the proposed method, such as the dependency on the source image, the choice of augmentations and network architectures.
Large scale empirical evidence of neural networks' ability to extrapolate on > 13 image, video and audio datasets.
Qualitative insights on what and how neural networks trained with a single image learn.

Neuron visualizations

We compare activation-maximization-based visualizations using the Lucent library. Even though the model has never seen an image of a panda, the model trained with a teacher and only single-image inputs has a good idea of how a panda looks like.

Running the experiments

Installation

In each folder cifar\in1k\video you will find a requirements.txt file. Install packages as follows:

pip3 install -r requirements.txt

1. Prepare Dataset:

To generate single image data, we refer to the data_generation folder

2. Run Experiments:

There is a main "distill.py" file for each experiment type: small-scale and large-scale images and video. Note: 2a uses tensorflow and 2b, 2c use pytorch.

2a. Run distillation experiments for CIFAR-10/100

e.g. with Animal single-image dataset as follows:

# in cifar folder:
python3 distill.py --dataset=cifar10 --image=/path/to/single_image_dataset/ \
                   --student=wrn_16_4 --teacher=wrn_40_4

Note that we provide a pretrained teacher model for reproducibility.

2b. Run distillation experiments for ImageNet with single-image dataset as follows:

# in in1k folder:
python3 distill.py --dataset=in1k --testdir /ILSVRC12/val/ \
                   --traindir=/path/to/dataset/ --student_arch=resnet50 --teacher_arch=resnet18

Note that teacher models are automatically downloaded from torchvision or timm.

2c. Run distillation experiments for Kinetics with single-image-created video dataset as follows:

# in video folder:
python3 distill.py --dataset=k400 --traindir=/dataset/with/vids --test_data_path /path/to/k400/val

Note that teacher models are automatically downloaded from torchvideo when you distill a K400 model.

Pretrained models

Large-scale (224x224-sized) image ResNet-50 models trained for 200ep:

Dataset	Teacher	Student	Performance	Checkpoint
ImageNet-12	R18	R50	59.1%	R50 weights
ImageNet-12	R50	R50	53.5%	R50 weights
Places365	R18	R50	54.7%	R50 weights
Flowers101	R18	R50	58.1%	R50 weights
Pets37	R18	R50	83.7%	R50 weights
IN100	R18	R50	74.1%	R50 weights
STL-10	R18	R50	93.0%	R50 weights

Video x3d_s_e (expanded) models (160x160 crop, 4frames) trained for 400ep:

Dataset	Teacher	Student	Performance	Checkpoint
K400	x3d_xs	x3d_xs_e	53.57%	weights
UCF101	x3d_xs	x3d_xs_e	77.32%	weights

Citation

@inproceedings{asano2021extrapolating,
  title={Extrapolating from a Single Image to a Thousand Classes using Distillation},
  author={Asano, Yuki M. and Saeed, Aaqib},
  journal={arXiv preprint arXiv:2112.00725},
  year={2021}
}

Repo for the paper Extrapolating from a Single Image to a Thousand Classes using Distillation

Related tags

Overview

Extrapolating from a Single Image to a Thousand Classes using Distillation

Key contributions

Neuron visualizations

Running the experiments

Installation

1. Prepare Dataset:

2. Run Experiments:

2a. Run distillation experiments for CIFAR-10/100

2b. Run distillation experiments for ImageNet with single-image dataset as follows:

2c. Run distillation experiments for Kinetics with single-image-created video dataset as follows:

Pretrained models

Citation

Owner

Yuki M. Asano

A framework for annotating 3D meshes using the predictions of a 2D semantic segmentation model.

A project for developing transformer-based models for clinical relation extraction

Pytorch implementation of Rosca, Mihaela, et al. "Variational Approaches for Auto-Encoding Generative Adversarial Networks."

Change is Everywhere: Single-Temporal Supervised Object Change Detection in Remote Sensing Imagery (ICCV 2021)

Defocus Map Estimation and Deblurring from a Single Dual-Pixel Image

The codebase for our paper "Generative Occupancy Fields for 3D Surface-Aware Image Synthesis" (NeurIPS 2021)

A simple, clean TensorFlow implementation of Generative Adversarial Networks with a focus on modeling illustrations.

A sketch extractor for anime/illustration.

[NIPS 2021] UOTA: Improving Self-supervised Learning with Automated Unsupervised Outlier Arbitration.

Resilient projection-based consensus actor-critic (RPBCAC) algorithm

The codebase for Data-driven general-purpose voice activity detection.

Centroid-UNet is deep neural network model to detect centroids from satellite images.

Repository for the semantic WMI loss

Segmentation for medical image.

Contains modeling practice materials and homework for the Computational Neuroscience course at Okinawa Institute of Science and Technology

Character Controllers using Motion VAEs

Keras Implementation of The One Hundred Layers Tiramisu: Fully Convolutional DenseNets for Semantic Segmentation by (Simon Jégou, Michal Drozdzal, David Vazquez, Adriana Romero, Yoshua Bengio)

Classify the disease status of a plant given an image of a passion fruit

Its a Plant Leaf Disease Detection System based on Machine Learning.

Combining Automatic Labelers and Expert Annotations for Accurate Radiology Report Labeling Using BERT