PyTorch implementation of "PatchGame: Learning to Signal Mid-level Patches in Referential Games" to appear in NeurIPS 2021

Last update: Mar 16, 2022

Overview

PatchGame: Learning to Signal Mid-level Patches in Referential Games

This repository is the official implementation of the paper "PatchGame: Learning to Signal Mid-level Patches in Referential Games".

Requirements

We recommend using Anaconda or Miniconda to manage Python. Our code has been tested with python=3.8 on Linux.

To create a new environment with conda:

conda create -n patchgame python=3.8
conda activate patchgame

We recommend installing the latest PyTorch and torchvision packages. You can install them using:

conda install pytorch torchvision -c pytorch

Make sure the following requirements are met:

torch>=1.8.1
torchvision>=0.9.1
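As a quick sanity check, the two minimum versions above can be verified from Python. This is a minimal sketch using only the standard library; the helper names (`parse_version`, `meets_minimum`, `check_requirements`) are made up for illustration and are not part of this repository. It assumes the packages expose standard installation metadata, which may not hold for every install method.

```python
import re
from importlib import metadata

# Minimum versions copied from the requirements listed above.
MINIMUMS = {"torch": "1.8.1", "torchvision": "0.9.1"}


def parse_version(text):
    """Turn '1.10.0+cu102' into (1, 10, 0); build/local suffixes are ignored."""
    core = re.split(r"[+\-]", text, maxsplit=1)[0]
    parts = []
    for piece in core.split("."):
        match = re.match(r"\d+", piece)
        if match is None:
            break
        parts.append(int(match.group()))
    return tuple(parts)


def meets_minimum(installed, minimum):
    """True if the installed version is at least the minimum version."""
    return parse_version(installed) >= parse_version(minimum)


def check_requirements(minimums=MINIMUMS):
    """Return {package: (installed_version_or_None, ok)} for each requirement."""
    report = {}
    for name, minimum in minimums.items():
        try:
            installed = metadata.version(name)
        except metadata.PackageNotFoundError:
            report[name] = (None, False)
            continue
        report[name] = (installed, meets_minimum(installed, minimum))
    return report


if __name__ == "__main__":
    for name, (installed, ok) in check_requirements().items():
        print(f"{name}: installed={installed} ok={ok}")
```

A package reported as `installed=None` was not found in the active environment, so check that `conda activate patchgame` was run first.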

Installing `torchsort`

Note that we have only tested installing torchsort with cuda==10.2.89 and gcc==6.3.0.

export TORCH_CUDA_ARCH_LIST="Pascal;Volta;Turing"
unzip torchsort.zip && cd torchsort
python setup.py install --user
cd .. && rm -rf torchsort
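The `TORCH_CUDA_ARCH_LIST` value above names three GPU architectures. The sketch below shows one way to check whether a given GPU falls under that list, based on the usual mapping from CUDA compute capability to architecture name (Pascal 6.x, Volta 7.0/7.2, Turing 7.5). The helper names are hypothetical and not part of this repository or of torchsort.

```python
# Compute capabilities (major, minor) for the three architecture names
# used in the TORCH_CUDA_ARCH_LIST export above.
ARCH_BY_CAPABILITY = {
    (6, 0): "Pascal", (6, 1): "Pascal", (6, 2): "Pascal",
    (7, 0): "Volta", (7, 2): "Volta",
    (7, 5): "Turing",
}


def parse_arch_list(value):
    """Split a string such as 'Pascal;Volta;Turing' into a set of names."""
    return {name.strip() for name in value.split(";") if name.strip()}


def is_covered(capability, arch_list):
    """True if the (major, minor) capability maps to a name in arch_list."""
    name = ARCH_BY_CAPABILITY.get(tuple(capability))
    return name is not None and name in parse_arch_list(arch_list)


if __name__ == "__main__":
    # Example: a Turing card (capability 7.5) is covered by the list above.
    print(is_covered((7, 5), "Pascal;Volta;Turing"))
```

With PyTorch installed, `torch.cuda.get_device_capability(0)` returns the `(major, minor)` pair for the first GPU, which can be passed directly to `is_covered`. A GPU outside these three architectures is not covered by the list as written, and we have not tested such a setup.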

Dataset

We use ImageNet-1k (ILSVRC2012) data in all our experiments. Please download and save the data from the official website.
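After downloading, it can help to confirm the directory layout before launching a long run. The sketch below assumes the common ImageFolder-style layout, with `train` and `val` subdirectories that each contain one folder per class (1000 for ImageNet-1k). That layout is an assumption on our part and is not stated above; the function names are illustrative only.

```python
from pathlib import Path


def count_class_dirs(split_dir):
    """Count immediate subdirectories (one per class in an ImageFolder-style layout)."""
    split_dir = Path(split_dir)
    if not split_dir.is_dir():
        return 0
    return sum(1 for entry in split_dir.iterdir() if entry.is_dir())


def check_imagenet_layout(root, splits=("train", "val"), expected_classes=1000):
    """Return {split: (num_class_dirs, matches_expected)} for each split under root."""
    root = Path(root)
    report = {}
    for split in splits:
        found = count_class_dirs(root / split)
        report[split] = (found, found == expected_classes)
    return report
```

For example, `check_imagenet_layout("/path/to/imagenet/dir")` should report 1000 class folders for each split if the data was extracted in this layout.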

Training

To train the model(s) in the paper on 1-8 GPUs, run this command (where nproc_per_node is the number of GPUs):

python -m torch.distributed.launch --nproc_per_node=1 train.py \
    --data_path /path/to/imagenet/dir/train \
    --output_dir /path/to/checkpoint/dir \
    --patch_size 32 --epochs 100
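When launching runs with different GPU counts from a script, the command above can be assembled programmatically. This is a small sketch that only uses the flags shown above; `build_train_command` is a hypothetical helper, not something provided by this repository.

```python
import shlex


def build_train_command(num_gpus, data_path, output_dir, patch_size=32, epochs=100):
    """Build the training command shown above as an argument list."""
    if not 1 <= num_gpus <= 8:
        raise ValueError("training is described for 1-8 GPUs")
    return [
        "python", "-m", "torch.distributed.launch",
        f"--nproc_per_node={num_gpus}", "train.py",
        "--data_path", str(data_path),
        "--output_dir", str(output_dir),
        "--patch_size", str(patch_size),
        "--epochs", str(epochs),
    ]


if __name__ == "__main__":
    cmd = build_train_command(4, "/path/to/imagenet/dir/train", "/path/to/checkpoint/dir")
    # Print a shell-safe version of the command; pass `cmd` to subprocess.run to execute it.
    print(shlex.join(cmd))
```

Returning a list rather than a single string keeps paths with spaces intact when the command is passed to `subprocess.run`.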

Pre-trained Models

You can download pretrained models here. They were trained on ImageNet using the command above (with default hyperparameters).

Evaluation

PatchRank with ViT

python eval_patchrank.py --patch-model mymodel.pth --data-path <path to dataset> --topk <no. of patches to use>

This achieves the following accuracy on ImageNet.

Model name	Top 1 Accuracy	Top 5 Accuracy
PatchGame(S=32, topk=75, size=384x384)	58.4%	80.9%
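For readers unfamiliar with the two metrics in the table, the sketch below shows how Top 1 and Top 5 accuracy are typically computed from per-class scores. It is a generic pure-Python illustration with a made-up function name, not the code used in `eval_patchrank.py`.

```python
def topk_accuracy(scores, labels, k):
    """Fraction of samples whose true label is among the k highest-scoring classes.

    scores: list of per-class score lists, one list per sample.
    labels: list of integer class indices, one per sample.
    """
    if len(scores) != len(labels):
        raise ValueError("scores and labels must have the same length")
    if not labels:
        return 0.0
    hits = 0
    for row, label in zip(scores, labels):
        # Class indices sorted from highest to lowest score.
        ranked = sorted(range(len(row)), key=lambda c: row[c], reverse=True)
        if label in ranked[:k]:
            hits += 1
    return hits / len(labels)


if __name__ == "__main__":
    scores = [[0.1, 0.7, 0.2], [0.5, 0.3, 0.2], [0.2, 0.3, 0.5]]
    labels = [1, 1, 0]
    print(topk_accuracy(scores, labels, k=1), topk_accuracy(scores, labels, k=2))
```

Top 5 accuracy is always at least as high as Top 1 accuracy, since a prediction that is correct at rank 1 is also within the top 5.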

k-NN classification on ImageNet with the listener's vision module

python -m torch.distributed.launch --nproc_per_node=1 eval_knn.py \
    --pretrained_weights /path/to/checkpoint/dir/checkpoint.pth \
    --arch resnet18 --nb_knn 20 \
    --batch_size_per_gpu 1024 --use_cuda 0 \
    --data_path /path/to/imagenet/dir

This achieves the following accuracy on ImageNet.

Model name	Top 1 Accuracy	Top 5 Accuracy
PatchGame(S=32)	30.3%	49.9%
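The idea behind this evaluation is that each validation image is classified by looking at the labels of its nearest training images in feature space (`--nb_knn 20` sets the number of neighbours). The sketch below is a simplified pure-Python version using cosine similarity and an unweighted majority vote. It is an illustration under those assumptions only; `eval_knn.py` may weight votes differently, and the function names here are hypothetical.

```python
import math
from collections import Counter


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length feature vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def knn_predict(query, train_features, train_labels, k=20):
    """Predict a label for `query` by majority vote over its k most similar training features."""
    if not train_features:
        raise ValueError("train_features must not be empty")
    sims = [
        (cosine_similarity(query, feature), label)
        for feature, label in zip(train_features, train_labels)
    ]
    # Most similar training samples first.
    sims.sort(key=lambda pair: pair[0], reverse=True)
    votes = Counter(label for _, label in sims[:k])
    return votes.most_common(1)[0][0]


if __name__ == "__main__":
    features = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [0.1, 0.9]]
    labels = ["cat", "cat", "dog", "dog"]
    print(knn_predict([1.0, 0.05], features, labels, k=3))
```

In practice the feature vectors would come from the listener's vision module (a ResNet-18 in the command above), and the similarity search would be batched on the GPU rather than computed in a Python loop.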

Acknowledgements

We would like to thank the authors of several public repos from which we borrowed various utilities.

License

This repository is released under the Apache 2.0 license as found in the LICENSE file.

Owner

Kamal Gupta
