This project uses ViT to perform image classification tasks on DATA set CIFAR10.

Last update: Jun 03, 2022

Overview

Vision-Transformer-Multiprocess-DistributedDataParallel-Apex

Introduction

This project uses ViT to perform image classification tasks on DATA set CIFAR10. The implement of Vit and pretrained weight are from https://github.com/asyml/vision-transformer-pytorch. Different from https://github.com/Kaicheng-Yang0828/Vision-Transformer-ViT, this project use multi-process distributed training and it also use Apex to reduce GPU resource consumption.

Requirments

pytorch 1.7.1
python 3.7.3

Install Apex

1、 git clone https://github.com/NVIDIA/apex.git
2、 cd apex
3、 python setup.py install

Datasets

Download the CIFAR10 from http://www.cs.toronto.edu/~kriz/cifar.html or you can get it from https://pan.baidu.com/s/1ogAFopdVzswge2Aaru_lvw (code: k5v8), creat data floder and unzip the cifar-10-python.tar.gz under './data'

Pre_trained model

You can download the pretrained file from https://pan.baidu.com/s/1CuUj-XIXwecxWMEcLoJzPg (code: ox9n), creat Vit_weights floder and pretrained file under ./Vit_weights

Train

python main.py

Result

Base on the pretrained weight, after one epoch, I get 98.1 Accuracy (I didn't adjust the parameters carefully, you can get better results by adjusting the parameters)

model	dataset	acc
ViT-B_16	CIFAR10	98.1

Attention

1、Multi-process parallel training reduces the training time by one-fifth
2、Apex reduce about 30% GPU resources under the premise of ensuring the same accuracy rate

This project uses ViT to perform image classification tasks on DATA set CIFAR10.

Related tags

Overview

Vision-Transformer-Multiprocess-DistributedDataParallel-Apex

Introduction

Requirments

Install Apex

Datasets

Pre_trained model

Train

Result

Attention

Owner

Kaicheng Yang

Reinforcement learning framework and algorithms implemented in PyTorch.

Investigating automatic navigation towards standard US views integrating MARL with the virtual US environment developed in CT2US simulation

A Low Complexity Speech Enhancement Framework for Full-Band Audio (48kHz) based on Deep Filtering.

Code image classification of MNIST dataset using different architectures: simple linear NN, autoencoder, and highway network

Multi-task Multi-agent Soft Actor Critic for SMAC

Application of the L2HMC algorithm to simulations in lattice QCD.

A pytorch-based deep learning framework for multi-modal 2D/3D medical image segmentation

Caffe implementation for Hu et al. Segmentation for Natural Language Expressions

Training Structured Neural Networks Through Manifold Identification and Variance Reduction

UnpNet - Rethinking 3-D LiDAR Point Cloud Segmentation(IEEE TNNLS)

Implementation of SETR model, Original paper: Rethinking Semantic Segmentation from a Sequence-to-Sequence Perspective with Transformers.

Code for the paper A Theoretical Analysis of the Repetition Problem in Text Generation

A Structured Self-attentive Sentence Embedding

GPU-accelerated Image Processing library using OpenCL

Reverse engineer your pytorch vision models, in style

A PyTorch implementation for Unsupervised Domain Adaptation by Backpropagation(DANN), support Office-31 and Office-Home dataset

A large-scale video dataset for the training and evaluation of 3D human pose estimation models

Data manipulation and transformation for audio signal processing, powered by PyTorch

MVSDF - Learning Signed Distance Field for Multi-view Surface Reconstruction

Deep Learning agent of Starcraft2, similar to AlphaStar of DeepMind except size of network.