Accuracy Aligned. Concise Implementation of Swin Transformer

Last update: Dec 16, 2022

Related tags

Overview

Accuracy Aligned. Concise Implementation of Swin Transformer

This repository contains the implementation of Swin Transformer, and the training codes on ImageNet datasets. We have aligned the output of our network with the official one, that is, using the same input and random seed, the output is identical to the official one.

Our implementation is highly based on einops, which makes the implementation more concise, and easy to be understand. (Intuitively, we use only 200 lines of codes compared with ~600 lines of official codes.) Besides, our implementation keeps the same training speed.

Model	Epoch	[email protected](our)	[email protected](our)	[email protected](official)	[email protected](official)	pretrained model
Swin-T	300	81.3	95.5	81.2	95.5	here

Usage

Train on ImageNet:

Train Swin-T

python -m torch.distributed.launch --nproc_per_node=8 --use_env train.py --model Swin_T \
--batch-size 128 --drop-path 0.2 --data-path ~/ILSVRC2012/ --output_dir /data/SwinTransformer_exp/SwinT/

Train Swin-S

python -m torch.distributed.launch --nproc_per_node=8 --use_env train.py --model Swin_S \
--batch-size 128 --drop-path 0.3 --data-path ~/ILSVRC2012/ --output_dir /data/SwinTransformer_exp/SwinS/

Train Swin-B

python -m torch.distributed.launch --nproc_per_node=8 --use_env train.py --model Swin_B \
--batch-size 128 --drop-path 0.5 --data-path ~/ILSVRC2012/ --output_dir /data/SwinTransformer_exp/SwinB/

Reference

The training process involves many training and augmentation tricks, such as stochastic depth, mixup, cutmix and random erasing. I borrow large from Deit (https://github.com/facebookresearch/deit).

Citations

@misc{liu2021swin,
      title={Swin Transformer: Hierarchical Vision Transformer using Shifted Windows}, 
      author={Ze Liu and Yutong Lin and Yue Cao and Han Hu and Yixuan Wei and Zheng Zhang and Stephen Lin and Baining Guo},
      year={2021},
      eprint={2103.14030},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}

Accuracy Aligned. Concise Implementation of Swin Transformer

Related tags

Overview

Accuracy Aligned. Concise Implementation of Swin Transformer

Usage

Reference

Citations

Owner

FengWang

Revitalizing CNN Attention via Transformers in Self-Supervised Visual Representation Learning

Anomaly Detection Based on Hierarchical Clustering of Mobile Robot Data

Multi Camera Calibration

A tool to visualise the results of AlphaFold2 and inspect the quality of structural predictions

The story of Chicken for Club Bing

NeoDTI: Neural integration of neighbor information from a heterogeneous network for discovering new drug-target interactions

An index of algorithms for learning causality with data

Implemenets the Contourlet-CNN as described in C-CNN: Contourlet Convolutional Neural Networks, using PyTorch

Face and Body Tracking for VRM 3D models on the web.

MQBench: Towards Reproducible and Deployable Model Quantization Benchmark

Pytorch implementation of Deep Recursive Residual Network for Super Resolution (DRRN)

🥇 LG-AI-Challenge 2022 1위 솔루션 입니다.

PyTorch code for EMNLP 2021 paper: Don't be Contradicted with Anything! CI-ToD: Towards Benchmarking Consistency for Task-oriented Dialogue System

Pytorch reimplementation of PSM-Net: "Pyramid Stereo Matching Network"

Implementation of "Unsupervised Domain Adaptive 3D Detection with Multi-Level Consistency"

[TIP 2020] Multi-Temporal Scene Classification and Scene Change Detection with Correlation based Fusion

Official Pytorch Implementation of 'Learning Action Completeness from Points for Weakly-supervised Temporal Action Localization' (ICCV-21 Oral)

Python implementation of Lightning-rod Agent, the Stack4Things board-side probe

Flexible Option Learning - NeurIPS 2021

The LaTeX and Python code for generating the paper, experiments' results and visualizations reported in each paper is available (whenever possible) in the paper's directory