PyTorch implementation of SimSiam: Exploring Simple Siamese Representation Learning

Last update: Dec 30, 2022

Related tags

Overview

SimSiam: Exploring Simple Siamese Representation Learning

This is a PyTorch implementation of the SimSiam paper:

@Article{chen2020simsiam,
  author  = {Xinlei Chen and Kaiming He},
  title   = {Exploring Simple Siamese Representation Learning},
  journal = {arXiv preprint arXiv:2011.10566},
  year    = {2020},
}

Preparation

Install PyTorch and download the ImageNet dataset following the official PyTorch ImageNet training code. Similar to MoCo, the code release contains minimal modifications for both unsupervised pre-training and linear classification to that code.

In addition, install apex for the LARS implementation needed for linear classification.

Unsupervised Pre-Training

Only multi-gpu, DistributedDataParallel training is supported; single-gpu or DataParallel training is not supported.

To do unsupervised pre-training of a ResNet-50 model on ImageNet in an 8-gpu machine, run:

python main_simsiam.py \
  -a resnet50 \
  --dist-url 'tcp://localhost:10001' --multiprocessing-distributed --world-size 1 --rank 0 \
  --fix-pred-lr \
  [your imagenet-folder with train and val folders]

The script uses all the default hyper-parameters as described in the paper, and uses the default augmentation recipe from MoCo v2.

The above command performs pre-training with a non-decaying predictor learning rate for 100 epochs, corresponding to the last row of Table 1 in the paper.

Linear Classification

With a pre-trained model, to train a supervised linear classifier on frozen features/weights in an 8-gpu machine, run:

python main_lincls.py \
  -a resnet50 \
  --dist-url 'tcp://localhost:10001' --multiprocessing-distributed --world-size 1 --rank 0 \
  --pretrained [your checkpoint path]/checkpoint_0099.pth.tar \
  --lars \
  [your imagenet-folder with train and val folders]

The above command uses LARS optimizer and a default batch size of 4096.

Models and Logs

Our pre-trained ResNet-50 models and logs:

pre-train epochs	batch size	pre-train ckpt	pre-train log	linear cls. ckpt	linear cls. log	top-1 acc.
100	512	link	link	link	link	68.1
100	256	link	link	link	link	68.3

Settings for the above: 8 NVIDIA V100 GPUs, CUDA 10.1/CuDNN 7.6.5, PyTorch 1.7.0.

Transferring to Object Detection

Same as MoCo for object detection transfer, please see moco/detection.

License

This project is under the CC-BY-NC 4.0 license. See LICENSE for details.

PyTorch implementation of SimSiam: Exploring Simple Siamese Representation Learning

Related tags

Overview

SimSiam: Exploring Simple Siamese Representation Learning

Preparation

Unsupervised Pre-Training

Linear Classification

Models and Logs

Transferring to Object Detection

License

Owner

Facebook Research

Deep Compression for Dense Point Cloud Maps.

Code and real data for the paper "Counterfactual Temporal Point Processes", available at arXiv.

PyTorch EO aims to make Deep Learning for Earth Observation data easy and accessible to real-world cases and research alike.

[ICME 2021 Oral] CORE-Text: Improving Scene Text Detection with Contrastive Relational Reasoning

Deep learning model, heat map, data prepo

CV backbones including GhostNet, TinyNet and TNT, developed by Huawei Noah's Ark Lab.

A-ESRGAN aims to provide better super-resolution images by using multi-scale attention U-net discriminators.

SSD-based Object Detection in PyTorch

Cross-view Transformers for real-time Map-view Semantic Segmentation (CVPR 2022 Oral)

FairMOT for Multi-Class MOT using YOLOX as Detector

Code and models used in "MUSS Multilingual Unsupervised Sentence Simplification by Mining Paraphrases".

Tensorflow 2.x implementation of Panoramic BlitzNet for object detection and semantic segmentation on indoor panoramic images.

SimplEx - Explaining Latent Representations with a Corpus of Examples

Top #1 Submission code for the first https://alphamev.ai MEV competition with best AUC (0.9893) and MSE (0.0982).

Create UIs for prototyping your machine learning model in 3 minutes

This is an official implementation for "SimMIM: A Simple Framework for Masked Image Modeling".

Official PyTorch implementation of the preprint paper "Stylized Neural Painting", accepted to CVPR 2021.

git《Self-Attention Attribution: Interpreting Information Interactions Inside Transformer》(AAAI 2021) GitHub:

An example of semantic segmentation using tensorflow in eager execution.

A lightweight tool to get an AI Infrastructure Stack up in minutes not days.