PyTorch implementation of DirectCLR from paper Understanding Dimensional Collapse in Contrastive Self-supervised Learning

Last update: Dec 21, 2022

Related tags

Deep Learning directclr

Overview

DirectCLR

DirectCLR is a simple contrastive learning model for visual representation learning. It does not require a trainable projector as SimCLR. It is able to prevent dimensional collapse and outperform SimCLR with a linear projector.

PyTorch implementation of DirectCLR from paper Understanding Dimensional Collapse in Contrastive Self-supervised Learning.

@article{Jing2021UnderstandingDC,
  title={Understanding Dimensional Collapse in Contrastive Self-supervised Learning},
  author={Li Jing and Pascal Vincent and Yann LeCun and Yuandong Tian},
  journal={arXiv preprint arXiv:2110.09348},
  year={2021}
}

DirectCLR Training

Install PyTorch and download ImageNet by following the instructions in the requirements section of the PyTorch ImageNet training example. The code has been developed for PyTorch version 1.7.1 and torchvision version 0.8.2, but it should work with other versions just as well.

Our best model is obtained by running the following command:

python main.py --data /path/to/imagenet/ --mode directclr --dim 360

Mode can be chosen as:

simclr: standard SimCLR with two layer nonlinear projector;

single: SimCLR with single layer linear projector;

baseline: SimCLR without a projector;

directclr: DirectCLR with single layer linear projector;

Training time is approximately 7 hours on 32 v100 GPUs.

Evaluation: Linear Classification

Train a linear probe on the representations. Freeze the weights of the resnet and use the entire ImageNet training set.

python linear_probe.py /path/to/imagenet/ /path/to/checkpoint/resnet50.pth

Linear probe time is approximately 20 hours on 8 v100 GPUs.

License

This project is under the CC-BY-NC 4.0 license. See LICENSE for details.

PyTorch implementation of DirectCLR from paper Understanding Dimensional Collapse in Contrastive Self-supervised Learning

Related tags

Overview

DirectCLR

DirectCLR Training

Evaluation: Linear Classification

License

Owner

Meta Research

Neural network pruning for finding a sparse computational model for controlling a biological motor task.

Hierarchical Motion Encoder-Decoder Network for Trajectory Forecasting (HMNet)

Joint project of the duo Hacker Ninjas

Code release for BlockGAN: Learning 3D Object-aware Scene Representations from Unlabelled Images

A new play-and-plug method of controlling an existing generative model with conditioning attributes and their compositions.

High performance Cross-platform Inference-engine, you could run Anakin on x86-cpu,arm, nv-gpu, amd-gpu,bitmain and cambricon devices.

The Wearables Development Toolkit - a development environment for activity recognition applications with sensor signals

Measure WWjj polarization fraction

Fuzzer for Linux Kernel Drivers

Tackling Obstacle Tower Challenge using PPO & A2C combined with ICM.

ROCKET: Exceptionally fast and accurate time series classification using random convolutional kernels

Exe-to-xlsm - Simple script to create VBscript of exe and inject to xlsm

STMTrack: Template-free Visual Tracking with Space-time Memory Networks

[CVPR 2021] Rethinking Semantic Segmentation from a Sequence-to-Sequence Perspective with Transformers

Linear Variational State Space Filters

VID-Fusion: Robust Visual-Inertial-Dynamics Odometry for Accurate External Force Estimation

[CVPR 2022] Structured Sparse R-CNN for Direct Scene Graph Generation

Continuous Diffusion Graph Neural Network

Tello Drone Trajectory Tracking

Official PyTorch implementation of UACANet: Uncertainty Aware Context Attention for Polyp Segmentation