"Moshpit SGD: Communication-Efficient Decentralized Training on Heterogeneous Unreliable Devices", official implementation

Last update: Oct 18, 2022

Overview

Moshpit SGD: Communication-Efficient Decentralized Training on Heterogeneous Unreliable Devices

This repository contains the official PyTorch implementation of experiments for the paper.

Note (05.03.2021): as of now, this repository contains only a minimal (largely untested) version of the implementation. We intend to make the training code more robust and to add tested code for more experiments (including image classification) in the coming months. In the meantime, feel free to create an issue or contact us by email if you run into any problems.

Setup

To run the code in this repository, you need Python 3.8+ and PyTorch 1.7. Install the remaining dependencies by running pip install -r requirements.txt.
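Before launching an experiment, it can help to confirm that the environment matches these requirements. The snippet below is a minimal sketch and is not part of the repository: the helper names parse_version and check_environment are hypothetical, and reading "PyTorch 1.7" as "any 1.7.x release" is an assumption.

```python
import re
import sys


def parse_version(text):
    """Return the leading numeric components, e.g. '1.7.1+cu110' -> (1, 7, 1)."""
    parts = []
    # Drop a local build suffix such as '+cu110' before splitting on dots.
    for piece in text.split("+")[0].split("."):
        match = re.match(r"\d+", piece)
        if match is None:
            break
        parts.append(int(match.group()))
    return tuple(parts)


def check_environment(python_version=None, torch_version=None):
    """Return a list of problems; an empty list means the check passed.

    Both arguments are optional overrides (hypothetical, mainly for testing);
    by default the running interpreter and the installed torch package are inspected.
    """
    problems = []
    if python_version is None:
        python_version = sys.version_info[:2]
    if tuple(python_version[:2]) < (3, 8):
        problems.append(f"Python 3.8+ is required, found {python_version[0]}.{python_version[1]}")
    if torch_version is None:
        try:
            from importlib import metadata  # standard library since Python 3.8
            torch_version = metadata.version("torch")
        except Exception:
            problems.append("could not determine the installed PyTorch version")
            return problems
    # Assumption: any 1.7.x release satisfies the "PyTorch 1.7" requirement.
    if parse_version(torch_version)[:2] != (1, 7):
        problems.append(f"PyTorch 1.7 is expected, found {torch_version}")
    return problems


if __name__ == "__main__":
    for problem in check_environment():
        print("WARNING:", problem)
```

Running the script prints one warning per mismatch and prints nothing when the environment matches.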

Experiments

The first experiment is a self-contained Jupyter notebook; for the other two experiments, refer to README.md in their respective directories.

References

@misc{ryabinin2021moshpit,
      title={Moshpit SGD: Communication-Efficient Decentralized Training on Heterogeneous Unreliable Devices}, 
      author={Max Ryabinin and Eduard Gorbunov and Vsevolod Plokhotnyuk and Gennady Pekhimenko},
      year={2021},
      eprint={2103.03239},
      archivePrefix={arXiv},
      primaryClass={cs.LG}
}

Owner

Yandex Research
