Code to reprudece NeurIPS paper: Accelerated Sparse Neural Training: A Provable and Efficient Method to Find N:M Transposable Masks

Last update: Feb 23, 2022

Overview

Accelerated Sparse Neural Training: A Provable and Efficient Method to FindN:M Transposable Masks

Recently, researchers proposed pruning deep neural network weights (DNNs) using an $N:M$ fine-grained block sparsity mask. In this mask, for each block of M weights, we have at least N zeros. In contrast to unstructured sparsity, N:M fine-grained block sparsity allows acceleration in actual modern hardware. Previously suggested solutions enabled DNN acceleration at the inference phase. To also allow such acceleration in the training phase, we suggest a novel transposable-fine-grained sparsity mask where the same mask can be used for both forward and backward passes. Our transposable mask ensures that both the weight matrix and its transpose follow the same sparsity pattern; thus the matrix multiplication required for passing the error backward can also be accelerated. We discuss the transposable constraint and devise a new measure for mask constraints, called mask-diversity (MD), which correlates with their expected accuracy. Lastly, we formulate the problem of finding the optimal transposable mask as a minimum-cost-flow problem and suggest a fast linear approximation that can be used when the masks dynamically change while training. Our experiments suggest 2x speed-up with no accuracy degradation over vision and language models. A reference implementation is available in the supplementary material.

Reproducing the results

This repository is partially based on convNet.pytorch repo. please ensure that you are using pytorch 1.7+. Reproducing AdaPrune results

cd AdaPrune
sh scripts/adaprune_dense_bnt.sh
sh scripts/adaprune_sparse.sh

Reproducing static NM-transposable starting from dense pre-trained model:

cd static_TNM
sh scripts/prune_pretrained_R50.sh

Reproducing dynamic NM-transposable from scratch:

cd dynamic_TNM
sh scripts/clone_and_copy.sh
sh scripts/run_R18.sh
sh scripts/run_R50.sh

Code to reprudece NeurIPS paper: Accelerated Sparse Neural Training: A Provable and Efficient Method to Find N:M Transposable Masks

Related tags

Overview

Accelerated Sparse Neural Training: A Provable and Efficient Method to FindN:M Transposable Masks

Reproducing the results

Owner

itay hubara

BERT Attention Analysis

Azure Text-to-speech service for Home Assistant

Textlesslib - Library for Textless Spoken Language Processing

A very simple framework for state-of-the-art Natural Language Processing (NLP)

硕士期间自学的NLP子任务，供学习参考

A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)

Transformer training code for sequential tasks

ProtFeat is protein feature extraction tool that utilizes POSSUM and iFeature.

Code for the ACL 2021 paper "Structural Guidance for Transformer Language Models"

This is the writeup of all the challenges from Advent-of-cyber-2019 of TryHackMe

Code for text augmentation method leveraging large-scale language models

Espial is an engine for automated organization and discovery of personal knowledge

A number of methods in order to perform Natural Language Processing on live data derived from Twitter

Code for Emergent Translation in Multi-Agent Communication

Simple Python script to scrape youtube channles of "Parity Technologies and Web3 Foundation" and translate them to well-known braille language or any language

2021 2학기 데이터크롤링 기말프로젝트

A 10000+ hours dataset for Chinese speech recognition

Predict an emoji that is associated with a text

Experiments in converting wikidata to ftm

Web Scraping, Document Deduplication & GPT-2 Fine-tuning with a newly created scam dataset.