[CVPRW 21] "BNN - BN = ? Training Binary Neural Networks without Batch Normalization", Tianlong Chen, Zhenyu Zhang, Xu Ouyang, Zechun Liu, Zhiqiang Shen, Zhangyang Wang

Last update: Dec 30, 2022

Overview

BNN - BN = ? Training Binary Neural Networks without Batch Normalization

Codes for this paper BNN - BN = ? Training Binary Neural Networks without Batch Normalization. [CVPR BiVision Workshop 2021]

Tianlong Chen, Zhenyu Zhang, Xu Ouyang, Zechun Liu, Zhiqiang Shen, Zhangyang Wang.

Overview

Batch normalization (BN) is a key facilitator and considered essential for state-of-the-art binary neural networks (BNN). However, the BN layer is costly to calculate and is typically implemented with non-binary parameters, leaving a hurdle for the efficient implementation of BNN training. It also introduces undesirable dependence between samples within each batch.

Inspired by the latest advance on Batch Normalization Free (BN-Free) training, we extend their framework to training BNNs, and for the first time demonstrate that BNs can be completely removed from BNN training and inference regimes. By plugging in and customizing techniques including adaptive gradient clipping, scale weight standardization, and specialized bottleneck block, a BN-free BNN is capable of maintaining competitive accuracy compared to its BN-based counterpart. Experimental results can be found in our paper.

BN-Free Binary Neural Networks

Reproduce

Environment

pytorch == 1.5.0
torchvision == 0.6.0
timm == 0.4.5

Training on ImageNet

./script/imagenet_reactnet_A_bf.sh (BN-Free ReActNet-A)
./script/imagenet_reactnet_A_bn.sh (with BN ReActNet-A)
./script/imagenet_reactnet_A_none.sh (without BN ReActNet-A)

Citation

@article{gaur2020training,
  title={Training Deep Neural Networks Without Batch Normalization},
  author={Gaur, Divya and Folz, Joachim and Dengel, Andreas},
  journal={arXiv preprint arXiv:2008.07970},
  year={2020}
}

Acknowledgement

https://github.com/liuzechun/ReActNet

https://github.com/liuzechun/Bi-Real-net

https://github.com/vballoli/nfnets-pytorch

https://github.com/deepmind/deepmind-research/tree/master/nfnets

[CVPRW 21] "BNN - BN = ? Training Binary Neural Networks without Batch Normalization", Tianlong Chen, Zhenyu Zhang, Xu Ouyang, Zechun Liu, Zhiqiang Shen, Zhangyang Wang

Related tags

Overview

BNN - BN = ? Training Binary Neural Networks without Batch Normalization

Overview

BN-Free Binary Neural Networks

Reproduce

Environment

Training on ImageNet

Citation

Acknowledgement

Owner

VITA

PyExplainer: A Local Rule-Based Model-Agnostic Technique (Explainable AI)

Pytorch Implementation of Interaction Networks for Learning about Objects, Relations and Physics

A fast python implementation of Ray Tracing in One Weekend using python and Taichi

Official codebase for "B-Pref: Benchmarking Preference-BasedReinforcement Learning" contains scripts to reproduce experiments.

Codes for NeurIPS 2021 paper "Adversarial Neuron Pruning Purifies Backdoored Deep Models"

Image reconstruction done with untrained neural networks.

Source Code and data for my paper titled Linguistic Knowledge in Data Augmentation for Natural Language Processing: An Example on Chinese Question Matching

This is the official pytorch implementation for our ICCV 2021 paper "TRAR: Routing the Attention Spans in Transformers for Visual Question Answering" on VQA Task

Code for the paper "How Attentive are Graph Attention Networks?"

Python code for the paper How to scale hyperparameters for quickshift image segmentation

Convert dog pictures into various painting styles. Try LimnPet

Implementations of LSTM: A Search Space Odyssey variants and their training results on the PTB dataset.

This repository contains the code for: RerrFact model for SciVer shared task

Implicit MLE: Backpropagating Through Discrete Exponential Family Distributions

Video Frame Interpolation with Transformer (CVPR2022)

We will see a basic program that is basically a hint to brute force attack to crack passwords. In other words, we will make a program to Crack Any Password Using Python. Show some ❤️ by starring this repository!

Construct a neural network frame by Numpy

A robust camera and Lidar fusion based velocity estimator to undistort the pointcloud.

Spatial-Temporal Transformer for Dynamic Scene Graph Generation, ICCV2021

A Python library for adversarial machine learning focusing on benchmarking adversarial robustness.