Learning recognition/segmentation models without end-to-end training. 40%-60% less GPU memory footprint. Same training time. Better performance.

Last update: Dec 27, 2022

Related tags

Deep Learning InfoPro-Pytorch

Overview

InfoPro-Pytorch

The Information Propagation algorithm for training deep networks with local supervision.

(ICLR 2021) Revisiting Locally Supervised Learning: an Alternative to End-to-end Training

Update on 2021/01/25: Release Pre-trained models on ImageNet and Cityscapes.

Update on 2021/01/24: Release Code for Image Classification on CIFAR/SVHN/STL10/ImageNet and Semantic Segmentation on Cityscapes.

Introduction

We propose Information Propagation (InfoPro), a locally supervised deep learning algorithm, from the information-theoretic perspective. By splitting the whole deep network into multiple local modules and training them with local InfoPro loss, we reduce the GPU memory footprint by 40-60% without introducing notable extra computational cost or training time, but improve the performance moderately.

Citation

If you find this work valuable or use our code in your own research, please consider citing us with the following bibtex:

@inproceedings{wang2021revisiting,
        title = {Revisiting Locally Supervised Learning: an Alternative to End-to-end Training},
       author = {Yulin Wang and Zanlin Ni and Shiji Song and Le Yang and Gao Huang},
    booktitle = {International Conference on Learning Representations (ICLR)},
         year = {2021},
          url = {https://openreview.net/forum?id=fAbkE6ant2}
}

Get Started

Please go to the folder Experiments on CIFAR-SVHN-STL10, Experiments on ImageNet and Semantic segmentation for specific docs.

Results

CIFAR & STL-10

ImageNet

Semantic Segmentation

GPU Memory Cost

In the paper, we report the minimally required GPU memory to run the InfoPro* algorithm with torch.backends.cudnn.benchmark=True (for practical acceleration). Note that this result is (sometimes largely) different from what is printed by nvidia-smi.

Contact

This repo is a re-implementation of our original code. If you have any question, please feel free to contact the authors. Yulin Wang: [email protected].

Acknowledgments

Our code of Semantic Segmentation is from MMSegmentation. We highly appreciate their awesome work!

Learning recognition/segmentation models without end-to-end training. 40%-60% less GPU memory footprint. Same training time. Better performance.

Related tags

Overview

InfoPro-Pytorch

Introduction

Citation

Get Started

Results

GPU Memory Cost

Contact

Acknowledgments

Owner

Cleaned up code for DSTC 10: SIMMC 2.0 track: subtask 2: multimodal coreference resolution

Gesture-Volume-Control - This Python program can adjust the system's volume by using hand gestures

Model Agnostic Interpretability for Multiple Instance Learning

The official PyTorch implementation for NCSNv2 (NeurIPS 2020)

Deep Semisupervised Multiview Learning With Increasing Views (IEEE TCYB 2021, PyTorch Code)

Aligning Latent and Image Spaces to Connect the Unconnectable

Learning Correspondence from the Cycle-consistency of Time (CVPR 2019)

A novel pipeline framework for multi-hop complex KGQA task. About the paper title: Improving Multi-hop Embedded Knowledge Graph Question Answering by Introducing Relational Chain Reasoning

Mmdet benchmark with python

Using the provided dataset which includes various book features, in order to predict the price of books, using various proposed methods and models.

Repo 4 basic seminar §How to make human machine readable"

Nonuniform-to-Uniform Quantization: Towards Accurate Quantization via Generalized Straight-Through Estimation. In CVPR 2022.

Easily pull telemetry data and create beautiful visualizations for analysis.

A Flow-based Generative Network for Speech Synthesis

Commonsense Ability Tests

Official implementation of "StyleCariGAN: Caricature Generation via StyleGAN Feature Map Modulation" (SIGGRAPH 2021)

Advancing Self-supervised Monocular Depth Learning with Sparse LiDAR

Scales, Chords, and Cadences: Practical Music Theory for MIR Researchers

A Deep Learning Based Knowledge Extraction Toolkit for Knowledge Base Population

Computing Shapley values using VAEAC