ResNEsts and DenseNEsts: Block-based DNN Models with Improved Representation Guarantees

Last update: Oct 02, 2022

Overview

ResNEsts and DenseNEsts: Block-based DNN Models with Improved Representation Guarantees

This repository is the official implementation of the empirical research presented in the supplementary material of the paper, ResNEsts and DenseNEsts: Block-based DNN Models with Improved Representation Guarantees.

Requirements

To install requirements:

pip install -r requirements.txt

Please install Python before running the above setup command. The code was tested on Python 3.8.10.

Create a folder to store all the models and results:

mkdir ckeckpoint

Training

To fully replicate the results below, train all the models by running the following two commands:

./train_cuda0.sh

./train_cuda1.sh

We used two separate scripts because we had two NVIDIA GPUs and we wanted to run two training processes for different models at the same time. If you have more GPUs or resources, you can submit multiple jobs and let them run in parallel.

To train a model with different seeds (initializations), run the command in the following form:

python main.py --data <dataset> --model <DNN_model> --mu <learning_rate>

The above command uses the default seed list. You can also specify your seeds like the following example:

python main.py --data CIFAR10 --model CIFAR10_BNResNEst_ResNet_110 --seed_list 8 9

Run this command to see how to customize your training or hyperparameters:

python main.py --help

Evaluation

To evaluate all trained models on benchmarks reported in the tables below, run:

./eval.sh

To evaluate a model, run:

python eval.py --data  <dataset> --model <DNN_model> --seed_list <seed>

Results

Image Classification on CIFAR-10

Architecture	Standard	ResNEst	BN-ResNEst	A-ResNEst
WRN-16-8	95.58% (11M)	94.47% (11M)	95.49% (11M)	95.29% (8.7M)
WRN-40-4	95.49% (9.0M)	94.64% (9.0M)	95.62% (9.0M)	95.48% (8.4M)
ResNet-110	94.33% (1.7M)	92.62% (1.7M)	94.47% (1.7M)	93.93% (1.7M)
ResNet-20	92.58% (0.27M)	90.98% (0.27M)	92.56% (0.27M)	92.47% (0.24M)

Image Classification on CIFAR-100

Architecture	Standard	ResNEst	BN-ResNEst	A-ResNEst
WRN-16-8	79.14% (11M)	75.42% (11M)	78.98% (11M)	78.74% (8.9M)
WRN-40-4	79.08% (9.0M)	75.16% (9.0M)	78.81% (9.0M)	78.69% (8.7M)
ResNet-110	74.08% (1.7M)	69.08% (1.7M)	74.24% (1.7M)	72.53% (1.9M)
ResNet-20	68.56% (0.28M)	64.73% (0.28M)	68.49% (0.28M)	68.16% (0.27M)

BibTeX

@inproceedings{chen2021resnests,
  title={{ResNEsts} and {DenseNEsts}: Block-based {DNN} Models with Improved Representation Guarantees},
  author={Chen, Kuan-Lin and Lee, Ching-Hua and Garudadri, Harinath and Rao, Bhaskar D.},
  booktitle={Advances in Neural Information Processing Systems (NeurIPS)},
  year={2021}
}

ResNEsts and DenseNEsts: Block-based DNN Models with Improved Representation Guarantees

Related tags

Overview

ResNEsts and DenseNEsts: Block-based DNN Models with Improved Representation Guarantees

Requirements

Training

Evaluation

Results

Image Classification on CIFAR-10

Image Classification on CIFAR-100

BibTeX

Owner

Kuan-Lin (Jason) Chen

PyTorch implementation of probabilistic deep forecast applied to air quality.

Malware Bypass Research using Reinforcement Learning

PSPNet in Chainer

Code artifacts for the submission "Mind the Gap! A Study on the Transferability of Virtual vs Physical-world Testing of Autonomous Driving Systems"

Code and real data for the paper "Counterfactual Temporal Point Processes", available at arXiv.

Dataset and codebase for NeurIPS 2021 paper: Exploring Forensic Dental Identification with Deep Learning

The fundamental package for scientific computing with Python.

ViewFormer: NeRF-free Neural Rendering from Few Images Using Transformers

A novel framework to automatically learn high-quality scanning of non-planar, complex anisotropic appearance.

Semi-supervised Video Deraining with Dynamical Rain Generator (CVPR, 2021, Pytorch)

Codes for [NeurIPS'21] You are caught stealing my winning lottery ticket! Making a lottery ticket claim its ownership.

SLAMP: Stochastic Latent Appearance and Motion Prediction

An interactive DNN Model deployed on web that predicts the chance of heart failure for a patient with an accuracy of 98%

KIDA: Knowledge Inheritance in Data Aggregation

Gym environment for FLIPIT: The Game of "Stealthy Takeover"

MediaPipe is a an open-source framework from Google for building multimodal

Nonnegative spatial factorization for multivariate count data

Split Variational AutoEncoder

Out-of-Domain Human Mesh Reconstruction via Dynamic Bilevel Online Adaptation

The final project of "Applying AI to 3D Medical Imaging Data" from "AI for Healthcare" nanodegree - Udacity.