[ICLR 2021] "Neural Architecture Search on ImageNet in Four GPU Hours: A Theoretically Inspired Perspective" by Wuyang Chen, Xinyu Gong, Zhangyang Wang

Last update: Nov 28, 2022

Overview

Neural Architecture Search on ImageNet in Four GPU Hours: A Theoretically Inspired Perspective [PDF]

Wuyang Chen, Xinyu Gong, Zhangyang Wang

In ICLR 2021.

Overview

We present TE-NAS, the first published training-free neural architecture search method with extremely fast search speed (no gradient descent at all!) and high-quality performance.

Highlights:

Trainig-free and label-free NAS: we achieved extreme fast neural architecture search without a single gradient descent.
Bridging the theory-application gap: We identified two training-free indicators to rank the quality of deep networks: the condition number of their NTKs, and the number of linear regions in their input space.
SOTA: TE-NAS achieved extremely fast search speed (one 1080Ti, 20 minutes on NAS-Bench-201 space / four hours on DARTS space on ImageNet) and maintains competitive accuracy.

Prerequisites

Ubuntu 16.04
Python 3.6.9
CUDA 10.1 (lower versions may work but were not tested)
NVIDIA GPU + CuDNN v7.3

This repository has been tested on GTX 1080Ti. Configurations may need to be changed on different platforms.

Installation

Clone this repo:

git clone https://github.com/chenwydj/TENAS.git
cd TENAS

Install dependencies:

pip install -r requirements.txt

Usage

0. Prepare the dataset

Please follow the guideline here to prepare the CIFAR-10/100 and ImageNet dataset, and also the NAS-Bench-201 database.
Remember to properly set the TORCH_HOME and data_paths in the prune_launch.py.

1. Search

NAS-Bench-201 Space

python prune_launch.py --space nas-bench-201 --dataset cifar10 --gpu 0
python prune_launch.py --space nas-bench-201 --dataset cifar100 --gpu 0
python prune_launch.py --space nas-bench-201 --dataset ImageNet16-120 --gpu 0

DARTS Space (NASNET)

python prune_launch.py --space darts --dataset cifar10 --gpu 0
python prune_launch.py --space darts --dataset imagenet-1k --gpu 0

2. Evaluation

For architectures searched on nas-bench-201, the accuracies are immediately available at the end of search (from the console output).
For architectures searched on darts, please use DARTS_evaluation for training the searched architecture from scratch and evaluation.

Citation

@inproceedings{chen2020tenas,
  title={Neural Architecture Search on ImageNet in Four GPU Hours: A Theoretically Inspired Perspective},
  author={Chen, Wuyang and Gong, Xinyu and Wang, Zhangyang},
  booktitle={International Conference on Learning Representations},
  year={2021}
}

Acknowledgement

Code base from NAS-Bench-201.

[ICLR 2021] "Neural Architecture Search on ImageNet in Four GPU Hours: A Theoretically Inspired Perspective" by Wuyang Chen, Xinyu Gong, Zhangyang Wang

Related tags

Overview

Neural Architecture Search on ImageNet in Four GPU Hours: A Theoretically Inspired Perspective [PDF]

Overview

Prerequisites

Installation

Usage

0. Prepare the dataset

1. Search

NAS-Bench-201 Space

DARTS Space (NASNET)

2. Evaluation

Citation

Acknowledgement

Owner

VITA

[cvpr22] Perturbed and Strict Mean Teachers for Semi-supervised Semantic Segmentation

Python-kafka-reset-consumergroup-offset-example - Python Kafka reset consumergroup offset example

Cross-modal Deep Face Normals with Deactivable Skip Connections

Code for the paper "Can Active Learning Preemptively Mitigate Fairness Issues?" presented at RAI 2021.

Implementation for Learning to Track with Object Permanence

CN24 is a complete semantic segmentation framework using fully convolutional networks

COVINS -- A Framework for Collaborative Visual-Inertial SLAM and Multi-Agent 3D Mapping

A TensorFlow implementation of DeepMind's WaveNet paper

Official code repository for A Simple Long-Tailed Rocognition Baseline via Vision-Language Model.

내가 보려고 정리한 <프로그래밍 기초 Ⅰ> / organized for me

学习 python3 以来写的一些垃圾玩具……

Interpretation of T cell states using reference single-cell atlases

Ascend your Jupyter Notebook usage

TensorFlow implementation for Bayesian Modeling and Uncertainty Quantification for Learning to Optimize: What, Why, and How

This repo is about to create the Streamlit application for given ML model.

Revisiting Contrastive Methods for Unsupervised Learning of Visual Representations. [2021]

Implementation of the paper "Generating Symbolic Reasoning Problems with Transformer GANs"

PyTorch code for EMNLP 2021 paper: Don't be Contradicted with Anything! CI-ToD: Towards Benchmarking Consistency for Task-oriented Dialogue System

Torch-based tool for quantizing high-dimensional vectors using additive codebooks

Byte-based multilingual transformer TTS for low-resource/few-shot language adaptation.