code for "AttentiveNAS Improving Neural Architecture Search via Attentive Sampling"

Last update: Oct 26, 2022

Related tags

Overview

AttentiveNAS: Improving Neural Architecture Search via Attentive Sampling

This repository contains PyTorch evaluation code, training code and pretrained models for AttentiveNAS.

For details see AttentiveNAS: Improving Neural Architecture Search via Attentive Sampling by Dilin Wang, Meng Li, Chengyue Gong and Vikas Chandra.

If you find this project useful in your research, please consider cite:

@article{wang2020attentivenas,
  title={AttentiveNAS: Improving Neural Architecture Search via Attentive Sampling},
  author={Wang, Dilin and Li, Meng and Gong, Chengyue and Chandra, Vikas},
  journal={arXiv preprint arXiv:2011.09011},
  year={2020}
}

Pretrained models and data

Download our pretrained AttentiveNAS models and a (sub-network, FLOPs) lookup table from Google Drive and put them under folder ./attentive_nas_data

Evaluation

To evaluate our pre-trained AttentiveNAS models, from AttentiveNAS-A0 to A6, on ImageNet val with a single GPU, run:

python test_attentive_nas.py --config-file ./configs/eval_attentive_nas_models.yml --model a[0-6]

Expected results:

Name	MFLOPs	Top-1 (%)
AttentiveNAS-A0	203	77.3
AttentiveNAS-A1	279	78.4
AttentiveNAS-A2	317	78.8
AttentiveNAS-A3	357	79.1
AttentiveNAS-A4	444	79.8
AttentiveNAS-A5	491	80.1
AttentiveNAS-A6	709	80.7

Training

To train our AttentiveNAS models from scratch, run

python train_supernet.py --config-file configs/train_attentive_nas_models.yml --machine-rank ${machine_rank} --num-machines ${num_machines} --dist-url ${dist_url}

We adopt SGD training on 64 GPUs. The mini-batch size is 32 per GPU; all training hyper-parameters are specified in train_attentive_nas_models.yml.

License

The majority of AttentiveNAS is licensed under CC-BY-NC, however portions of the project are available under separate license terms: Once For All is licensed under the Apache 2.0 license.

Contributing

We actively welcome your pull requests! Please see CONTRIBUTING and CODE_OF_CONDUCT for more info.

code for "AttentiveNAS Improving Neural Architecture Search via Attentive Sampling"

Related tags

Overview

AttentiveNAS: Improving Neural Architecture Search via Attentive Sampling

Pretrained models and data

Evaluation

Training

License

Contributing

Owner

Facebook Research

KLUE-baseline contains the baseline code for the Korean Language Understanding Evaluation (KLUE) benchmark.

Use fastai-v2 with HuggingFace's pretrained transformers

GAP-text2SQL: Learning Contextual Representations for Semantic Parsing with Generation-Augmented Pre-Training

PyTorch Implementation of VAENAR-TTS: Variational Auto-Encoder based Non-AutoRegressive Text-to-Speech Synthesis.

NLP-based analysis of poor Chinese movie reviews on Douban

Production First and Production Ready End-to-End Keyword Spotting Toolkit

Knowledge Management for Humans using Machine Learning & Tags

This repository contains the codes for LipGAN. LipGAN was published as a part of the paper titled "Towards Automatic Face-to-Face Translation".

Artificial Conversational Entity for queries in Eulogio "Amang" Rodriguez Institute of Science and Technology (EARIST)

Implementation of Fast Transformer in Pytorch

Toward Model Interpretability in Medical NLP

Phomber is infomation grathering tool that reverse search phone numbers and get their details, written in python3.

A design of MIDI language for music generation task, specifically for Natural Language Processing (NLP) models.

SimBERT升级版（SimBERTv2）！

Idea is to build a model which will take keywords as inputs and generate sentences as outputs.

NeoDays-based tileset for the roguelike CDDA (Cataclysm Dark Days Ahead)

Prompt tuning toolkit for GPT-2 and GPT-Neo

Faster, modernized fork of the language identification tool langid.py

Share constant definitions between programming languages and make your constants constant again

[NeurIPS 2021] Code for Learning Signal-Agnostic Manifolds of Neural Fields