MicroNet: Improving Image Recognition with Extremely Low FLOPs (ICCV 2021)

A pytorch implementation of MicroNet. If you use this code in your research please consider citing

@article{li2021micronet, title={MicroNet: Improving Image Recognition with Extremely Low FLOPs}, author={Li, Yunsheng and Chen, Yinpeng and Dai, Xiyang and Chen, Dongdong and Liu, Mengchen and Yuan, Lu and Liu, Zicheng and Zhang, Lei and Vasconcelos, Nuno}, journal={arXiv preprint arXiv:2108.05894}, year={2021} }

Requirements

Linux or macOS with Python ≥ 3.6.
Anaconda3, PyTorch ≥ 1.5 with matched torchvision

Models

Model	#Param	MAdds	Top-1	download
MicroNet-M3	2.6M	21M	62.5	model
MicroNet-M2	2.4M	12M	59.4	model
MicroNet-M1	1.8M	6M	51.4	model
MicroNet-M0	1.0M	4M	46.6	model

Evaluate MicroNet on ImageNet

Download the pretrained MicroNet M0-M3 with the link above. The scripts used for evaluation can be found here. For example, if you want to test MicroNet-M3, you can use the following command.

sh scripts/eval_micronet_m3.sh /path/to/imagenet /path/to/output /path/to/pretrained_model

Train MicroNet on ImageNet

The scripts used for training MicroNet M0-M3 can be found here and can be implemented as follows (You can choose to use different scripts for 2 gpu or 4 gpu training based on the resources you can access).

sh scripts/train_micronet_m3_4gpu.sh /path/to/imagenet /path/to/output

MicroNet: Improving Image Recognition with Extremely Low FLOPs (ICCV 2021)

Related tags

Overview

MicroNet: Improving Image Recognition with Extremely Low FLOPs (ICCV 2021)

Requirements

Models

Evaluate MicroNet on ImageNet

Train MicroNet on ImageNet

Owner

Yunsheng Li

DataCLUE: 国内首个以数据为中心的AI测评（含模型分析报告）

Code repo for "Transformer on a Diet" paper

Estimating and Exploiting the Aleatoric Uncertainty in Surface Normal Estimation

DetCo: Unsupervised Contrastive Learning for Object Detection

Official implementation for NIPS'17 paper: PredRNN: Recurrent Neural Networks for Predictive Learning Using Spatiotemporal LSTMs.

Implementation of "Bidirectional Projection Network for Cross Dimension Scene Understanding" CVPR 2021 (Oral)

Source code to accompany Defunctland's video "FASTPASS: A Complicated Legacy"

Tensorflow implementation for Self-supervised Graph Learning for Recommendation

Codes for the compilation and visualization examples to the HIF vegetation dataset

A nutritional label for food for thought.

Image Captioning on google cloud platform based on iot

Official implementation of ACMMM'20 paper 'Self-supervised Video Representation Learning Using Inter-intra Contrastive Framework'

Sandbox for training deep learning networks

Autonomous Driving on Curvy Roads without Reliance on Frenet Frame: A Cartesian-based Trajectory Planning Method

Code for "Finding Regions of Heterogeneity in Decision-Making via Expected Conditional Covariance" at NeurIPS 2021

Phonetic PosteriorGram (PPG)-Based Voice Conversion (VC)

A U-Net combined with a variational auto-encoder that is able to learn conditional distributions over semantic segmentations.

Addon and nodes for working with structural biology and molecular data in Blender.

This is an official implementation of CvT: Introducing Convolutions to Vision Transformers.

Rule-based Customer Segmentation