Train the HRNet model on ImageNet

Last update: Jan 04, 2023

Overview

High-resolution networks (HRNets) for Image classification

News

[2021/01/20] Add some stronger ImageNet pretrained models, e.g., the HRNet_W48_C_ssld_pretrained.pth achieved top-1 acc 83.6%.
[2020/03/13] Our paper is accepted by TPAMI: Deep High-Resolution Representation Learning for Visual Recognition.
Per request, we provide two small HRNet models. #parameters and GFLOPs are similar to ResNet18. The segmentation resutls using the two small models are also available at https://github.com/HRNet/HRNet-Semantic-Segmentation.
TensoFlow implemenation available at https://github.com/yuanyuanli85/tf-hrnet. Thanks VictorLi!
ONNX export enabled after fixing issues. Thanks Baowen Bao!

Introduction

This is the official code of high-resolution representations for ImageNet classification. We augment the HRNet with a classification head shown in the figure below. First, the four-resolution feature maps are fed into a bottleneck and the number of output channels are increased to 128, 256, 512, and 1024, respectively. Then, we downsample the high-resolution representations by a 2-strided 3x3 convolution outputting 256 channels and add them to the representations of the second-high-resolution representations. This process is repeated two times to get 1024 channels over the small resolution. Last, we transform 1024 channels to 2048 channels through a 1x1 convolution, followed by a global average pooling operation. The output 2048-dimensional representation is fed into the classifier.

ImageNet pretrained models

HRNetV2 ImageNet pretrained models are now available!

model	#Params	GFLOPs	top-1 error	top-5 error	Link
HRNet-W18-C-Small-v1	13.2M	1.49	27.7%	9.3%	OneDrive/BaiduYun(Access Code:v3sw)
HRNet-W18-C-Small-v2	15.6M	2.42	24.9%	7.6%	OneDrive/BaiduYun(Access Code:bnc9)
HRNet-W18-C	21.3M	3.99	23.2%	6.6%	OneDrive/BaiduYun(Access Code:r5xn)
HRNet-W30-C	37.7M	7.55	21.8%	5.8%	OneDrive/BaiduYun(Access Code:ajc1)
HRNet-W32-C	41.2M	8.31	21.5%	5.8%	OneDrive/BaiduYun(Access Code:itc1)
HRNet-W40-C	57.6M	11.8	21.1%	5.5%	OneDrive/BaiduYun(Access Code:i58x)
HRNet-W44-C	67.1M	13.9	21.1%	5.6%	OneDrive/BaiduYun(Access Code:3imd)
HRNet-W48-C	77.5M	16.1	20.7%	5.5%	OneDrive/BaiduYun(Access Code:68g2)
HRNet-W64-C	128.1M	26.9	20.5%	5.4%	OneDrive/BaiduYun(Access Code:6kw4)

Newly added checkpoints:

model	#Params	GFLOPs	top-1 error	Link
HRNet-W18-C (w/ CosineLR + CutMix + 300epochs)	21.3M	3.99	22.1%	Link
HRNet-W48-C (w/ CosineLR + CutMix + 300epochs)	77.5M	16.1	18.9%	Link
HRNet-W18-C-ssld (converted from PaddlePaddle)	21.3M	3.99	18.8%	Link
HRNet-W48-C-ssld (converted from PaddlePaddle)	77.5M	16.1	16.4%	Link

In the above Table, the first 2 checkpoints are trained with CosineLR, CutMix data augmentation and for longer epochs, i.e., 300epochs. The other two checkpoints are converted from PaddleClas. Please refer to SSLD tutorial for more details.

Quick start

Install

Install PyTorch=0.4.1 following the official instructions
git clone https://github.com/HRNet/HRNet-Image-Classification
Install dependencies: pip install -r requirements.txt

Data preparation

You can follow the Pytorch implementation: https://github.com/pytorch/examples/tree/master/imagenet

The data should be under ./data/imagenet/images/.

Train and test

Please specify the configuration file.

For example, train the HRNet-W18 on ImageNet with a batch size of 128 on 4 GPUs:

python tools/train.py --cfg experiments/cls_hrnet_w18_sgd_lr5e-2_wd1e-4_bs32_x100.yaml

For example, test the HRNet-W18 on ImageNet on 4 GPUs:

python tools/valid.py --cfg experiments/cls_hrnet_w18_sgd_lr5e-2_wd1e-4_bs32_x100.yaml --testModel hrnetv2_w18_imagenet_pretrained.pth

Other applications of HRNet

Citation

If you find this work or code is helpful in your research, please cite:

@inproceedings{SunXLW19,
  title={Deep High-Resolution Representation Learning for Human Pose Estimation},
  author={Ke Sun and Bin Xiao and Dong Liu and Jingdong Wang},
  booktitle={CVPR},
  year={2019}
}

@article{WangSCJDZLMTWLX19,
  title={Deep High-Resolution Representation Learning for Visual Recognition},
  author={Jingdong Wang and Ke Sun and Tianheng Cheng and 
          Borui Jiang and Chaorui Deng and Yang Zhao and Dong Liu and Yadong Mu and 
          Mingkui Tan and Xinggang Wang and Wenyu Liu and Bin Xiao},
  journal   = {TPAMI}
  year={2019}
}

Reference

[1] Deep High-Resolution Representation Learning for Visual Recognition. Jingdong Wang, Ke Sun, Tianheng Cheng, Borui Jiang, Chaorui Deng, Yang Zhao, Dong Liu, Yadong Mu, Mingkui Tan, Xinggang Wang, Wenyu Liu, Bin Xiao. Accepted by TPAMI. download

Comments

when will you release the pretrain models? the onedrive links are all broken?

when will you release the pretrain models? the onedrive links are all broken? or can you release the download links in google drive or BaiduYun? thanks

opened by nemonameless 2
Bump opencv-python from 3.4.1.15 to 3.4.7.28
Bumps opencv-python from 3.4.1.15 to 3.4.7.28.

Release notes

Sourced from opencv-python's releases.

3.4.7.28

OpenCV version 3.4.7.

3.4.6.27

OpenCV version 3.4.6.

3.4.5.20

OpenCV version 3.4.5.

Once some build issues are solved, next releases will be targeting OpenCV version 4.

opencv-python: https://pypi.org/project/opencv-python/

opencv-contrib-python: https://pypi.org/project/opencv-contrib-python/

opencv-python-headless: https://pypi.org/project/opencv-python-headless/

opencv-contrib-python-headless: https://pypi.org/project/opencv-contrib-python-headless/

3.4.4.19

opencv-python: https://pypi.org/project/opencv-python/

opencv-contrib-python: https://pypi.org/project/opencv-contrib-python/

opencv-python-headless: https://pypi.org/project/opencv-python-headless/

opencv-contrib-python-headless: https://pypi.org/project/opencv-contrib-python-headless/

OpenCV version 3.4.4.

Thanks to Ivan Pozdeev for following fixes and enhancements: #135, #136, #141, #144, #145, #146, #147, #149, #150

3.4.3.18

opencv-python: https://pypi.org/project/opencv-python/

opencv-contrib-python: https://pypi.org/project/opencv-contrib-python/

opencv-python-headless: https://pypi.org/project/opencv-python-headless/

opencv-contrib-python-headless: https://pypi.org/project/opencv-contrib-python-headless/

OpenCV version 3.4.3.

3.4.2.17

opencv-python: https://pypi.org/project/opencv-python/

opencv-contrib-python: https://pypi.org/project/opencv-contrib-python/

opencv-python-headless: https://pypi.org/project/opencv-python-headless/

opencv-contrib-python-headless: https://pypi.org/project/opencv-contrib-python-headless/

Same as 3.4.2.16 but includes missing x86_64 Linux wheels. Thanks to Krassimir Valev for fixing the build matrix.

3.4.2.16

opencv-python: https://pypi.org/project/opencv-python/

opencv-contrib-python: https://pypi.org/project/opencv-contrib-python/

opencv-python-headless: https://pypi.org/project/opencv-python-headless/

opencv-contrib-python-headless: https://pypi.org/project/opencv-contrib-python-headless/

This release bumps OpenCV version to 3.4.2 and adds support for Python 3.7.

... (truncated)

Commits

See full diff in compare view

Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting @dependabot rebase.

Dependabot commands and options

You can trigger Dependabot actions by commenting on this PR:

@dependabot rebase will rebase this PR

@dependabot recreate will recreate this PR, overwriting any edits that have been made to it

@dependabot merge will merge this PR after your CI passes on it

@dependabot squash and merge will squash and merge this PR after your CI passes on it

@dependabot cancel merge will cancel a previously requested merge and block automerging

@dependabot reopen will reopen this PR if it is closed

@dependabot close will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually

@dependabot ignore this major version will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself)

@dependabot ignore this minor version will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself)

@dependabot ignore this dependency will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself)

@dependabot use these labels will set the current labels as the default for future PRs for this repo and language

@dependabot use these reviewers will set the current reviewers as the default for future PRs for this repo and language

@dependabot use these assignees will set the current assignees as the default for future PRs for this repo and language

@dependabot use this milestone will set the current milestone as the default for future PRs for this repo and language

You can disable automated security fix PRs for this repo from the Security Alerts page.

dependencies
opened by dependabot[bot] 1
Could someone help to put the pretrained models in google drive?

I am interested in the following models: HRNETV2_W18: "./pretrained_models/hrnetv2_w18_imagenet_pretrained.pth" HRNETV2_W32: "./pretrained_models/hrnetv2_w32_imagenet_pretrained.pth" HRNETV2_W48: "./pretrained_models/hrnetv2_w48_imagenet_pretrained.pth"

I am not able to download them from baidu. I wonder if someone can help to put them in google drive. Many thanks.

opened by MaitaYuki 1
Training Custom Dataset

I have been trying to find the format in which I can train RPC Dataset with the HR-Net and do evaluation. It is COCO format. I am unable to use it in Tensor or Pytorch version of the code. The only support that is given is for Imagenet and that too doesnt help.

opened by singlautsav 0
Enable ONNX export

Related issue in PyTorch https://github.com/pytorch/pytorch/issues/23474#issuecomment-522748992

This PR is for enalbing ONNX export for this model. ONNX does not support average pool with dynamic kernel size. Since the kernel size in this case is always the same as input spatial size, the average pool operator can be replaced with mean.

Here I used the flag torch._C._get_tracing_state():, such that the change only takes effect when users are exporting the model to ONNX. No changes to normal usages such as model training/validating.

opened by BowenBao 0
The differences between HRNet-W18-C-Small-v1 and HRNet-W18-C-Small-v2

Hi, authors! I'm wondering about the differences between HRNet-W18-C-Small-v1 and HRNet-W18-C-Small-v2. I'd appreciate it if you could point them out!

opened by HankYe 0
How To Perform Inference

I’ve been able to train my model, and perform validation, however, I cannot find a way to do inference. Even in validation, while it tells me the percentage it got wrong, I could not find any file or log that tells me which ones it got wrong. I’ve searched through the entire repo, and haven’t found a way to perform inference.

With that, I would like to ask the obvious question of how to perform inference and use the model.

opened by Trainmaster9977 0
BRANCHES instead of RANCHES

In cls_hrnet_w18_small_v2_sgd_lr5e-2_wd1e-4_bs32_x100.yaml STAGE1 configuration should read NUM_BRANCHES instead of NUM_RANCHES. In fact this doesn't affect in anything the code since .make_stage instead called for stage1; however, just to be consistent it be good to change it.

opened by DiegoEPaez 0

Releases(PretrainedWeights)

PretrainedWeights(Jan 20, 2021)
HRNet_W18_C_ssld_pretrained.pth with Top-1 Acc 81.2% on ImageNet.

HRNet_W48_C_ssld_pretrained.pth with Top-1 Acc 83.6% on ImageNet.

HRNet_W18_C_cosinelr_cutmix_300epoch.pth.tar with Top-1 Acc 78% on ImageNet.

HRNet_W48_C_cosinelr_cutmix_300epoch.pth.tar with Top-1 Acc 81.1% on ImageNet.

HRNet-W18-C with Top-1 Acc 76.8% on ImageNet.

HRNet-W48-C with Top-1 Acc 79.3% on ImageNet.

Source code(tar.gz)
Source code(zip)
HRNet_W18_C_cosinelr_cutmix_300epoch.pth.tar(82.11 MB)
HRNet_W18_C_pretrained.pth(81.78 MB)
HRNet_W18_C_ssld_pretrained.pth(81.68 MB)
HRNet_W48_C_cosinelr_cutmix_300epoch.pth.tar(296.60 MB)
HRNet_W48_C_pretrained.pth(296.25 MB)
HRNet_W48_C_ssld_pretrained.pth(296.14 MB)

Owner

HRNet

Code for pose estimation is available at https://github.com/leoxiaobin/deep-high-resolution-net.pytorch

GitHub Repository https://jingdongwang2017.github.io/Projects/HRNet/

Embracing Single Stride 3D Object Detector with Sparse Transformer

SST: Single-stride Sparse Transformer This is the official implementation of paper: Embracing Single Stride 3D Object Detector with Sparse Transformer

385 Dec 28, 2022

[NeurIPS2021] Code Release of K-Net: Towards Unified Image Segmentation

K-Net: Towards Unified Image Segmentation Introduction This is an official release of the paper K-Net:Towards Unified Image Segmentation. K-Net will a

423 Jan 02, 2023

A PyTorch Implementation of Single Shot MultiBox Detector

SSD: Single Shot MultiBox Object Detector, in PyTorch A PyTorch implementation of Single Shot MultiBox Detector from the 2016 paper by Wei Liu, Dragom

4.8k Jan 07, 2023

Este conversor criará a medida exata para sua receita de capuccino gelado da grandiosa Rafaella Ballerini!

ConversorDeMedidas_CapuccinoGelado Este conversor criará a medida exata para sua receita de capuccino gelado da grandiosa Rafaella Ballerini! Requirem

48 Nov 15, 2022

Contains source code for the winning solution of the xView3 challenge

Winning Solution for xView3 Challenge This repository contains source code and pretrained models for my (Eugene Khvedchenya) solution to xView 3 Chall

51 Dec 30, 2022

[CVPR21] LightTrack: Finding Lightweight Neural Network for Object Tracking via One-Shot Architecture Search

LightTrack: Finding Lightweight Neural Networks for Object Tracking via One-Shot Architecture Search The official implementation of the paper LightTra

290 Dec 24, 2022

This is an official implementation for "PlaneRecNet".

PlaneRecNet This is an official implementation for PlaneRecNet: A multi-task convolutional neural network provides instance segmentation for piece-wis

50 Nov 17, 2022

[AAAI22] Reliable Propagation-Correction Modulation for Video Object Segmentation

Reliable Propagation-Correction Modulation for Video Object Segmentation (AAAI22) Preview version paper of this work is available at: https://arxiv.or

70 Dec 04, 2022

ROS Basics and TurtleSim

Waypoint Follower Anna Garverick This package draws given waypoints, then waits for a service call with a start position to send the turtle to each wa

1 Dec 13, 2021

The object detection pipeline is based on Ultralytics YOLOv5

AYOLOv2 The main goal of this repository is to rewrite the object detection pipeline with a better code structure for better portability and adaptabil

153 Dec 22, 2022

Text-to-Music Retrieval using Pre-defined/Data-driven Emotion Embeddings

Text2Music Emotion Embedding Text-to-Music Retrieval using Pre-defined/Data-driven Emotion Embeddings Reference Emotion Embedding Spaces for Matching

50 Dec 05, 2022

A general framework for inferring CNNs efficiently. Reduce the inference latency of MobileNet-V3 by 1.3x on an iPhone XS Max without sacrificing accuracy.

GFNet-Pytorch (NeurIPS 2020) This repo contains the official code and pre-trained models for the glance and focus network (GFNet). Glance and Focus: a

169 Oct 28, 2022

Unofficial implementation of Google's FNet: Mixing Tokens with Fourier Transforms

FNet: Mixing Tokens with Fourier Transforms Pytorch implementation of Fnet : Mixing Tokens with Fourier Transforms. Citation: @misc{leethorp2021fnet,

218 Jan 05, 2023

Awesome-AI-books - Some awesome AI related books and pdfs for learning and downloading

Awesome AI books Some awesome AI related books and pdfs for downloading and learning. Preface This repo only used for learning, do not use in business

1k Jan 01, 2023

Generative Adversarial Networks for High Energy Physics extended to a multi-layer calorimeter simulation

CaloGAN Simulating 3D High Energy Particle Showers in Multi-Layer Electromagnetic Calorimeters with Generative Adversarial Networks. This repository c

101 Nov 13, 2022

Omnidirectional Scene Text Detection with Sequential-free Box Discretization (IJCAI 2019). Including competition model, online demo, etc.

Box_Discretization_Network This repository is built on the pytorch [maskrcnn_benchmark]. The method is the foundation of our ReCTs-competition method

266 Nov 24, 2022