Official pytorch implementation of paper "Inception Convolution with Efficient Dilation Search" (CVPR 2021 Oral).

Last update: Dec 31, 2022

Related tags

Deep Learning IC-Conv

Overview

IC-Conv

This repository is an official implementation of the paper Inception Convolution with Efficient Dilation Search.

Getting Started

Download ImageNet pre-trained checkpoints.

Extract the file to get the following directory tree

|-- README.md
|-- ckpt
|   |-- detection
|   |-- human_pose
|   |-- segmentation
|-- config
|-- model
|-- pattern_zoo

Easy Use

The current implementation is coupled to specific downstream tasks. OpenMMLab users can quickly use IC-Conv in the following simple ways.

from models import IC_ResNet
import torch
net = IC_ResNet(depth=50,pattern_path='pattern_zoo/detection/ic_r50_k9.json')
net.eval()
inputs = torch.rand(1, 3, 32, 32)
outputs = net.forward(inputs)

For 2d Human Pose Estimation using MMPose

Copying the config files to the config path of mmpose, such as

cp config/human_pose/ic_res50_k13_coco_640x640.py your_mmpose_path/mmpose/configs/bottom_up/resnet/coco/ic_res50_k13_coco_640x640.py

Copying the inception conv files to the model path of mmpose,

cp model/ic_conv2d.py your_mmpose_path/mmpose/mmpose/models/backbones/ic_conv2d.py
cp model/ic_resnet.py your_mmpose_path/mmpose/mmpose/models/backbones/ic_resnet.py

Running it directly like MMPose.

Model Zoo

We provided the pre-trained weights of IC-ResNet-50, IC-ResNet-101and IC-ResNeXt-101 (32x4d) on ImageNet and the weights trained on specific tasks.

For users with limited computing power, you can directly reuse our provided IC-Conv and ImageNet pre-training weights for detection, segmentation, and 2d human pose estimation tasks on other datasets.

Attentions: The links in the tables below are relative paths. Therefore, you should clone the repository and download checkpoints.

Object Detection

Detector	Backbone	Lr	AP	dilation_pattern	checkpoint
Faster-RCNN-FPN	IC-R50	1x	38.9	pattern	ckpt/imagenet_retrain_ckpt
Faster-RCNN-FPN	IC-R101	1x	41.9	pattern	ckpt/imagenet_retrain_ckpt
Faster-RCNN-FPN	IC-X101-32x4d	1x	42.1	pattern	ckpt/imagenet_retrain_ckpt
Cascade-RCNN-FPN	IC-R50	1x	42.4	pattern	ckpt/imagenet_retrain_ckpt
Cascade-RCNN-FPN	IC-R101	1x	45.0	pattern	ckpt/imagenet_retrain_ckpt
Cascade-RCNN-FPN	IC-X101-32x4d	1x	45.7	pattern	ckpt/imagenet_retrain_ckpt

Instance Segmentation

Detector	Backbone	Lr	box AP	mask AP	dilation_pattern	checkpoint
Mask-RCNN-FPN	IC-R50	1x	40.0	35.9	pattern	ckpt/imagenet_retrain_ckpt
Mask-RCNN-FPN	IC-R101	1x	42.6	37.9	pattern	ckpt/imagenet_retrain_ckpt
Mask-RCNN-FPN	IC-X101-32x4d	1x	43.4	38.4	pattern	ckpt/imagenet_retrain_ckpt
Cascade-RCNN-FPN	IC-R50	1x	43.4	36.8	pattern	ckpt/imagenet_retrain_ckpt
Cascade-RCNN-FPN	IC-R101	1x	45.7	38.7	pattern	ckpt/imagenet_retrain_ckpt
Cascade-RCNN-FPN	IC-X101-32x4d	1x	46.4	39.1	pattern	ckpt/imagenet_retrain_ckpt

2d Human Pose Estimation

We adjust the learning rate of resnet backbone in MMPose and get better baseline results. Please see the specific config files in config/human_pose/.

Results on COCO val2017 without multi-scale test

Backbone	Input Size	AP	dilation_pattern	checkpoint
R50(mmpose)	640x640	47.9	~	~
R50	640x640	51.0	~	~
IC-R50	640x640	62.2	pattern	ckpt/imagenet_retrain_ckpt
R101	640x640	55.5	~	~
IC-R101	640x640	63.3	pattern	ckpt/imagenet_retrain_ckpt

Results on COCO val2017 with multi-scale test. 3 default scales ([2, 1, 0.5]) are used

Backbone	Input Size	AP
R50(mmpose)	640x640	52.5
R50	640x640	55.8
IC-R50	640x640	65.8
R101	640x640	60.2
IC-R101	640x640	68.5

Acknowledgement

The human pose estimation experiments are built upon MMPose.

Citation

If our paper helps your research, please cite it in your publications:

@article{liu2020inception,
 title={Inception Convolution with Efficient Dilation Search},
 author={Liu, Jie and Li, Chuming and Liang, Feng and Lin, Chen and Sun, Ming and Yan, Junjie and Ouyang, Wanli and Xu, Dong},
 journal={arXiv preprint arXiv:2012.13587},
 year={2020}
}

Official pytorch implementation of paper "Inception Convolution with Efficient Dilation Search" (CVPR 2021 Oral).

Related tags

Overview

IC-Conv

Getting Started

Easy Use

For 2d Human Pose Estimation using MMPose

Model Zoo

Object Detection

Instance Segmentation

2d Human Pose Estimation

Results on COCO val2017 without multi-scale test

Results on COCO val2017 with multi-scale test. 3 default scales ([2, 1, 0.5]) are used

Acknowledgement

Citation

Owner

Jie Liu

It's a implement of this paper：Relation extraction via Multi-Level attention CNNs

This python-based package offers a way of creating a parametric OpenMC plasma source from plasma parameters.

Code for the upcoming CVPR 2021 paper

Apollo optimizer in tensorflow

DIP-football - A football video analyse system based on Yolov5, alphapose, Qt6

CLEAR algorithm for multi-view data association

Visualizer using audio and semantic analysis to explore BigGAN (Brock et al., 2018) latent space.

PyTorch implementation of ICLR 2022 paper PiCO: Contrastive Label Disambiguation for Partial Label Learning

A study project using the AA-RMVSNet to reconstruct buildings from multiple images

Group-Free 3D Object Detection via Transformers

A trashy useless Latin programming language written in python.

Official codebase for Pretrained Transformers as Universal Computation Engines.

Qt-GUI implementation of the YOLOv5 algorithm (ver.6 and ver.5)

Code for "My(o) Armband Leaks Passwords: An EMG and IMU Based Keylogging Side-Channel Attack" paper

Predicting Semantic Map Representations from Images with Pyramid Occupancy Networks

Deep Learning with PyTorch made easy 🚀 !

Revisting Open World Object Detection

Yet Another Robotics and Reinforcement (YARR) learning framework for PyTorch.

MOOSE (Multi-organ objective segmentation) a data-centric AI solution that generates multilabel organ segmentations to facilitate systemic TB whole-person research

This is the workbook I created while I was studying for the Qiskit Associate Developer exam. I hope this becomes useful to others as it was for me :)