Deep Structured Instance Graph for Distilling Object Detectors (ICCV 2021)

Last update: Nov 17, 2022

Related tags

Deep Learning Dsig

Overview

DSIG

Deep Structured Instance Graph for Distilling Object Detectors

Authors: Yixin Chen, Pengguang Chen, Shu Liu, Liwei Wang, Jiaya Jia.

[pdf] [slide] [supp] [bibtex]

This repo provides the implementation of paper "Deep Structured Instance Graph for Distilling Object Detectors"(Dsig) based on detectron2. Specifically, aiming at solving the feature imbalance problem while further excavating the missing relation inside semantic instances, we design a graph whose nodes correspond to instance proposal-level features and edges represent the relation between nodes. We achieve new state-of-the-art results on the COCO object detection task with diverse student-teacher pairs on both one- and two-stage detectors.

Installation

Requirements

Python >= 3.6
Pytorch >= 1.7.0
Torchvision >= 0.8.1
Pycocotools 2.0.2

Follow the install instructions in detectron2, note that in this repo we use detectron2 commit version ff638c931d5999f29c22c1d46a3023e67a5ae6a1. Download COCO dataset and export DETECTRON2_DATASETS=$COCOPATH to direct to COCO dataset. We prepare our pre-trained weights for training in Student-Teacher format, please follow the instructions in Pretrained.

Running

We prepare training configs following the detectron2 format. For training a Faster R-CNN R18-FPN student with a Faster R-CNN R50-FPN teacher on 4 GPUs:

./start_train.sh train projects/Distillation/configs/Distillation-FasterRCNN-R18-R50-dsig-1x.yaml

For testing:

./start_train.sh eval projects/Distillation/configs/Distillation-FasterRCNN-R18-R50-dsig-1x.yaml

For debugging:

./start_train.sh debugtrain projects/Distillation/configs/Distillation-FasterRCNN-R18-R50-dsig-1x.yaml

Results and Models

Faster R-CNN:

Experiment(Student-Teacher)	Schedule	AP	Config	Model
R18-R50	1x	37.25	config	googledrive
R50-R101	1x	40.57	config	googledrive
R101-R152	1x	41.65	config	googledrive
MNV2-R50	1x	34.44	config	googledrive
EB0-R101	1x	37.74	config	googledrive

RetinaNet:

Experiment(Student-Teacher)	Schedule	AP	Config	Model
R18-R50	1x	34.72	config	googledrive
MNV2-R50	1x	32.16	config	googledrive
EB0-R101	1x	34.44	config	googledrive

More models and results will be released soon.

Citation

@inproceedings{chen2021dsig,
    title={Deep Structured Instance Graph for Distilling Object Detectors},
    author={Yixin Chen, Pengguang Chen, Shu Liu, Liwei Wang, and Jiaya Jia},
    booktitle={IEEE International Conference on Computer Vision (ICCV)},
    year={2021},
}

Contact

Please contact [email protected].

Deep Structured Instance Graph for Distilling Object Detectors (ICCV 2021)

Related tags

Overview

DSIG

Installation

Requirements

Running

Results and Models

Citation

Contact

Owner

DV Lab

Numerical-computing-is-fun - Learning numerical computing with notebooks for all ages.

Implementation of CrossViT: Cross-Attention Multi-Scale Vision Transformer for Image Classification

Acoustic mosquito detection code with Bayesian Neural Networks

Submodular Subset Selection for Active Domain Adaptation (ICCV 2021)

Incorporating Transformer and LSTM to Kalman Filter with EM algorithm

《Where am I looking at? Joint Location and Orientation Estimation by Cross-View Matching》(CVPR 2020)

This tutorial repository is to introduce the functionality of KGTK to first-time users

a basic code repository for basic task in CV(classification,detection,segmentation)

Official code of the paper "Expanding Low-Density Latent Regions for Open-Set Object Detection" (CVPR 2022)

[BMVC2021] "TransFusion: Cross-view Fusion with Transformer for 3D Human Pose Estimation"

Analyzing basic network responses to novel classes

A scikit-learn-compatible module for estimating prediction intervals.

CC-GENERATOR - A python script for generating CC

PyTorch implementations of algorithms for density estimation

On Size-Oriented Long-Tailed Graph Classification of Graph Neural Networks

TensorFlow implementation of Elastic Weight Consolidation

[Link]deep_portfolo - Use Reforcemet earg ad Supervsed learg to Optmze portfolo allocato []

DecoupledNet is semantic segmentation system which using heterogeneous annotations

A scikit-learn compatible neural network library that wraps PyTorch

Disagreement-Regularized Imitation Learning