This repo uses a combination of logits and feature distillation method to teach the PSPNet model of ResNet18 backbone with the PSPNet model of ResNet50 backbone. All the models are trained and tested on the PASCAL-VOC2012 dataset.

Last update: Dec 01, 2022

Overview

PSPNet-logits and feature-distillation

Introduction

This repository is based on PSPNet and modified from semseg and Pixelwise_Knowledge_Distillation_PSPNet18 which uses a logits knowledge distillation method to teach the PSPNet model of ResNet18 backbone with the PSPNet model of ResNet50 backbone. All the models are trained and tested on the PASCAL-VOC2012 dataset(Enhanced Version).

Innovation and Limitations

This repo adds a feature distillation in the aux layer of PSPNet without a linear feature mapping since the teacher and student model's output dimension after the aux layer is the same. On the other hand, if you want to adapt this repo to other structures, a mapping should be needed. Also, the output of the aux layer is very close to which of the final layer, so you should pay attention to the overfitting problem. Or you can distillate the features in earlier layers and add a mapping, of course, just like Fitnet.

For reimplementation

Please download related datasets and symlink the relevant paths. The temperature parameter(T) and corresponding weights can be changed flexibly. All the numbers showed in the name of python code indicate the number of layers; for instance, train_50_18.py represents the distillation of 50 layers to 18 layers.

Please note that you should train a teacher model( PSPNet model of ResNet50 backbone) at first, and save the checkpoints or just use a well trained PSPNet50 model, which you can refer to the original public code at semseg, and you should download the initial models and corresponding lists in semseg and put them in right paths, also all the environmental requirements in this repo are the same as semseg.

Usage

Requirement: PyTorch>=1.1.0, Python3, tensorboardX, GPU
Clone the repository:

git clone https://github.com/asaander719/PSPNet-knowledge-distillation.git

Download initialization models and lists, also trained models and predictions can be optional, by the link shows in semseg, and put them in files followed by instructions.
Download official dataset PASCAL-VOC2012, please note that it is Enhanced Version,and put them in corresponding paths follwed by data lists.
Train and test a teacher model: adjust parameters in config (voc2012_pspnet50.yaml), like layers. etc.., and the checkpoints will be saved automaticly, or you can just download a trained model, and put it in a right path.

python train_50.py

python test_50.py

Train and test a student model(optional, only for comparison): adjust parameters in config (voc2012_pspnet18.yaml), like layers. etc.., and the checkpoints will be saved automaticly, or you can just download a trained model, and put it in a right path.

python train_18.py

python test_18.py

Distillation and Test: the results should between the teacher and the student model.

Please note that you should adjust some parameters when you use fuctions in the file named model.

python train_50_18_my.py

python test_50_18.py

Reference

@misc{semseg2019, author={Zhao, Hengshuang}, title={semseg}, howpublished={\url{https://github.com/hszhao/semseg}}, year={2019} }

@inproceedings{zhao2017pspnet, title={Pyramid Scene Parsing Network}, author={Zhao, Hengshuang and Shi, Jianping and Qi, Xiaojuan and Wang, Xiaogang and Jia, Jiaya}, booktitle={CVPR}, year={2017} }

@inproceedings{zhao2018psanet, title={{PSANet}: Point-wise Spatial Attention Network for Scene Parsing}, author={Zhao, Hengshuang and Zhang, Yi and Liu, Shu and Shi, Jianping and Loy, Chen Change and Lin, Dahua and Jia, Jiaya}, booktitle={ECCV}, year={2018} }

This repo uses a combination of logits and feature distillation method to teach the PSPNet model of ResNet18 backbone with the PSPNet model of ResNet50 backbone. All the models are trained and tested on the PASCAL-VOC2012 dataset.

Related tags

Overview

PSPNet-logits and feature-distillation

Introduction

Innovation and Limitations

For reimplementation

Usage

Reference

Owner

LIAO Shuiying

RLHive: a framework designed to facilitate research in reinforcement learning.

Code for "Adversarial attack by dropping information." (ICCV 2021)

Robust Video Matting in PyTorch, TensorFlow, TensorFlow.js, ONNX, CoreML!

[AAAI2022] Source code for our paper《Suppressing Static Visual Cues via Normalizing Flows for Self-Supervised Video Representation Learning》

Fast Differentiable Matrix Sqrt Root

AFLNet: A Greybox Fuzzer for Network Protocols

A repository for interferometer controller code.

基于Pytorch实现优秀的自然图像分割框架！(包括FCN、U-Net和Deeplab)

Sample and Computation Redistribution for Efficient Face Detection

PyTorch implementation of SMODICE: Versatile Offline Imitation Learning via State Occupancy Matching

Alpha-Zero - Telegram Group Manager Bot Written In Python Using Pyrogram

Codes for our paper "SentiLARE: Sentiment-Aware Language Representation Learning with Linguistic Knowledge" (EMNLP 2020)

Genshin-assets - 👧 Public documentation & static assets for Genshin Impact data.

git《Joint Entity and Relation Extraction with Set Prediction Networks》(2020) GitHub:

Build a medical knowledge graph based on Unified Language Medical System (UMLS)

Software for Multimodalty 2D+3D Facial Expression Recognition (FER) UI

Embracing Single Stride 3D Object Detector with Sparse Transformer

Consensus score for tripadvisor

Standalone pre-training recipe with JAX+Flax

The spiritual successor to knockknock for PyTorch Lightning, get notified when your training ends