The implementation for "Comprehensive Knowledge Distillation with Causal Intervention".

Last update: Nov 03, 2022

Related tags

Overview

Comprehensive Knowledge Distillation with Causal Intervention

This repository is a PyTorch implementation of "Comprehensive Knowledge Distillation with Causal Intervention". The code is modified from CRD, and the pretrained teachers (except WRN-40-4) are also downloaded from CRD.

Requirements

The code was tested on

Python 3.6
torch 1.2.0
torchvision 0.4.0

Evaluation

To evaluate our pre-trained light-weight student networks, first download the folder "pretrained_student_model" from CID models into the "save" folder, then simply run the command below to evaluate these light-weight students:

run evaluate_scripts.sh

Training

To train students from scratch by distilling knowledge from teacher networks with CID, first download the pretrained teacher folder "models" from CID models into the "save" folder, and then simply run the command below to compress large models to smaller ones:

run train_scripts.sh

Citation

If you find this code helpful, you may consider citing this paper:

@inproceedings{deng2021comprehensive,
  title={Comprehensive Knowledge Distillation with Causal Intervention},
  author={Deng, Xiang and Zhang, Zhongfei},
  booktitle = {Proceedings of the 30th Annual Conference on Neural Information Processing Systems},
  year={2021}
}

The implementation for "Comprehensive Knowledge Distillation with Causal Intervention".

Related tags

Overview

Comprehensive Knowledge Distillation with Causal Intervention

Requirements

Evaluation

Training

Citation

Owner

Xiang Deng

Implementation for the "Surface Reconstruction from 3D Line Segments" paper.

Spatial Transformer Nets in TensorFlow/ TensorLayer

HINet: Half Instance Normalization Network for Image Restoration

Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when train only in Vox2)

An Evaluation of Generative Adversarial Networks for Collaborative Filtering.

⚓ Eurybia monitor model drift over time and securize model deployment with data validation

Grounding Representation Similarity with Statistical Testing

ComputerVision - This repository aims at realized easy network architecture

Polyp-PVT: Polyp Segmentation with Pyramid Vision Transformers (arXiv2021)

[Open Source]. The improved version of AnimeGAN. Landscape photos/videos to anime

Multi-Task Learning as a Bargaining Game

Implementation of the bachelor's thesis "Real-time stock predictions with deep learning and news scraping".

A repository that finds a person who looks like you by using face recognition technology.

Collaborative forensic timeline analysis

Practical and Real-world applications of ML based on the homework of Hung-yi Lee Machine Learning Course 2021

Trustworthy AI related projects

Differentiable Neural Computers, Sparse Access Memory and Sparse Differentiable Neural Computers, for Pytorch

PoolFormer: MetaFormer is Actually What You Need for Vision

AdaShare: Learning What To Share For Efficient Deep Multi-Task Learning

structured-generative-modeling