PyTorch implementation of DCT fast weight RNNs

Last update: Dec 24, 2022

Overview

DCT based fast weights

This repository contains the official code for the paper: Training and Generating Neural Networks in Compressed Weight Space.

The main code includes:

DCT LSTM: LSTMs whose weights are encoded by discrete cosine transform (DCT).
DCT fast weight RNN: RNNs whose weights are encoded by DCT, and the DCT coefficients are parameterized by LSTMs.

The language modeling experiments reported in the paper were produced by porting code (with minor changes due to some clean-up) of this repository in a fork of this toolkit.

Requirements

torch_dct (can be installed via pip install torch_dct)
PyTorch with a version compatible with torch_dct.

Our experiments were conducted using PyTorch version 1.6.0 . More recent versions are apparently not compatible with torch_dct (at least at the time of writing this file). We recommend to run python custom_layer.py to check the compatibility.

References

If you make use of this toolkit for your experiments, please cite:

@inproceedings{irie2021training,
  title={Training and Generating Neural Networks in Compressed Weight Space},
  author={Kazuki Irie and J{\"u}rgen Schmidhuber},
  booktitle={Neural Compression: From Information Theory to Applications -- Workshop @ ICLR 2021},
  year={2021},
  address={Virtual only},
  month=may
}

PyTorch implementation of DCT fast weight RNNs

Related tags

Overview

DCT based fast weights

Requirements

References

Owner

Kazuki Irie

The fundamental package for scientific computing with Python.

pytorch, hand(object) detect ,yolo v5，手检测

Recognize numbers from an (28 x 28) image using neural networks

Stitch it in Time: GAN-Based Facial Editing of Real Videos

Official implementation for "Symbolic Learning to Optimize: Towards Interpretability and Scalability"

A very simple tool to rewrite parameters such as attributes and constants for OPs in ONNX models. Simple Attribute and Constant Modifier for ONNX.

Personal project about genus-0 meshes, spherical harmonics and a cow

Node Editor Plug for Blender

HMLET (Hybrid-Method-of-Linear-and-non-linEar-collaborative-filTering-method)

Inference pipeline for our participation in the FeTA challenge 2021.

S-attack library. Official implementation of two papers "Are socially-aware trajectory prediction models really socially-aware?" and "Vehicle trajectory prediction works, but not everywhere".

Anti-UAV base on PaddleDetection

Forecasting Nonverbal Social Signals during Dyadic Interactions with Generative Adversarial Neural Networks

Applying CLIP to Point Cloud Recognition.

A PyTorch implementation for V-Net: Fully Convolutional Neural Networks for Volumetric Medical Image Segmentation

Meta Learning for Semi-Supervised Few-Shot Classification

Meta-TTS: Meta-Learning for Few-shot SpeakerAdaptive Text-to-Speech

[NeurIPS'21 Spotlight] PyTorch code for our paper "Aligned Structured Sparsity Learning for Efficient Image Super-Resolution"

EvDistill: Asynchronous Events to End-task Learning via Bidirectional Reconstruction-guided Cross-modal Knowledge Distillation (CVPR'21)

A best practice for tensorflow project template architecture.