Official repository for the paper "Going Beyond Linear Transformers with Recurrent Fast Weight Programmers"

Last update: Nov 15, 2022

Overview

Recurrent Fast Weight Programmers

This is the official repository containing the code we used to produce the experimental results reported in the paper:

Going Beyond Linear Transformers with Recurrent Fast Weight Programmers

algorithmic directory for code execution and ListOps
language_modeling directory for language modeling
reinforcement_learning directory for RL

Separate license files can be found under each directory.

General instructions

Please refer to the readme file in each directory for further instructions.

In all tasks, our custom CUDA kernels will be automatically compiled. To avoid recompiling the code multiple times, we recommend to specify the path to a directory to store the compiled code via:

export TORCH_EXTENSIONS_DIR="/home/me/torch_extensions/lm"

Such a line is already included in the example scripts we provide. Please change the path to a safe directory of your choice.

Important: separate paths should be used for different tasks (i.e. here, one for language modeling, one for code execution, one for ListOps, and one for RL).

BibTex

@article{irie2021going,
      title={Going Beyond Linear Transformers with Recurrent Fast Weight Programmers}, 
      author={Kazuki Irie and Imanol Schlag and R\'obert Csord\'as and J\"urgen Schmidhuber},
      journal={Preprint arXiv:2106.06295},
      year={2021}
}

Official repository for the paper "Going Beyond Linear Transformers with Recurrent Fast Weight Programmers"

Related tags

Overview

Recurrent Fast Weight Programmers

Contents

General instructions

BibTex

Links

Owner

IDSIA

Full Transformer Framework for Robust Point Cloud Registration with Deep Information Interaction

Official implementation of our CVPR2021 paper "OTA: Optimal Transport Assignment for Object Detection" in Pytorch.

The audio-video synchronization of MKV Container Format is exploited to achieve data hiding

Implementation for our ICCV 2021 paper: Dual-Camera Super-Resolution with Aligned Attention Modules

Tensorflow Repo for "DeepGCNs: Can GCNs Go as Deep as CNNs?"

Face Mask Detector by live camera using tensorflow-keras, openCV and Python

Volumetric Correspondence Networks for Optical Flow, NeurIPS 2019.

Official implementation of VQ-Diffusion

Pytorch implementation of CVPR2021 paper "MUST-GAN: Multi-level Statistics Transfer for Self-driven Person Image Generation"

A tool to visualise the results of AlphaFold2 and inspect the quality of structural predictions

Experiments on continual learning from a stream of pretrained models.

Deep Convolutional Generative Adversarial Networks

Object tracking and object detection is applied to track golf puts in real time and display stats/games.

This code is a near-infrared spectrum modeling method based on PCA and pls

FaceAnon - Anonymize people in images and videos using yolov5-crowdhuman

EFENet: Reference-based Video Super-Resolution with Enhanced Flow Estimation

Fastshap: A fast, approximate shap kernel

This is the official implementation of TrivialAugment and a mini-library for the application of multiple image augmentation strategies including RandAugment and TrivialAugment.

Example repository for custom C++/CUDA operators for TorchScript

Hierarchical probabilistic 3D U-Net, with attention mechanisms (—𝘈𝘵𝘵𝘦𝘯𝘵𝘪𝘰𝘯 𝘜-𝘕𝘦𝘵, 𝘚𝘌𝘙𝘦𝘴𝘕𝘦𝘵) and a nested decoder structure with deep supervision (—𝘜𝘕𝘦𝘵++).