Skyformer: Remodel Self-Attention with Gaussian Kernel and Nystr\"om Method (NeurIPS 2021)

Last update: Sep 20, 2022

Related tags

Overview

Skyformer

This repository is the official implementation of Skyformer: Remodel Self-Attention with Gaussian Kernel and Nystr"om Method (NeurIPS 2021).

Requirements

To install requirements in a conda environment:

conda create -n skyformer python=3.6
conda activate skyformer
pip install -r requirements.txt

Note: Specific requirements for data preprocessing are not included here.

Data Preparation

Processed files can be downloaded here, or processed with the following steps:

Requirements

tensorboard>=2.3.0
tensorflow>=2.3.1
tensorflow-datasets>=4.0.1

Download the TFDS files for pathfinder and then set _PATHFINER_TFDS_PATH to the unzipped directory (following https://github.com/google-research/long-range-arena/issues/11)
Download lra_release.gz (7.7 GB).
Unzip lra-release and put under ./data/.

cd data
wget https://storage.googleapis.com/long-range-arena/lra_release.gz
tar zxvf lra-release.gz

Create a directory lra_processed under ./data/.

mkdir lra_processed
cd ..

6.The directory structure would be (assuming the root dir is code)

./data/lra-processed
./data/long-range-arena-main
./data/lra_release

Create train, dev, and test dataset pickle files for each task.

cd preprocess
python create_pathfinder.py
python create_listops.py
python create_retrieval.py
python create_text.py
python create_cifar10.py

Note: most source code comes from LRA repo.

Run

Modify the configuration in config.py and run

python main.py --mode train --attn skyformer --task lra-text

mode: train, eval
attn: softmax, nystrom, linformer, reformer, perfromer, informer, bigbird, kernelized, skyformer
task: lra-listops, lra-pathfinder, lra-retrieval, lra-text, lra-image

Reference

@inproceedings{Skyformer,
    title={Skyformer: Remodel Self-Attention with Gaussian Kernel and Nystr\"om Method}, 
    author={Yifan Chen and Qi Zeng and Heng Ji and Yun Yang},
    booktitle={NeurIPS},
    year={2021}
}

Skyformer: Remodel Self-Attention with Gaussian Kernel and Nystr\"om Method (NeurIPS 2021)

Related tags

Overview

Skyformer

Requirements

Data Preparation

Run

Reference

Owner

Qi Zeng

Collection of in-progress libraries for entity neural networks.

3D Generative Adversarial Network

Driller: augmenting AFL with symbolic execution!

Mmdetection3d Noted - MMDetection3D is an open source object detection toolbox based on PyTorch

Face recognition system using MTCNN, FACENET, SVM and FAST API to track participants of Big Brother Brasil in real time.

This repository contains the official code of the paper Equivariant Subgraph Aggregation Networks (ICLR 2022)

Scikit-learn compatible estimation of general graphical models

Tool which allow you to detect and translate text.

Tensorflow implementation of ID-Unet: Iterative Soft and Hard Deformation for View Synthesis.

Oriented Response Networks, in CVPR 2017

Leaderboard and Visualization for RLCard

A Tensorflow implementation of BicycleGAN.

UnpNet - Rethinking 3-D LiDAR Point Cloud Segmentation(IEEE TNNLS)

Fairness Metrics: All you need to know

An Api for Emotion recognition.

The source code for CATSETMAT: Cross Attention for Set Matching in Bipartite Hypergraphs

Fashion Recommender System With Python

Monitor your ML jobs on mobile devices📱, especially for Google Colab / Kaggle

Nightmare-Writeup - Writeup for the Nightmare CTF Challenge from 2022 DiceCTF

A complete, self-contained example for training ImageNet at state-of-the-art speed with FFCV