Adaptive FNO transformer - official Pytorch implementation

Last update: Dec 29, 2022

Related tags

Overview

Adaptive Fourier Neural Operators: Efficient Token Mixers for Transformers

This repository contains PyTorch implementation of the Adaptive Fourier Neural Operator token mixer. Classification code is also provided in the classification folder.

The Adaptive Fourier Neural Operator is a token mixer that learns to mix in the Fourier domain. AFNO is based on a principled foundation of operator learning which allows us to frame token mixing as a continuous global convolution without any dependence on the input resolution. This principle was previously used to design FNO, which solves global convolution efficiently in the Fourier domain and has shown promise in learning challenging PDEs. To handle challenges in visual representation learning such as discontinuities in images and high resolution inputs, we propose principled architectural modifications to FNO which results in memory and computational efficiency. This includes imposing a block-diagonal structure on the channel mixing weights, adaptively sharing weights across tokens, and sparsifying the frequency modes via soft-thresholding and shrinkage. The resulting model is highly parallel with a quasi-linear complexity and has linear memory in the sequence size.

[arXiv]

Usage

Requirements

torch>=1.8.0
torchvision
timm

Note: To use the rfft2 and irfft2 functions in PyTorch, you need to install PyTorch>=1.8.0. Complex numbers are supported after PyTorch 1.6.0, but the fft API is slightly different from the current version.

Installation

pip install -e .

Example

from afno import AFNO1D, AFNO2D

mixer = AFNO1D()
mixer = AFNO2D()

Citation

If you find our work useful in your research, please consider citing:

@inproceedings{guibas2021efficient,
  title={Efficient Token Mixing for Transformers via Adaptive Fourier Neural Operators},
  author={Guibas, John and Mardani, Morteza and Li, Zongyi and Tao, Andrew and Anandkumar, Anima and Catanzaro, Bryan},
  booktitle={International Conference on Learning Representations},
  year={2021}
}

Adaptive FNO transformer - official Pytorch implementation

Related tags

Overview

Adaptive Fourier Neural Operators: Efficient Token Mixers for Transformers

Usage

Requirements

Installation

Example

Citation

Owner

NVIDIA Research Projects

Spatial Single-Cell Analysis Toolkit

Deal or No Deal? End-to-End Learning for Negotiation Dialogues

Diverse Object-Scene Compositions For Zero-Shot Action Recognition

Jiminy Cricket Environment (NeurIPS 2021)

Related resources for our EMNLP 2021 paper

[NeurIPS 2021] Low-Rank Subspaces in GANs

This is the source code for our ICLR2021 paper: Adaptive Universal Generalized PageRank Graph Neural Network.

This is the code for our KILT leaderboard submission to the T-REx and zsRE tasks. It includes code for training a DPR model then continuing training with RAG.

Pytorch Implementation for Dilated Continuous Random Field

Towards End-to-end Video-based Eye Tracking

Rax is a Learning-to-Rank library written in JAX

ESL: Event-based Structured Light

This is the pytorch implementation for the paper: Learning Accurate Performance Predictors for Ultrafast Automated Model Compression, which is in submission to TPAMI

Potato Disease Classification - Training, Rest APIs, and Frontend to test.

Fast image augmentation library and an easy-to-use wrapper around other libraries

PyTorch implementation of Algorithm 1 of "On the Anatomy of MCMC-Based Maximum Likelihood Learning of Energy-Based Models"

Scripts of Machine Learning Algorithms from Scratch. Implementations of machine learning models and algorithms using nothing but NumPy with a focus on accessibility. Aims to cover everything from basic to advance.

Safe Model-Based Reinforcement Learning using Robust Control Barrier Functions

This repo is customed for VisDrone.

How the Deep Q-learning method works and discuss the new ideas that makes the algorithm work

Adaptive FNO transformer - official Pytorch implementation

Related tags

Overview

Adaptive Fourier Neural Operators: Efficient Token Mixers for Transformers

Usage

Requirements

Installation

Example

Citation

Owner

NVIDIA Research Projects

Spatial Single-Cell Analysis Toolkit

Deal or No Deal? End-to-End Learning for Negotiation Dialogues

Diverse Object-Scene Compositions For Zero-Shot Action Recognition

Jiminy Cricket Environment (NeurIPS 2021)

Related resources for our EMNLP 2021 paper

[NeurIPS 2021] Low-Rank Subspaces in GANs

This is the source code for our ICLR2021 paper: Adaptive Universal Generalized PageRank Graph Neural Network.

This is the code for our KILT leaderboard submission to the T-REx and zsRE tasks. It includes code for training a DPR model then continuing training with RAG.

Pytorch Implementation for Dilated Continuous Random Field

Towards End-to-end Video-based Eye Tracking

Rax is a Learning-to-Rank library written in JAX

ESL: Event-based Structured Light

This is the pytorch implementation for the paper: *Learning Accurate Performance Predictors for Ultrafast Automated Model Compression*, which is in submission to TPAMI

Potato Disease Classification - Training, Rest APIs, and Frontend to test.

Fast image augmentation library and an easy-to-use wrapper around other libraries

PyTorch implementation of Algorithm 1 of "On the Anatomy of MCMC-Based Maximum Likelihood Learning of Energy-Based Models"

Scripts of Machine Learning Algorithms from Scratch. Implementations of machine learning models and algorithms using nothing but NumPy with a focus on accessibility. Aims to cover everything from basic to advance.

Safe Model-Based Reinforcement Learning using Robust Control Barrier Functions

This repo is customed for VisDrone.

How the Deep Q-learning method works and discuss the new ideas that makes the algorithm work

This is the pytorch implementation for the paper: Learning Accurate Performance Predictors for Ultrafast Automated Model Compression, which is in submission to TPAMI