tokenlearner-pytorch

Unofficial PyTorch implementation of TokenLearner by Ryoo et al. from Google AI.

Installation

You can install TokenLearner via pip:

pip install tokenlearner-pytorch

Usage

You can import the TokenLearner class from the tokenlearner_pytorch package. You can use this layer with a Vision Transformer, MLP-Mixer, or Video Vision Transformer, as done in the paper.

import torch
from tokenlearner_pytorch import TokenLearner

tklr = TokenLearner(S=8)        # S: number of tokens to learn
x = torch.rand(512, 32, 32, 3)  # a batch of images, [B, H, W, C]
y = tklr(x)                     # [512, 8, 3], i.e. [B, S, C]
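To make the shape change above concrete, here is a minimal sketch of the idea described in the paper: each of the S tokens is a spatially weighted average of the input features, with the weights given by a learned attention map. The class name `TokenLearnerSketch` is hypothetical, and the single 1x1 convolution used to produce the attention maps is an assumption made for brevity. It is not the implementation shipped in this package.

```python
import torch
import torch.nn as nn


class TokenLearnerSketch(nn.Module):
    """Hypothetical, simplified stand-in for illustration only."""

    def __init__(self, C, S):
        super().__init__()
        # Assumption: one 1x1 convolution yields S spatial attention maps.
        self.attn = nn.Conv2d(C, S, kernel_size=1)

    def forward(self, x):
        # x: [B, H, W, C] (channels-last, as in the usage example)
        B, H, W, C = x.shape
        maps = torch.sigmoid(self.attn(x.permute(0, 3, 1, 2)))  # [B, S, H, W]
        maps = maps.reshape(B, -1, H * W)                        # [B, S, HW]
        feats = x.reshape(B, H * W, C)                           # [B, HW, C]
        # Weighted sum over all spatial positions, divided by HW:
        # a spatial average of the attention-weighted features.
        return torch.bmm(maps, feats) / (H * W)                  # [B, S, C]


tokens = TokenLearnerSketch(C=3, S=8)(torch.rand(4, 32, 32, 3))
print(tokens.shape)  # torch.Size([4, 8, 3])
```

The number of output tokens depends only on S, not on H or W, which is why the layer can shrink a 32 x 32 grid to 8 tokens.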

You can also use TokenLearner and TokenFuser together with Multi-head Self-Attention as done in the paper:

import torch
import torch.nn as nn
from tokenlearner_pytorch import TokenLearner, TokenFuser

mhsa = nn.MultiheadAttention(3, 1)
tklr = TokenLearner(S=8)
tkfr = TokenFuser(H=32, W=32, C=3, S=8)

x = torch.rand(512, 32, 32, 3) # a batch of images

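y = tklr(x)                          # [512, 8, 3]
y = y.permute(1, 0, 2)               # [8, 512, 3]: (seq, batch, embed), the default layout for nn.MultiheadAttention
y, _ = mhsa(y, y, y)                 # ignore attn weights
y = y.permute(1, 0, 2).contiguous()  # back to [512, 8, 3]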

out = tkfr(y, x) # [512, 32, 32, 3]
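By default, nn.MultiheadAttention expects its input as (sequence, batch, embedding), so the first two axes of the token tensor have to be swapped before the attention call and swapped back afterwards. The small sketch below, using a toy tensor chosen purely for illustration, shows how swapping axes with `permute` differs from reinterpreting the shape with `view`:

```python
import torch

# Toy token tensor: [batch=2, tokens=3, channels=1].
# The value at (b, t) is 3*b + t, so batch element 0 holds tokens 0, 1, 2.
y = torch.arange(6).reshape(2, 3, 1)

permuted = y.permute(1, 0, 2)  # swaps the axes: [tokens, batch, channels]
viewed = y.view(3, 2, 1)       # same target shape, but memory is only re-read

print(permuted[:, 0, 0].tolist())  # [0, 1, 2] (the tokens of batch element 0)
print(viewed[:, 0, 0].tolist())    # [0, 2, 4] (values from both batch elements)
```

Both results have shape [3, 2, 1], but only `permute` keeps each sequence tied to its own batch element.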

TODO

Add support for temporal dimension T
Implement TokenFuser with ViT
Implement TokenFuser with ViViT

Contributions

If I've made any errors or you have any suggestions, feel free to open an issue or PR. All contributions are welcome!

License

MIT

Owner

Rishabh Anand
