MLP-Mixer: An all-MLP Architecture for Vision

This repo contains a PyTorch implementation of MLP-Mixer: An all-MLP Architecture for Vision.

Usage :

import torch
# Assumes the module file is importable as mlp_mixer; a hyphenated name
# such as "mlp-mixer" is not valid in a Python import statement.
from mlp_mixer import MLPMixer

# Dummy batch: one 3-channel 224x224 image
img = torch.ones([1, 3, 224, 224])

model = MLPMixer(in_channels=3, image_size=224, patch_size=16, num_classes=1000,
                 dim=512, depth=8, token_dim=256, channel_dim=2048)

# Count trainable parameters, in millions
parameters = sum(p.numel() for p in model.parameters() if p.requires_grad) / 1_000_000
print('Trainable Parameters: %.3fM' % parameters)

out = model(img)

print("Shape of out :", out.shape)  # [B, num_classes]
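
The constructor arguments above fix the tensor shapes the model works with. As a rough sketch, assuming the standard MLP-Mixer layout from the paper (non-overlapping square patches, a per-patch linear embedding to `dim`, and a classification head), the shapes can be worked out with plain Python. `mixer_shapes` is a hypothetical helper written for illustration; it is not part of this repo.

```python
def mixer_shapes(batch, in_channels, image_size, patch_size, dim, num_classes):
    """Return the shapes a standard MLP-Mixer passes through (illustrative only)."""
    if image_size % patch_size != 0:
        raise ValueError("image_size must be divisible by patch_size")
    patches_per_side = image_size // patch_size
    num_patches = patches_per_side ** 2            # sequence length seen by the mixer layers
    patch_values = in_channels * patch_size ** 2   # raw values in one flattened patch
    return {
        "input": (batch, in_channels, image_size, image_size),
        "flattened_patches": (batch, num_patches, patch_values),
        "tokens": (batch, num_patches, dim),       # after the per-patch linear embedding
        "logits": (batch, num_classes),            # after pooling and the classifier head
    }


# Same configuration as the usage example above
shapes = mixer_shapes(batch=1, in_channels=3, image_size=224, patch_size=16,
                      dim=512, num_classes=1000)
for name, shape in shapes.items():
    print(f"{name:>18}: {shape}")
```

In the paper's design, `token_dim` and `channel_dim` only set the hidden widths of the token-mixing and channel-mixing MLPs inside each layer, so they do not change any of the shapes listed here.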

Citation :

@misc{tolstikhin2021mlpmixer,
      title={MLP-Mixer: An all-MLP Architecture for Vision}, 
      author={Ilya Tolstikhin and Neil Houlsby and Alexander Kolesnikov and Lucas Beyer and Xiaohua Zhai and Thomas Unterthiner and Jessica Yung and Daniel Keysers and Jakob Uszkoreit and Mario Lucic and Alexey Dosovitskiy},
      year={2021},
      eprint={2105.01601},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}

Acknowledgement :

Some components are borrowed from the ViT code in @lucidrains' repo: https://github.com/lucidrains/vit-pytorch

Unofficial implementation of MLP-Mixer: An all-MLP Architecture for Vision

Owner

Rishikesh (ऋषिकेश)
