My implementation of transformers related papers for computer vision in pytorch

Last update: Nov 10, 2021

Overview

vision_transformers

This is my personnal repo to implement new transofrmers based and other computer vision DL models

I am currenlty working without a lot of GPU ressources therefore I mainly trained models on CIFAR 10. But my implementation are build to be fast and effective at scale.

Current paper implemented:

An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale, from Dosovitskiy et al (2020)
Patch Are All You Need ? anonymous

Baseline:

Deep Residual Learning for Image Recognition, from He et al (2015)

Models are implemented in pure pytorch and trained via pytorchlightning. Dependencies are managed by poetry. It is included an Dockerfile to create a cuda ready container with jupyter lab inside. On the development part, I use jupytext in order to avoid commit every metadata change on the notebook. Fully tested with pytest and formatted with black and isort.

If you want to create a project with similar config, just use my boilerplat.

How to use it ?

first install the dependecies:

poetry install

Then, only for development:

add the precommit hook

poetry run pre-commit install

sync the notebook (only once)

poetry shell
make notebook-sync

launch a jupyter lab session

poetry run jupyter lab

Use tensorboard

poetry shell
make tensorboard

Format the code without the precommit hook

poetry shell
make formatting

Tests:

to run the tests:

poetry shell
make tests

You might also like...

Build fully-functioning computer vision models with PyTorch

Detecto is a Python package that allows you to build fully-functioning computer vision and object detection models with just 5 lines of code. Inferenc

576 Dec 29, 2022

A PyTorch-Based Framework for Deep Learning in Computer Vision

TorchCV: A PyTorch-Based Framework for Deep Learning in Computer Vision @misc{you2019torchcv, author = {Ansheng You and Xiangtai Li and Zhen Zhu a

2.2k Jan 9, 2023

Open Source Differentiable Computer Vision Library for PyTorch

Kornia is a differentiable computer vision library for PyTorch. It consists of a set of routines and differentiable modules to solve generic computer

7.6k Jan 4, 2023

An Agnostic Computer Vision Framework - Pluggable to any Training Library: Fastai, Pytorch-Lightning with more to come

IceVision is the first agnostic computer vision framework to offer a curated collection with hundreds of high-quality pre-trained models from torchvision, MMLabs, and soon Pytorch Image Models. It orchestrates the end-to-end deep learning workflow allowing to train networks with easy-to-use robust high-performance libraries such as Pytorch-Lightning and Fastai

789 Dec 29, 2022

My implementation of transformers related papers for computer vision in pytorch

Related tags

Overview

vision_transformers

How to use it ?

launch a jupyter lab session

Use tensorboard

Format the code without the precommit hook

Tests:

You might also like...

Build fully-functioning computer vision models with PyTorch

A PyTorch-Based Framework for Deep Learning in Computer Vision

Open Source Differentiable Computer Vision Library for PyTorch

An Agnostic Computer Vision Framework - Pluggable to any Training Library: Fastai, Pytorch-Lightning with more to come

Spiking Neural Network for Computer Vision using SpikingJelly framework and Pytorch-Lightning

Implementation of self-attention mechanisms for general purpose. Focused on computer vision modules. Ongoing repository.

The Incredible PyTorch: a curated list of tutorials, papers, projects, communities and more relating to PyTorch.

Explainability for Vision Transformers (in PyTorch)

PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO

Releases(0.1.0)

0.1.0(Nov 10, 2021)

Owner

samsja

My solution for the 7th place / 245 in the Umoja Hack 2022 challenge

An optimization and data collection toolbox for convenient and fast prototyping of computationally expensive models.

The code written during my Bachelor Thesis "Classification of Human Whole-Body Motion using Hidden Markov Models".

Reinforcement Learning for finance

Customised to detect objects automatically by a given model file(onnx)

AFLNet: A Greybox Fuzzer for Network Protocols

Code for the paper "On the Power of Edge Independent Graph Models"

A Broad Study on the Transferability of Visual Representations with Contrastive Learning

a practicable framework used in Deep Learning. So far UDL only provide DCFNet implementation for the ICCV paper (Dynamic Cross Feature Fusion for Remote Sensing Pansharpening)

BMVC 2021 Oral: code for BI-GCN: Boundary-Aware Input-Dependent Graph Convolution for Biomedical Image Segmentation

This repository is the offical Pytorch implementation of ContextPose: Context Modeling in 3D Human Pose Estimation: A Unified Perspective (CVPR 2021).

A Framework for Encrypted Machine Learning in TensorFlow

Code for the paper "Functional Regularization for Reinforcement Learning via Learned Fourier Features"

Leveraging OpenAI's Codex to solve cornerstone problems in Music

Code and dataset for ACL2018 paper "Exploiting Document Knowledge for Aspect-level Sentiment Classification"

NeRD: Neural Reflectance Decomposition from Image Collections

Official Implementation for the paper DeepFace-EMD: Re-ranking Using Patch-wise Earth Mover’s Distance Improves Out-Of-Distribution Face Identification

This repo will contain code to reproduce and build upon understanding transfer learning

Multispectral Object Detection with Yolov5

A variational Bayesian method for similarity learning in non-rigid image registration (CVPR 2022)