A PyTorch library for Vision Transformers

Last update: Nov 28, 2022

Related tags

Deep Learning vformer

Overview

VFormer

A PyTorch library for Vision Transformers

Getting Started

Read the contributing guidelines in CONTRIBUTING.rst to learn how to start contributing.

Comments

Add attention visualization methods

This article details different ways of visualizing a transformer's attention. It also discusses how such visualizations can aid model explainability.

They also provide their code here.

We would like to have such visualization methods in the viz module.

good first issue
opened by NeelayS 7
Remove _Projection class

We can replace the _Projection class with a one-line if-else expression.

Should we replace it with an if-else, or keep the current implementation?

cc: @NeelayS @aditya-agrawal-30502 @alvanli
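
To make the trade-off concrete, here is a minimal sketch of what the one-liner could look like. It assumes that _Projection applies a linear layer when the input and output dimensions differ and passes the input through unchanged otherwise; that behavior is an assumption, and the helper name make_projection is hypothetical rather than part of VFormer.

```python
import torch
from torch import nn


def make_projection(in_dim: int, out_dim: int) -> nn.Module:
    # Hypothetical helper, not VFormer API: project only when the
    # dimensions differ, otherwise return a no-op module.
    return nn.Linear(in_dim, out_dim) if in_dim != out_dim else nn.Identity()


same = make_projection(64, 64)      # nn.Identity, input passes through
changed = make_projection(64, 128)  # nn.Linear, last dimension becomes 128

x = torch.randn(2, 16, 64)          # (batch, tokens, channels)
print(same(x).shape, changed(x).shape)
```

The conditional expression is shorter, while a dedicated class keeps a stable module name in saved state dicts, which may matter for existing checkpoints.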

opened by abhi-glitchhg 6
Enhanced docstring

During the last PR (#45), I had to revert my changes because of compatibility issues.

In this PR, I have added some docstrings and made minor changes such as renaming variables.

This PR is the same as #48, with an edited title :)

@NeelayS

opened by abhi-glitchhg 3
Restructuring AbsolutePositionEmbedding class

The AbsolutePositionEmbedding class was structured specifically for PVT, but with proper restructuring it could be used in other models too. It should also support sinusoidal position embeddings; alternatively, a separate class for sinusoidal embeddings would work.
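
For reference, a minimal sketch of the fixed sinusoidal scheme from "Attention Is All You Need", written in plain Python so it runs without PyTorch. The function name is hypothetical and not part of VFormer; a library version would presumably return a tensor and sit alongside AbsolutePositionEmbedding.

```python
import math
from typing import List


def sinusoidal_position_embedding(num_positions: int, dim: int) -> List[List[float]]:
    """Return a num_positions x dim table of fixed sinusoidal embeddings.

    Hypothetical helper for illustration only, not VFormer API.
    Even columns hold sin(pos / 10000^(i / dim)) and the following odd
    column holds the matching cosine, where i is the even column index.
    """
    if dim % 2 != 0:
        raise ValueError("dim must be even")
    table = []
    for pos in range(num_positions):
        row = []
        for i in range(0, dim, 2):
            angle = pos / (10000 ** (i / dim))
            row.append(math.sin(angle))
            row.append(math.cos(angle))
        table.append(row)
    return table


table = sinusoidal_position_embedding(num_positions=4, dim=6)
print(len(table), len(table[0]))
```

Because the table has no learnable parameters, it can be computed once and reused for any model whose embedding dimension matches.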
enhancement

opened by abhi-glitchhg 2
Add sharpness-aware optimizer

This paper describes how promoting smoothness with a recently proposed sharpness-aware optimizer substantially improves the performance of ViTs.

It would be good to have an implementation of this optimizer in our library. It would fit in the functional module.

A couple of PyTorch implementations are here and here.
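
As a rough illustration of the idea, the sketch below assumes the two-step sharpness-aware minimization (SAM) update: compute the gradient, step a distance rho along its normalized direction, then apply the gradient measured at that perturbed point to the original weights. It is plain Python on a toy quadratic loss; sam_step, its lr and rho defaults, and the toy loss are all hypothetical, and this is neither VFormer API nor either of the linked implementations.

```python
import math
from typing import Callable, List

Vector = List[float]


def sam_step(
    params: Vector,
    grad_fn: Callable[[Vector], Vector],
    lr: float = 0.1,
    rho: float = 0.05,
) -> Vector:
    """One SAM-style update (illustrative sketch, not a library API)."""
    grad = grad_fn(params)
    norm = math.sqrt(sum(g * g for g in grad))
    if norm == 0.0:
        # Zero gradient: no ascent direction to perturb along.
        return list(params)
    # Ascent step: move to the nearby point of (approximately) highest loss.
    perturbed = [p + rho * g / norm for p, g in zip(params, grad)]
    # Descent step: use the gradient at the perturbed point,
    # but apply it to the original parameters.
    sharp_grad = grad_fn(perturbed)
    return [p - lr * g for p, g in zip(params, sharp_grad)]


def toy_loss(w: Vector) -> float:
    return w[0] ** 2 + 2.0 * w[1] ** 2


def toy_grad(w: Vector) -> Vector:
    return [2.0 * w[0], 4.0 * w[1]]


w = [1.0, 1.0]
for _ in range(100):
    w = sam_step(w, toy_grad)
print(toy_loss(w))
```

Each step needs two gradient evaluations, so a PyTorch version would typically run two forward and backward passes per batch.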

opened by NeelayS 2
Documentation related to visualization methods

I have added some fixes for page breaks in #86.

Still, we need to improve the docs for the visualization methods.
We can include the license/copyright disclaimer for the visualization methods in our license, or keep it in a separate file.

Additionally, we can add sample outputs from these methods to the docs.

CC : @NeelayS @aditya-agrawal-30502 @alvanli
documentation enhancement good first issue

opened by abhi-glitchhg 1
[Paper] Visual Attention Network

Paper: https://arxiv.org/abs/2202.09741
Code: https://github.com/Visual-Attention-Network/VAN-Classification and https://github.com/Visual-Attention-Network/VAN-Segmentation
Paper implementation

opened by abhi-glitchhg 0

Releases(v0.1.3)

v0.1.3(Jul 3, 2022)

Source code(tar.gz)
Source code(zip)
v0.1.2(Apr 7, 2022)

Source code(tar.gz)
Source code(zip)
v0.1.0(Feb 9, 2022)

First release of VFormer!
Source code(tar.gz)
Source code(zip)

Owner

Society for Artificial Intelligence and Deep Learning

GitHub Repository

Random Walk Graph Neural Networks

Random Walk Graph Neural Networks This repository is the official implementation of Random Walk Graph Neural Networks. Requirements Code is written in

38 Jan 02, 2023

Code for AA-RMVSNet: Adaptive Aggregation Recurrent Multi-view Stereo Network (ICCV 2021).

AA-RMVSNet Code for AA-RMVSNet: Adaptive Aggregation Recurrent Multi-view Stereo Network (ICCV 2021) in PyTorch. paper link: arXiv | CVF Change Log Ju

97 Dec 30, 2022

Dynamic vae - Dynamic VAE algorithm is used for anomaly detection of battery data

Dynamic VAE frame Automatic feature extraction can be achieved by probability di

10 Oct 07, 2022

Analyzes your GitHub Profile and presents you with a report on how likely you are to become the next MLH Fellow!

Fellowship Prediction GitHub Profile Comparative Analysis Tool Built with BentoML Table of Contents: Features Disclaimer Technologies Used Contributin

51 Dec 29, 2022

Source code for the BMVC-2021 paper "SimReg: Regression as a Simple Yet Effective Tool for Self-supervised Knowledge Distillation".

SimReg: A Simple Regression Based Framework for Self-supervised Knowledge Distillation Source code for the paper "SimReg: Regression as a Simple Yet E

9 Oct 15, 2022

This is the official github repository of the Met dataset

The Met dataset This is the official github repository of the Met dataset. The official webpage of the dataset can be found here. What is it? This cod

35 Dec 17, 2022

Sound Source Localization for AI Grand Challenge 2021

Sound-Source-Localization Sound Source Localization study for AI Grand Challenge 2021 (sponsored by NC Soft Vision Lab) Preparation 1. Place the data-

19 Mar 29, 2022

An implementation of the AlphaZero algorithm for Gomoku (also called Gobang or Five in a Row)

AlphaZero-Gomoku This is an implementation of the AlphaZero algorithm for playing the simple board game Gomoku (also called Gobang or Five in a Row) f

2.8k Dec 26, 2022

Code for ICE-BeeM paper - NeurIPS 2020

ICE-BeeM: Identifiable Conditional Energy-Based Deep Models Based on Nonlinear ICA This repository contains code to run and reproduce the experiments

65 Dec 22, 2022

HackBMU-5.0-Team-Ctrl-Alt-Elite - HackBMU 5.0 Team Ctrl Alt Elite

HackBMU-5.0-Team-Ctrl-Alt-Elite The search is over. We present to you ‘Health-A-

3 Feb 19, 2022

A command line simple note taking app

Why yet another note taking program? note was designed with a very specific target in mind: me, and my 2354 scraps of paper. It runs from the command

64 Nov 20, 2022

Object DGCNN and DETR3D, Our implementations are built on top of MMdetection3D.

This repo contains the implementations of Object DGCNN (https://arxiv.org/abs/2110.06923) and DETR3D (https://arxiv.org/abs/2110.06922). Our implementations are built on top of MMdetection3D.

539 Jan 07, 2023

Code for all the Advent of Code'21 challenges mostly written in python

Advent of Code 21 Code for all the Advent of Code'21 challenges mostly written in python. They are not necessarily the best or fastest solutions but j

4 May 26, 2022

QSYM: A Practical Concolic Execution Engine Tailored for Hybrid Fuzzing

QSYM: A Practical Concolic Execution Engine Tailored for Hybrid Fuzzing Environment Tested on Ubuntu 14.04 64bit and 16.04 64bit Installation # disabl

581 Dec 30, 2022

Python wrapper class for OpenVINO Model Server. User can submit inference request to OVMS with just a few lines of code

Python wrapper class for OpenVINO Model Server. User can submit inference request to OVMS with just a few lines of code.

7 Jul 27, 2022

The official implementation of Variable-Length Piano Infilling (VLI).

Variable-Length-Piano-Infilling The official implementation of Variable-Length Piano Infilling (VLI). (paper: Variable-Length Music Score Infilling vi

29 Sep 01, 2022

Residual Dense Net De-Interlace Filter (RDNDIF)

Residual Dense Net De-Interlace Filter (RDNDIF) Work in progress deep de-interlacer filter. It is based on the architecture proposed by Bernasconi et

7 Feb 15, 2022

Weakly Supervised Learning of Rigid 3D Scene Flow

Weakly Supervised Learning of Rigid 3D Scene Flow This repository provides code and data to train and evaluate a weakly supervised method for rigid 3D

124 Dec 27, 2022

METER: Multimodal End-to-end TransformER

METER Code and pre-trained models will be publicized soon. Citation @article{dou2021meter, title={An Empirical Study of Training End-to-End Vision-a

257 Jan 06, 2023

Pytorch implementation of FlowNet 2.0: Evolution of Optical Flow Estimation with Deep Networks

flownet2-pytorch Pytorch implementation of FlowNet 2.0: Evolution of Optical Flow Estimation with Deep Networks. Multiple GPU training is supported, a

2.8k Dec 27, 2022
