An implementation of the efficient attention module.

Last update: Dec 15, 2022

Overview

Efficient Attention

An implementation of the efficient attention module.

Description

Efficient attention is an attention mechanism that substantially optimizes the memory and computational efficiency while retaining exactly the same expressive power as the conventional dot-product attention. The illustration above compares the two types of attention. The efficient attention module is a drop-in replacement for the non-local module (Wang et al., 2018), while it:

uses less resources to achieve the same accuracy;
achieves higher accuracy with the same resource constraints (by allowing more insertions); and
is applicable in domains and models where the non-local module is not (due to resource constraints).

Resources

YouTube:

Presentation: https://youtu.be/_wnjhTM04NM

bilibili (for users in Mainland China):

Presentation: https://www.bilibili.com/video/BV1tK4y1f7Rm
Presentation in Chinese: https://www.bilibili.com/video/bv1Gt4y1Y7E3

Implementation details

This repository implements the efficient attention module with softmax normalization, output reprojection, and residual connection.

Features not in the paper

This repository implements additionally implements the multi-head mechanism which was not in the paper. To learn more about the mechanism, refer to Vaswani et al.

Citation

The paper will appear at WACV 2021. If you use, compare with, or refer to this work, please cite

@inproceedings{shen2021efficient,
    author = {Zhuoran Shen and Mingyuan Zhang and Haiyu Zhao and Shuai Yi and Hongsheng Li},
    title = {Efficient Attention: Attention with Linear Complexities},
    booktitle = {WACV},
    year = {2021},
}

An implementation of the efficient attention module.

Related tags

Overview

Efficient Attention

Description

Resources

Implementation details

Features not in the paper

Citation

Owner

Shen Zhuoran

Zeyuan Chen, Yangchao Wang, Yang Yang and Dong Liu.

DLWP: Deep Learning Weather Prediction

This is an official implementation of the High-Resolution Transformer for Dense Prediction.

Experiment about Deep Person Re-identification with EfficientNet-v2

CLOOB: Modern Hopfield Networks with InfoLOOB Outperform CLIP

[CVPR 21] Vectorization and Rasterization: Self-Supervised Learning for Sketch and Handwriting, IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), 2021.

UIUCTF 2021 Public Challenge Repository

The source code of the ICCV2021 paper "PIRenderer: Controllable Portrait Image Generation via Semantic Neural Rendering"

QHack—the quantum machine learning hackathon

Keras + Hyperopt: A very simple wrapper for convenient hyperparameter optimization

This repository contains datasets and baselines for benchmarking Chinese text recognition.

Torch implementation of "Enhanced Deep Residual Networks for Single Image Super-Resolution"

Code for the paper "Reinforcement Learning as One Big Sequence Modeling Problem"

Taichi Course Homework Template

Implementation of the Remixer Block from the Remixer paper, in Pytorch

Bag of Tricks for Natural Policy Gradient Reinforcement Learning

AutoPentest-DRL: Automated Penetration Testing Using Deep Reinforcement Learning

ICON: Implicit Clothed humans Obtained from Normals

Subdivision-based Mesh Convolutional Networks

MQBench: Towards Reproducible and Deployable Model Quantization Benchmark