External Attention Network

Last update: Dec 11, 2022

Related tags

Deep Learning -EANet

Overview

Beyond Self-attention: External Attention using Two Linear Layers for Visual Tasks

paper : https://arxiv.org/abs/2105.02358

Jittor code will come soon

Pascal VOC test result link

Other implementation:

Pytorch : https://github.com/xmu-xiaoma666/External-Attention-pytorch

TODO

release jittor semantic segmentation code and checkpoint.
release torch semantic segmentation code and checkpoint.
release point cloud related code and checkpoint.
merge segmentation module into mmsegmentation to reproduce the ADE20K and Cityscapes dataset results.
merge PyTorch-StudioGAN to reproduce the GAN results.

Acknowledgments

We would like to sincerely thank HamNet_seg, EMANet_seg, openseg, T2T-ViT, mmsegmentation and PyTorch-StudioGAN for their awesome released code.

Astract

Attention mechanisms, especially self-attention, play an increasingly important role in deep feature representation in visual tasks. Self-attention updates the feature at each position by computing a weighted sum of features using pair-wise affinities across all positions to capture long-range dependency within a single sample. However, self-attention has a quadratic complexity and ignores potential correlation between different samples. This paper proposes a novel attention mechanism which we call external attention, based on two external, small, learnable, and shared memories, which can be implemented easily by simply using two cascaded linear layers and two normalization layers; it conveniently replaces self-attention in existing popular architectures. External attention has linear complexity and implicitly considers the correlations between all samples. Extensive experiments on image classification, semantic segmentation, image generation, point cloud classification and point cloud segmentation tasks reveal that our method provides comparable or superior performance to the self-attention mechanism and some of its variants, with much lower computational and memory costs.

Jittor

Jittor is a high-performance deep learning framework which is easy to learn and use. It provides interfaces like Pytorch.

You can learn how to use Jittor in following links:

Jittor homepage: https://cg.cs.tsinghua.edu.cn/jittor/

Jittor github: https://github.com/Jittor/jittor

If you has any questions about Jittor, you can ask in Jittor developer QQ Group: 761222083

Citation

If it is helpful for your work, please cite this paper:

@misc{guo2021attention,
      title={Beyond Self-attention: External Attention using Two Linear Layers for Visual Tasks}, 
      author={Meng-Hao Guo and Zheng-Ning Liu and Tai-Jiang Mu and Shi-Min Hu},
      year={2021},
      eprint={2105.02358},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}

External Attention Network

Related tags

Overview

Beyond Self-attention: External Attention using Two Linear Layers for Visual Tasks

Jittor code will come soon

Pascal VOC test result link

Other implementation:

TODO

Acknowledgments

Astract

Jittor

Citation

Owner

MenghaoGuo

Pytorch implementations of popular off-policy multi-agent reinforcement learning algorithms, including QMix, VDN, MADDPG, and MATD3.

This is a collection of our NAS and Vision Transformer work.

[ICLR 2022] DAB-DETR: Dynamic Anchor Boxes are Better Queries for DETR

This repository contains the source code for the paper First Order Motion Model for Image Animation

Deep learning with TensorFlow and earth observation data.

TransMVSNet: Global Context-aware Multi-view Stereo Network with Transformers.

Code for the paper "Query Embedding on Hyper-relational Knowledge Graphs"

PyTorch Implementation of Small Lesion Segmentation in Brain MRIs with Subpixel Embedding (ORAL, MICCAIW 2021)

Distance correlation and related E-statistics in Python

Implementation of "Fast and Flexible Temporal Point Processes with Triangular Maps" (Oral @ NeurIPS 2020)

code for our paper "Source Data-absent Unsupervised Domain Adaptation through Hypothesis Transfer and Labeling Transfer"

[Pedestron] Generalizable Pedestrian Detection: The Elephant In The Room. @ CVPR2021

Towards Rolling Shutter Correction and Deblurring in Dynamic Scenes (CVPR2021)

Noether Networks: meta-learning useful conserved quantities

[CVPR 2022] "The Principle of Diversity: Training Stronger Vision Transformers Calls for Reducing All Levels of Redundancy" by Tianlong Chen, Zhenyu Zhang, Yu Cheng, Ahmed Awadallah, Zhangyang Wang

a practicable framework used in Deep Learning. So far UDL only provide DCFNet implementation for the ICCV paper (Dynamic Cross Feature Fusion for Remote Sensing Pansharpening)

Statistical-Rethinking-with-Python-and-PyMC3 - Python/PyMC3 port of the examples in " Statistical Rethinking A Bayesian Course with Examples in R and Stan" by Richard McElreath

Autoencoder - Reducing the Dimensionality of Data with Neural Network

A tool to visualise the results of AlphaFold2 and inspect the quality of structural predictions

OpenMMLab Model Deployment Toolset