Locally Enhanced Self-Attention: Rethinking Self-Attention as Local and Context Terms

Last update: Dec 31, 2021

Related tags

Overview

LESA

Introduction

This repository contains the official implementation of Locally Enhanced Self-Attention: Rethinking Self-Attention as Local and Context Terms. The code for image classification and object detection is based on axial-deeplab and mmdetection.

Citing LESA

If you find LESA is helpful in your project, please consider citing our paper.

@article{yang2021locally,
  title={Locally Enhanced Self-Attention: Rethinking Self-Attention as Local and Context Terms},
  author={Yang, Chenglin and Qiao, Siyuan and Kortylewski, Adam and Yuille, Alan},
  journal={arXiv preprint arXiv:2107.05637},
  year={2021}
}

Main Results on ImageNet

Please refer to LESA_classification for details.

Method	Model	Top-1 Acc.	Top-5 Acc.
LESA_ResNet50	Download	79.55	94.79
LESA_WRN50	Download	80.18	95.07

Main Results on COCO test-dev

Please refer to LESA_detection for details.

Method	Backbone	Pretrained	Model	Box AP	Mask AP
Mask-RCNN	LESA_ResNet50	Download	Download	44.2	39.6
HTC	LESA_WRN50	Download	Download	50.5	44.4

Credits

This project is based on axial-deeplab and mmdetection.

Relative position embedding is based on bottleneck-transformer-pytorch

ResNet is based on pytorch/vision. Classification helper functions are based on pytorch-classification.

Locally Enhanced Self-Attention: Rethinking Self-Attention as Local and Context Terms

Related tags

Overview

LESA

Introduction

Citing LESA

Main Results on ImageNet

Main Results on COCO test-dev

Credits

Owner

Chenglin Yang

LWCC: A LightWeight Crowd Counting library for Python that includes several pretrained state-of-the-art models.

Simple node deletion tool for onnx.

Survival analysis in Python

Official repository for the paper, MidiBERT-Piano: Large-scale Pre-training for Symbolic Music Understanding.

Source code for Fathony, Sahu, Willmott, & Kolter, "Multiplicative Filter Networks", ICLR 2021.

Codes for AAAI22 paper "Learning to Solve Travelling Salesman Problem with Hardness-Adaptive Curriculum"

Implementation of Cross Transformer for spatially-aware few-shot transfer, in Pytorch

A code generator from ONNX to PyTorch code

Answering Open-Domain Questions of Varying Reasoning Steps from Text

Some pre-commit hooks for OpenMMLab projects

Unsupervised Video Interpolation using Cycle Consistency

Implementation for paper MLP-Mixer: An all-MLP Architecture for Vision

The official code repo of "HTS-AT: A Hierarchical Token-Semantic Audio Transformer for Sound Classification and Detection"

using STGCN to achieve egg classification task

magiCARP: Contrastive Authoring+Reviewing Pretraining

A Python library for Deep Graph Networks

A Python parser that takes the content of a text file and then reads it into variables.

This repo is for segmentation of T2 hyp regions in gliomas.

The lightweight PyTorch wrapper for high-performance AI research. Scale your models, not the boilerplate.

Differential fuzzing for the masses!