Unifying Global-Local Representations in Salient Object Detection with Transformer

Last update: Aug 24, 2022

Overview

GLSTR (Global-Local Saliency Transformer)

This is the official implementation of the paper "Unifying Global-Local Representations in Salient Object Detection with Transformer" by Sucheng Ren, Qiang Wen, Nanxuan Zhao, Guoqiang Han, and Shengfeng He.

Prerequisites

The whole training process can be done on eight RTX 2080 Ti or four RTX 3090 GPUs.

PyTorch 1.6

Datasets

Training Set

We use the training set of DUTS (DUTS-TR) to train our model.

/path/to/DUTS-TR/
   img/
      img1.jpg
   label/
      label1.png
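Before training, it can help to check that every image has a matching label. The sketch below is not part of the official code: the helper name `pair_images_and_labels` is hypothetical, and it assumes each image in `img/` and its mask in `label/` share the same file stem (the placeholder names above do not), with `.jpg` images and `.png` masks.

```python
from pathlib import Path


def pair_images_and_labels(root):
    """Pair img/*.jpg with label/*.png by file stem (an assumed convention).

    Returns (pairs, unmatched), where pairs is a list of
    (image_path, label_path) sorted by stem and unmatched is a sorted list
    of stems that appear in only one of the two folders.
    """
    root = Path(root)
    images = {p.stem: p for p in (root / "img").glob("*.jpg")}
    labels = {p.stem: p for p in (root / "label").glob("*.png")}

    shared = sorted(images.keys() & labels.keys())
    unmatched = sorted(images.keys() ^ labels.keys())
    pairs = [(images[stem], labels[stem]) for stem in shared]
    return pairs, unmatched


if __name__ == "__main__":
    # Replace with the real location of DUTS-TR on your machine.
    pairs, unmatched = pair_images_and_labels("/path/to/DUTS-TR")
    print(f"{len(pairs)} image/label pairs, {len(unmatched)} unmatched stems")
```

A non-empty `unmatched` list usually means a file is missing or the two folders use different naming.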

Testing Set

We test our model on the testing set of DUTS (DUTS-TE), ECSSD, HKU-IS, PASCAL-S, DUT-OMRON, and SOD.

Training

Download the transformer backbone pretrained on ImageNet.

# Set the paths to the training data and the pretrained backbone in train.sh first
bash train.sh

Testing

Download the pretrained model from Baidu Pan (code: uo0a) or Google Drive, and put it in ./ckpt/

python test.py

Evaluation

The precomputed saliency maps (DUTS-TE, ECSSD, HKU-IS, PASCAL-S, DUT-OMRON, and SOD) can be found at Baidu Pan (code: uo0a) or Google Drive.

After the paper was submitted, we retrained the model and its performance improved. Feel free to use either the results reported in our paper or the precomputed saliency maps.
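This README does not say which metrics or evaluation scripts the authors use. As an illustration only, mean absolute error (MAE) is one metric commonly reported for salient object detection, and a minimal sketch of it is shown below. The function name `mean_absolute_error` is hypothetical, and the code assumes the saliency map and the ground-truth mask have already been loaded as equally sized 2D lists of 8-bit values (0 to 255).

```python
def mean_absolute_error(pred, gt):
    """MAE between two equally sized 2D maps of 8-bit values, scaled to [0, 1]."""
    if len(pred) != len(gt) or any(len(p) != len(g) for p, g in zip(pred, gt)):
        raise ValueError("prediction and ground truth must have the same shape")

    total = 0.0
    count = 0
    for pred_row, gt_row in zip(pred, gt):
        for p, g in zip(pred_row, gt_row):
            # Normalize the per-pixel difference from [0, 255] to [0, 1].
            total += abs(p - g) / 255.0
            count += 1

    if count == 0:
        raise ValueError("maps must contain at least one pixel")
    return total / count


if __name__ == "__main__":
    prediction = [[0, 255], [255, 0]]
    ground_truth = [[0, 255], [0, 0]]
    print(mean_absolute_error(prediction, ground_truth))  # one of four pixels is fully wrong
```

Lower is better; averaging this value over every image in a test set gives the dataset-level score.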

Contact

If you have any questions, feel free to email Sucheng Ren :) ([email protected])

Citation

Please cite our paper if you find the code and paper helpful.

@article{ren2021unifying,
  title={Unifying Global-Local Representations in Salient Object Detection with Transformer},
  author={Ren, Sucheng and Wen, Qiang and Zhao, Nanxuan and Han, Guoqiang and He, Shengfeng},
  journal={arXiv preprint arXiv:2108.02759},
  year={2021}
}
