Official implementation of "SegFormer: Simple and Efficient Design for Semantic Segmentation with Transformers"

Last update: Dec 31, 2022

Overview

SegFormer: Simple and Efficient Design for Semantic Segmentation with Transformers

Figure 1: Performance of SegFormer-B0 to SegFormer-B5.

Project page | Paper | Demo (Youtube) | Demo (Bilibili)

SegFormer: Simple and Efficient Design for Semantic Segmentation with Transformers.
Enze Xie, Wenhai Wang, Zhiding Yu, Anima Anandkumar, Jose M. Alvarez, and Ping Luo.
Technical Report 2021.

This repository contains the PyTorch training/evaluation code and the pretrained models for SegFormer.

SegFormer is a simple, efficient and powerful semantic segmentation method, as shown in Figure 1.

We use MMSegmentation v0.13.0 as the codebase.

Installation

For installation and data preparation, please refer to the guidelines in MMSegmentation v0.13.0.

Other requirements: pip install timm==0.3.2
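Because timm is pinned to an exact version, it can help to confirm what is installed before running anything. The following is a minimal sketch, not part of this repository: the helper name `check_pin` is hypothetical, and it uses only the standard-library `importlib.metadata` module.

```python
from importlib import metadata


def check_pin(package: str, expected: str) -> str:
    """Compare the installed version of `package` with a pinned version.

    Returns "ok", "missing", or "mismatch:<found version>".
    `check_pin` is a hypothetical helper, not part of SegFormer.
    """
    try:
        found = metadata.version(package)
    except metadata.PackageNotFoundError:
        return "missing"
    return "ok" if found == expected else f"mismatch:{found}"


if __name__ == "__main__":
    # 0.3.2 is the version pinned in the requirement above.
    print("timm:", check_pin("timm", "0.3.2"))
```

A result other than "ok" suggests re-running the pip command above inside the active environment.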

Evaluation

Download trained weights.

Example: evaluate SegFormer-B1 on ADE20K:

# Single-gpu testing
python tools/test.py local_configs/segformer/B1/segformer.b1.512x512.ade.160k.py /path/to/checkpoint_file

# Multi-gpu testing
./tools/dist_test.sh local_configs/segformer/B1/segformer.b1.512x512.ade.160k.py /path/to/checkpoint_file <GPU_NUM>

# Multi-gpu, multi-scale testing
./tools/dist_test.sh local_configs/segformer/B1/segformer.b1.512x512.ade.160k.py /path/to/checkpoint_file <GPU_NUM> --aug-test
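The three evaluation commands differ only in the launcher, the GPU count, and the `--aug-test` flag. The sketch below assembles them as argument lists so they can be scripted. The function `build_test_command` is a hypothetical helper, not part of this repository. Passing `--aug-test` in the single-GPU case is an assumption, since the examples above show the flag only with the distributed launcher.

```python
import shlex
from typing import List, Optional

# Config path taken from the SegFormer-B1 / ADE20K example above.
CONFIG = "local_configs/segformer/B1/segformer.b1.512x512.ade.160k.py"


def build_test_command(
    config: str,
    checkpoint: str,
    gpus: Optional[int] = None,
    multi_scale: bool = False,
) -> List[str]:
    """Build an evaluation command as an argument list (hypothetical helper).

    gpus=None selects the single-GPU form; an integer selects the
    distributed launcher with that many GPUs.
    """
    if gpus is None:
        cmd = ["python", "tools/test.py", config, checkpoint]
    else:
        cmd = ["./tools/dist_test.sh", config, checkpoint, str(gpus)]
    if multi_scale:
        # Assumption: the examples above only show this flag together
        # with the distributed launcher.
        cmd.append("--aug-test")
    return cmd


if __name__ == "__main__":
    # Print the multi-GPU, multi-scale command for 8 GPUs.
    print(shlex.join(build_test_command(CONFIG, "/path/to/checkpoint_file", 8, True)))
```

The resulting list can be passed to `subprocess.run` from the repository root, which avoids shell-quoting problems with checkpoint paths that contain spaces.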

Training

Download weights pretrained on ImageNet-1K, and put them in a folder pretrained/.
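Before starting a training run, a quick check that the pretrained/ folder exists and contains weight files can save a failed launch. This is a minimal sketch: the helper `list_checkpoints` is hypothetical, and the `.pth` suffix is an assumption, since the file names of the downloaded weights are not stated here.

```python
from pathlib import Path
from typing import List


def list_checkpoints(folder: str = "pretrained", suffix: str = ".pth") -> List[str]:
    """Create `folder` if needed and return the weight files found in it.

    Hypothetical helper; the ".pth" suffix is an assumption about how
    the downloaded ImageNet-1K weights are named.
    """
    root = Path(folder)
    root.mkdir(parents=True, exist_ok=True)
    return sorted(
        p.name for p in root.iterdir() if p.is_file() and p.suffix == suffix
    )


if __name__ == "__main__":
    found = list_checkpoints()
    print(found if found else "No weights found in pretrained/ yet.")
```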

Example: train SegFormer-B1 on ADE20K:

# Single-gpu training
python tools/train.py local_configs/segformer/B1/segformer.b1.512x512.ade.160k.py 

# Multi-gpu training
./tools/dist_train.sh local_configs/segformer/B1/segformer.b1.512x512.ade.160k.py <GPU_NUM>

License

Please check the LICENSE file. SegFormer may be used non-commercially, meaning for research or evaluation purposes only. For business inquiries, please contact [email protected].

Citation

@article{xie2021segformer,
  title={SegFormer: Simple and Efficient Design for Semantic Segmentation with Transformers},
  author={Xie, Enze and Wang, Wenhai and Yu, Zhiding and Anandkumar, Anima and Alvarez, Jose M and Luo, Ping},
  journal={arXiv preprint arXiv:2105.15203},
  year={2021}
}

Owner

NVIDIA Research Projects
