Per-Pixel Classification is Not All You Need for Semantic Segmentation

Last update: Jan 08, 2023

Overview

MaskFormer: Per-Pixel Classification is Not All You Need for Semantic Segmentation

Bowen Cheng, Alexander G. Schwing, Alexander Kirillov

[arXiv] [Project] [BibTeX]

Features

Better results while being more efficient.
Unified view of semantic- and instance-level segmentation tasks.
Supports major semantic segmentation datasets: ADE20K, Cityscapes, COCO-Stuff, Mapillary Vistas.
Supports all Detectron2 models.
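
The "unified view" above rests on mask classification: the model predicts a set of binary masks, each paired with one class distribution, instead of classifying every pixel directly. The following is a minimal sketch of how such predictions could be merged into a semantic label map. It is not the repository's code. The function names, the plain-list inputs, and the toy values are assumptions made for illustration, and it assumes the last class column is a "no object" entry that is dropped before scoring.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(v - m) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def semantic_inference(class_logits, mask_logits):
    """Merge per-query predictions into a per-pixel label map.

    class_logits: Q rows of K + 1 logits; the last column is assumed
                  to be the "no object" class (hypothetical layout).
    mask_logits:  Q masks, each an H x W grid of logits.
    Returns an H x W grid of class indices in [0, K).
    """
    num_queries = len(class_logits)
    num_classes = len(class_logits[0]) - 1
    height, width = len(mask_logits[0]), len(mask_logits[0][0])

    # Class probabilities per query, with "no object" dropped.
    class_probs = [softmax(row)[:-1] for row in class_logits]

    label_map = []
    for y in range(height):
        row = []
        for x in range(width):
            # Score of class k at this pixel: sum over queries of
            # (class probability) * (mask probability).
            scores = [
                sum(
                    class_probs[q][k] * sigmoid(mask_logits[q][y][x])
                    for q in range(num_queries)
                )
                for k in range(num_classes)
            ]
            row.append(max(range(num_classes), key=scores.__getitem__))
        label_map.append(row)
    return label_map


# Toy input: query 0 predicts class 0 on the left column,
# query 1 predicts class 1 on the right column.
class_logits = [[10.0, 0.0, 0.0], [0.0, 10.0, 0.0]]
mask_logits = [
    [[10.0, -10.0], [10.0, -10.0]],
    [[-10.0, 10.0], [-10.0, 10.0]],
]
print(semantic_inference(class_logits, mask_logits))  # [[0, 1], [0, 1]]
```

Because every query carries its own mask, the same set of predictions can also be read out per mask rather than per pixel, which is what makes an instance-level view possible without changing the model output.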

Installation

See installation instructions.

Getting Started

See Preparing Datasets for MaskFormer.

See Getting Started with MaskFormer.

Model Zoo and Baselines

We provide a large set of baseline results and trained models available for download in the MaskFormer Model Zoo.

License

The majority of MaskFormer is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.

However, portions of the project are available under separate license terms: Swin-Transformer-Semantic-Segmentation is licensed under the MIT license.

Citing MaskFormer

If you use MaskFormer in your research or wish to refer to the baseline results published in the Model Zoo, please use the following BibTeX entry.

@article{cheng2021maskformer,
  title={Per-Pixel Classification is Not All You Need for Semantic Segmentation},
  author={Bowen Cheng and Alexander G. Schwing and Alexander Kirillov},
  journal={arXiv},
  year={2021}
}

Owner

Facebook Research
