Mask2Former: Masked-attention Mask Transformer for Universal Image Segmentation in TensorFlow 2

Last update: Dec 16, 2021

Related tags

Deep Learning Mask2Former

Overview

Mask2Former: Masked-attention Mask Transformer for Universal Image Segmentation in TensorFlow 2

Bowen Cheng, Ishan Misra, Alexander G. Schwing, Alexander Kirillov, Rohit Girdhar [arXiv]

Features

A single architecture for three tasks: panoptic, instance and semantic segmentation. This straightforward mini project was built as part of the main project, IST: A TensorFlow 2 compatible instance segmentation toolbox, with the purpose of adapting recent research into segmentation approaches into TensorFlow.
Support common benchmark datasets: ADE20K, Cityscapes, COCO, Mapillary Vistas.

Getting started

Project is currently being built, with SwinTransformerV1 and SwinTransformerV2 and a few bits and pieces ready.

License

Shield:

The majority of MaskFormer is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.

However portions of the project are available under separate license terms: Swin-Transformer-Semantic-Segmentation is licensed under the MIT license.

Citation

@article{cheng2021mask2former,
  title={Masked-attention Mask Transformer for Universal Image Segmentation},
  author={Bowen Cheng and Ishan Misra and Alexander G. Schwing and Alexander Kirillov and Rohit Girdhar},
  journal={arXiv},
  year={2021}
}

Mask2Former: Masked-attention Mask Transformer for Universal Image Segmentation in TensorFlow 2

Related tags

Overview

Mask2Former: Masked-attention Mask Transformer for Universal Image Segmentation in TensorFlow 2

Features

Getting started

License

Citation

Owner

Phan Nguyen

[ICML 2022] The official implementation of Graph Stochastic Attention (GSAT).

A tool to visualise the results of AlphaFold2 and inspect the quality of structural predictions

A PyTorch implementation of "Graph Wavelet Neural Network" (ICLR 2019)

Useful materials and tutorials for 110-1 NTU DBME5028 (Application of Deep Learning in Medical Imaging)

Back to the Feature: Learning Robust Camera Localization from Pixels to Pose (CVPR 2021)

Implementation of ICCV 2021 oral paper -- A Novel Self-Supervised Learning for Gaussian Mixture Model

Point cloud processing tool library.

Pytorch implementations of popular off-policy multi-agent reinforcement learning algorithms, including QMix, VDN, MADDPG, and MATD3.

Source code of generalized shuffled linear regression

The implementation of the lifelong infinite mixture model

DCGAN LSGAN WGAN-GP DRAGAN PyTorch

SimplEx - Explaining Latent Representations with a Corpus of Examples

Scaling and Benchmarking Self-Supervised Visual Representation Learning

This is a yolo3 implemented via tensorflow 2.7

Old Photo Restoration (Official PyTorch Implementation)

code for generating data set ES-ImageNet with corresponding training code

Flickr-Faces-HQ (FFHQ) is a high-quality image dataset of human faces, originally created as a benchmark for generative adversarial networks (GAN)

Asymmetric Bilateral Motion Estimation for Video Frame Interpolation, ICCV2021

Train an RL agent to execute natural language instructions in a 3D Environment (PyTorch)

Synthesizing and manipulating 2048x1024 images with conditional GANs