Implementation of the SETR model. Original paper: Rethinking Semantic Segmentation from a Sequence-to-Sequence Perspective with Transformers.

Last update: Dec 16, 2022

Overview

SETR - Pytorch

Since the original paper (Rethinking Semantic Segmentation from a Sequence-to-Sequence Perspective with Transformers) has no official code, I implemented SETR with Progressive UPsampling (SETR-PUP) in PyTorch.


Vit

The Vision Transformer (ViT) model is also implemented, as the Vit class, and can be used for image classification.

Usage SETR

from SETR.transformer_seg import SETRModel
import torch

if __name__ == "__main__":
    # Build SETR-PUP with 32x32 patches, 8 hidden layers and one output channel.
    net = SETRModel(patch_size=(32, 32),
                    in_channels=3,
                    out_channels=1,
                    hidden_size=1024,
                    num_hidden_layers=8,
                    num_attention_heads=16,
                    decode_features=[512, 256, 128, 64])
    # Dummy batch: one 3-channel 256x256 image.
    t1 = torch.rand(1, 3, 256, 256)
    print("input: " + str(t1.shape))

    # print(net)
    # Run a forward pass and report the output shape.
    print("output: " + str(net(t1).shape))

If the output shape is (1, 1, 256, 256), that is, the input resolution with out_channels=1, the code ran successfully.
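The shapes in the example above can be checked by hand. The following is a minimal sketch, not code from this repository: `pup_shape_plan` is a hypothetical helper that assumes a square, power-of-two patch size and a decoder that restores resolution purely through 2x upsampling steps. It reports the token grid, the sequence length the transformer sees, and how many doublings are needed to get back to the input size.

```python
def pup_shape_plan(image_size, patch_size):
    """Return (token grid, sequence length, number of 2x upsampling steps).

    Hypothetical helper for illustration; it is not part of the SETR package.
    """
    h, w = image_size
    ph, pw = patch_size
    if h % ph or w % pw:
        raise ValueError("image size must be divisible by patch size")
    if ph != pw or ph & (ph - 1):
        raise ValueError("this sketch assumes a square, power-of-two patch size")
    grid = (h // ph, w // pw)          # patches along height and width
    seq_len = grid[0] * grid[1]        # tokens fed to the transformer
    steps = ph.bit_length() - 1        # log2(patch size) doublings
    return grid, seq_len, steps


grid, seq_len, steps = pup_shape_plan((256, 256), (32, 32))
print(grid, seq_len, steps)  # (8, 8) 64 5
```

For the 256x256 input with 32x32 patches, the 8x8 token grid has to be doubled five times to return to 256x256. How the repository's decoder distributes those doublings across decode_features is not shown in this README.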

Usage Vit

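from SETR.transformer_seg import Vit
import torch

if __name__ == "__main__":
    # Build a small ViT classifier: 7x7 patches, 1 hidden layer, 10 output classes.
    model = Vit(patch_size=(7, 7),
                in_channels=1,
                out_class=10,
                hidden_size=1024,
                num_hidden_layers=1,
                num_attention_heads=16)
    print(model)
    # Dummy batch: one single-channel 28x28 image (MNIST-sized).
    t1 = torch.rand(1, 1, 28, 28)
    print("input: " + str(t1.shape))

    # Run a forward pass and report the output shape.
    print("output: " + str(model(t1).shape))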

The output shape is (1, 10): one value per class (out_class=10) for the single input image.
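To make the "image as a sequence" idea concrete, here is a rough pure-Python sketch of how a 28x28 single-channel image splits into 7x7 patches. `patchify` is a hypothetical helper written for this illustration, and the row-major patch order is an assumption; the repository's Vit class may build its patch embedding differently.

```python
def patchify(image, patch_size):
    """Split a 2D list (H x W) into flattened patches, in row-major patch order.

    Hypothetical helper for illustration; it is not part of the SETR package.
    """
    ph, pw = patch_size
    h, w = len(image), len(image[0])
    if h % ph or w % pw:
        raise ValueError("image size must be divisible by patch size")
    patches = []
    for top in range(0, h, ph):
        for left in range(0, w, pw):
            # Flatten one ph x pw window into a single vector.
            patch = [image[r][c]
                     for r in range(top, top + ph)
                     for c in range(left, left + pw)]
            patches.append(patch)
    return patches


# A 28x28 "image" whose pixel value encodes its own position.
image = [[r * 28 + c for c in range(28)] for r in range(28)]
patches = patchify(image, (7, 7))
print(len(patches), len(patches[0]))  # 16 49
```

Each of the 16 patch vectors has 49 values. In a ViT-style model each vector would then be projected to hidden_size (1024 in the example above); that projection, and any class token or position embedding the model adds, is not reproduced here.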

Current examples

task_mnist: The simplest example, using the Vit model to classify the MNIST dataset.
task_car_seg: A simple segmentation example. Data download: https://www.kaggle.com/c/carvana-image-masking-challenge/data

More examples will be updated later.

Owner

zhaohu xing
