ST++: Make Self-training Work Better for Semi-supervised Semantic Segmentation

Last update: Jan 03, 2023

Overview

ST++

This is the official PyTorch implementation of our paper:

ST++: Make Self-training Work Better for Semi-supervised Semantic Segmentation.
Lihe Yang, Wei Zhuo, Lei Qi, Yinghuan Shi and Yang Gao.

Getting Started

Data Preparation

Pre-trained Model

ResNet-50 | ResNet-101 | DeepLabv2-ResNet-101

Dataset

Pascal | Augmented Masks | Cityscapes | Class Mapped Masks

File Organization

├── ./pretrained
    ├── resnet50.pth
    ├── resnet101.pth
    └── deeplabv2_resnet101_coco_pretrained.pth
    
├── [Your Pascal Path]
    ├── JPEGImages
    └── SegmentationClass    # replace the official folder with above augmented masks 
    
├── [Your Cityscapes Path]
    ├── gtFine               # replace the official folder with above class mapped masks 
    └── leftImg8bit

Training and Testing

export semi_setting='pascal/1_8/split_0'

CUDA_VISIBLE_DEVICES=0,1 python -W ignore main.py \
  --dataset pascal --data-root [Your Pascal Path] \
  --batch-size 16 --backbone resnet50 --model deeplabv3plus \
  --labeled-id-path dataset/splits/$semi_setting/labeled.txt \
  --unlabeled-id-path dataset/splits/$semi_setting/unlabeled.txt \
  --pseudo-mask-path outdir/pseudo_masks/$semi_setting \
  --save-path outdir/models/$semi_setting

This script is for our ST framework. To run ST++, add --plus --reliable-id-path outdir/reliable_ids/$semi_setting.

Acknowledgement

The DeepLabv2 MS COCO pre-trained model is borrowed and converted from AdvSemiSeg. The image partitions are borrowed from Context-Aware-Consistency and PseudoSeg. Part of the training hyper-parameters and network structures are adapted from PyTorch-Encoding. The strong data augmentations are borrowed from MoCo v2 and PseudoSeg.

AdvSemiSeg: https://github.com/hfslyc/AdvSemiSeg.
Context-Aware-Consistency: https://github.com/dvlab-research/Context-Aware-Consistency.
PseudoSeg: https://github.com/googleinterns/wss.
PyTorch-Encoding: https://github.com/zhanghang1989/PyTorch-Encoding.
MoCo: https://github.com/facebookresearch/moco.
OpenSelfSup: https://github.com/open-mmlab/OpenSelfSup.

Thanks a lot for their great works!

Citation

If you find this project useful, please consider citing:

@article{yang2021st++,
  title={ST++: Make Self-training Work Better for Semi-supervised Semantic Segmentation},
  author={Yang, Lihe and Zhuo, Wei and Qi, Lei and Shi, Yinghuan and Gao, Yang},
  journal={arXiv preprint arXiv:2106.05095},
  year={2021}
}

ST++: Make Self-training Work Better for Semi-supervised Semantic Segmentation

Related tags

Overview

ST++

Getting Started

Data Preparation

Pre-trained Model

Dataset

File Organization

Training and Testing

Acknowledgement

Citation

Owner

Lihe Yang

RIFE: Real-Time Intermediate Flow Estimation for Video Frame Interpolation

Stock-Prediction - prediction of stock market movements using sentiment analysis and deep learning.

ImageBART: Bidirectional Context with Multinomial Diffusion for Autoregressive Image Synthesis

Official Pytorch implementation for video neural representation (NeRV)

Simple improvement of VQVAE that allow to generate x2 sized images compared to baseline

Vertical Federated Principal Component Analysis and Its Kernel Extension on Feature-wise Distributed Data based on Pytorch Framework

A flexible framework of neural networks for deep learning

Official PyTorch implementation of N-ImageNet: Towards Robust, Fine-Grained Object Recognition with Event Cameras (ICCV 2021)

Class-Balanced Loss Based on Effective Number of Samples. CVPR 2019

Implementation of paper "DCS-Net: Deep Complex Subtractive Neural Network for Monaural Speech Enhancement"

City-seeds - A random generator of cultural characteristics intended to spark ideas and help draw threads

Doods2 - API for detecting objects in images and video streams using Tensorflow

Poplar implementation of "Bundle Adjustment on a Graph Processor" (CVPR 2020)

DeepStochlog Package For Python

Deep Structured Instance Graph for Distilling Object Detectors (ICCV 2021)

AI-Fitness-Tracker - AI Fitness Tracker With Python

Code for EMNLP2021 paper "Allocating Large Vocabulary Capacity for Cross-lingual Language Model Pre-training"

NovelD: A Simple yet Effective Exploration Criterion

Official code for the CVPR 2021 paper "How Well Do Self-Supervised Models Transfer?"

Intro-to-dl - Resources for "Introduction to Deep Learning" course.