TransFGU: A Top-down Approach to Fine-Grained Unsupervised Semantic Segmentation

Last update: Dec 16, 2022

Related tags

Deep Learning TransFGU

Overview

TransFGU: A Top-down Approach to Fine-Grained Unsupervised Semantic Segmentation

Zhaoyun Yin, Pichao Wang, Fan Wang, Xianzhe Xu, Hanling Zhang, Hao Li, Rong Jin

[Preprint]

Getting Started

Create the environment

# create conda env
conda create -n TransFGU python=3.8
# activate conda env
conda activate TransFGU
# install pytorch
conda install pytorch=1.8 torchvision cudatoolkit=10.1
# install other dependencies
pip install mmcv-full -f https://download.openmmlab.com/mmcv/dist/cu101/torch1.8.0/index.html
pip install -r requirements.txt

Dataset Preparation

MS-COCO Dataset: Download the trainset, validset, annotations and the json files, place the extracted files into root/data/MSCOCO.
PascalVOC Dataset: Download training/validation data, place the extracted files into root/data/PascalVOC.
Cityscapes Dataset: Download leftImg8bit_trainvaltest.zip and gtFine_trainvaltest.zip, place the extracted files into root/data/Cityscapes.
LIP Dataset: Download TrainVal_images.zip and TrainVal_parsing_annotations.zip, place the extracted files into root/data/LIP.

the structure of dataset folders should be as follow:

data/
    │── MSCOCO/
    │     ├── images/
    │     │     ├── train2017/
    │     │     └── val2017/
    │     └── annotations/
    │           ├── train2017/
    │           ├── val2017/
    │           ├── instances_train2017.json
    │           └── instances_val2017.json
    │── Cityscapes/
    │     ├── leftImg8bit/
    │     │     ├── train/
    │     │     │       ├── aachen
    │     │     │       └── ...
    │     │     └──── val/
    │     │             ├── frankfurt
    │     │             └── ...
    │     └── gtFine/
    │           ├── train/
    │           │       ├── aachen
    │           │       └── ...
    │           └──── val/
    │                   ├── frankfurt
    │                   └── ...
    │── PascalVOC/
    │     ├── JPEGImages/
    │     ├── SegmentationClass/
    │     └── ImageSets/
    │           └── Segmentation/
    │                   ├── train.txt
    │                   └── val.txt
    └── LIP/
          ├── train_images/
          ├── train_segmentations/
          ├── val_images/
          ├── val_segmentations/
          ├── train_id.txt
          └── val_id.txt

Model download

please download the pretrained dino model (deit small 8x8), then place it into root/weight/dino/
download trained model from Google Drive or Baidu Netdisk (code:1118), then place them into root/weight/trained/

Name	mIoU	Pixel Accuracy	Model
COCOStuff-27	16.19	44.52	Google Drive
COCOStuff-171	11.93	34.32	Google Drive
COCO-80	12.69	64.31	Google Drive
Cityscapes	16.83	77.92	Google Drive
Pascal-VOC	37.15	83.59	Google Drive
LIP-5	25.16	65.76	Google Drive
LIP-16	15.49	60.08	Google Drive
LIP-19	12.24	42.52	Google Drive

Train and Evaluate Our Method

To train and evaluate our method on different datasets under desired granularity level, please follow the instructions here.

Citation

If you find our work useful in your research, please consider citing:

@article{yin2021transfgu,
  title={TransFGU: A Top-down Approach to Fine-Grained Unsupervised Semantic Segmentation},
  author={Zhaoyun, Yin and Pichao, Wang and Fan, Wang and Xianzhe, Xu and Hanling, Zhang and Hao, Li and Rong, Jin},
  journal={arXiv preprint arXiv:2112.01515},
  year={2021}
}

LICENSE

The code is released under the MIT license.

TransFGU: A Top-down Approach to Fine-Grained Unsupervised Semantic Segmentation

Related tags

Overview

TransFGU: A Top-down Approach to Fine-Grained Unsupervised Semantic Segmentation

Getting Started

Dataset Preparation

Model download

Train and Evaluate Our Method

Citation

LICENSE

Copyright

Owner

DamoCV

PPLNN is a Primitive Library for Neural Network is a high-performance deep-learning inference engine for efficient AI inferencing

Augmenting Physical Models with Deep Networks for Complex Dynamics Forecasting

Citation Intent Classification in scientific papers using the Scicite dataset an Pytorch

Implementation for the paper SMPLicit: Topology-aware Generative Model for Clothed People (CVPR 2021)

PyTorch Implementation of [1611.06440] Pruning Convolutional Neural Networks for Resource Efficient Inference

Bayesian optimization in PyTorch

Code of TIP2021 Paper《SFace: Sigmoid-Constrained Hypersphere Loss for Robust Face Recognition》. We provide both MxNet and Pytorch versions.

A deep learning CNN model to identify and classify and check if a person is wearing a mask or not.

Food recognition model using convolutional neural network & computer vision

Dynamic Attentive Graph Learning for Image Restoration, ICCV2021 [PyTorch Code]

Code for the paper: Sketch Your Own GAN

Awesome Deep Graph Clustering is a collection of SOTA, novel deep graph clustering methods

A Data Annotation Tool for Semantic Segmentation, Object Detection and Lane Line Detection.(In Development Stage)

A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more

Code repository for the work "Multi-Domain Incremental Learning for Semantic Segmentation", accepted at WACV 2022

High-performance moving least squares material point method (MLS-MPM) solver.

CSPML (crystal structure prediction with machine learning-based element substitution)

Implementation of Enformer, Deepmind's attention network for predicting gene expression, in Pytorch

Filtering variational quantum algorithms for combinatorial optimization

An implementation of a sequence to sequence neural network using an encoder-decoder