Uncertainty Estimation via Response Scaling for Pseudo-mask Noise Mitigation in Weakly-supervised Semantic Segmentation

Last update: Sep 23, 2022

Introduction

This is a PyTorch implementation of Uncertainty Estimation via Response Scaling for Pseudo-mask Noise Mitigation in Weakly-supervised Semantic Segmentation (AAAI 2022), based on mmsegmentation. For the classification phase, please refer to PMM; for the segmentation phase, please refer to WSSS_MMSeg.

In this paper, we mitigate pseudo-mask noise in the segmentation phase using uncertainty estimated from response scaling, which simulates the behavior of noise. The technique is applicable to any weakly-supervised semantic segmentation method that is built on fully-supervised semantic segmentation.
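To make the idea concrete, here is a minimal, framework-free sketch of one plausible reading of "uncertainty from response scaling": the class responses of a pixel are multiplied by several scale factors, and the variance of the resulting top-class probability is taken as the uncertainty. This is not the code in this repository. The function names, the scale factors, and the choice of variance as the statistic are all assumptions made for illustration.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of class scores."""
    peak = max(logits)
    exps = [math.exp(v - peak) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]


def scaled_uncertainty(logits, scales=(0.5, 1.0, 2.0, 4.0)):
    """Variance of the top-class probability across scaled responses.

    `scales` is a hypothetical choice, not a setting taken from the paper.
    """
    # Scaling by a positive factor never changes which class is the top one.
    top = max(range(len(logits)), key=lambda c: logits[c])
    probs = [softmax([s * v for v in logits])[top] for s in scales]
    mean = sum(probs) / len(probs)
    return sum((p - mean) ** 2 for p in probs) / len(probs)


confident_pixel = [9.0, 0.5, 0.2]   # one class clearly dominates
ambiguous_pixel = [1.2, 1.0, 0.9]   # classes are nearly tied
print(scaled_uncertainty(confident_pixel), scaled_uncertainty(ambiguous_pixel))
```

Under these assumptions, a pixel with a clear winner keeps almost the same probability at every scale and gets a low score, while a nearly tied pixel swings with the scale and gets a higher one.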

Uncertainty visualization

Framework visualization

Preparation

(Extraction code for BaiduYun: mtci)

Datasets and pretrained weights

VOC12 OneDrive, BaiduYun; COCO14 BaiduYun; Pretrained weights OneDrive, BaiduYun

Pseudo-masks from classification phase

Pseudo-masks (if you want to skip the classification phase): VOC12 OneDrive, COCO14 BaiduYun

Intermediate segmentation weights for uncertainty and cyclic pseudo-mask

Intermediate weights (if you want to skip the first segmentation training): BaiduYun

Released segmentation weights for test and visualization

Released weights, BaiduYun

Once downloaded, execute the following commands to link the datasets and weights.

git clone https://github.com/XMed-Lab/URN.git
cd URN
mkdir data
cd data
ln -s [path to model files] models
ln -s [path to voc12] voc12
ln -s [path to coco2014] coco2014
ln -s [path to your voc pseudo-mask] voc12/VOC2012/ppmg
ln -s [path to your coco pseudo-mask] coco2014/voc_format/ppmg
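After linking, it can help to confirm that every expected entry resolves before starting a long job. The sketch below checks only the five paths created by the commands above. The helper name `missing_entries` is hypothetical and is not part of this repository.

```python
from pathlib import Path

# Entries created under data/ by the ln -s commands above.
EXPECTED = (
    "models",
    "voc12",
    "coco2014",
    "voc12/VOC2012/ppmg",
    "coco2014/voc_format/ppmg",
)


def missing_entries(data_dir):
    """Return the expected entries that do not resolve under data_dir."""
    root = Path(data_dir)
    # Path.exists() follows symlinks, so a dangling link counts as missing.
    return [rel for rel in EXPECTED if not (root / rel).exists()]


if __name__ == "__main__":
    # Run from the URN directory; prints nothing when the layout is complete.
    for rel in missing_entries("data"):
        print("missing:", rel)
```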

Run the code

(If you are not running on a cluster that uses srun, please modify the scripts "tools/dist_*.sh", using the given scripts "tools/srun_*.sh" as a reference.)

Installation

cd URN
pip install mmcv==1.1.5
pip install -e .

(If you run into installation problems, please refer to mmsegmentation.)

Train segmentation for the first time (you can skip this step by using the intermediate weights)

cd URN
bash tools/slurm_train.sh [cluster partition] python configs/pspnet_wsss/pspnet_res2net_20k_voc12aug_pus.py work_dirs/voc12_r2n_pus 8

Estimate uncertainty and generate cyclic pseudo-masks

bash tools/slurm_test.sh [cluster partition] python configs/pspnet_wsss/pspnet_res2net_20k_voc12aug_uncertainty.py [intermediate weights] 8

Train segmentation with the reweighting strategy

bash tools/slurm_train.sh [cluster partition] python configs/pspnet_wsss/pspnet_res2net_20k_voc12aug_urn.py work_dirs/voc12_r2n_urn 8
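As a rough illustration of what reweighting by uncertainty can look like, the sketch below computes a pixel-wise cross-entropy in which each pixel's contribution is scaled down as its uncertainty grows. It is not the loss implemented in this repository. The weight `exp(-uncertainty)`, the function name, and the ignore label 255 are assumptions chosen for illustration.

```python
import math


def reweighted_cross_entropy(probs, labels, uncertainty, ignore_index=255):
    """Weighted mean cross-entropy over pixels.

    probs: per-pixel lists of class probabilities.
    labels: per-pixel pseudo-mask class ids.
    uncertainty: per-pixel non-negative scores.
    The exp(-u) weight is a hypothetical choice, not the paper's formula.
    """
    total, weight_sum = 0.0, 0.0
    for p, y, u in zip(probs, labels, uncertainty):
        if y == ignore_index:
            continue  # skip pixels without a pseudo-label
        w = math.exp(-u)
        # Clamp the probability to avoid log(0).
        total += -w * math.log(max(p[y], 1e-12))
        weight_sum += w
    return total / weight_sum if weight_sum > 0 else 0.0


probs = [[0.9, 0.1], [0.2, 0.8]]
labels = [0, 0]  # the second pixel disagrees with the model: possibly a noisy label
print(reweighted_cross_entropy(probs, labels, [0.0, 0.0]))
print(reweighted_cross_entropy(probs, labels, [0.0, 5.0]))
```

With equal weights, the suspicious second pixel dominates the loss. Once it is marked as highly uncertain, its influence shrinks and the loss is driven mostly by the pixel the pseudo-mask and the model agree on.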

Notes:

We provide other backbones, including ResNet101, ScaleNet101, and Wide-ResNet38.
Configs for COCO14 are provided in "configs/pspnet_wsss".
We suggest using multiple cluster nodes to accelerate the generation of pseudo-masks when using "tools/slurm_test.sh".
Run "tools/run_pmm.sh" to get the PMM baselines.
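The three VOC12 steps under "Run the code" always run in the same order, so it can be convenient to assemble their command lines in one place. The sketch below only builds the strings and does not execute anything. The function name `pipeline_commands` is hypothetical, and the partition name and intermediate weights path are left as parameters because they depend on your cluster and downloads.

```python
import shlex

CONFIG_PREFIX = "configs/pspnet_wsss/pspnet_res2net_20k_voc12aug"


def pipeline_commands(partition, intermediate_weights, gpus=8):
    """Return the three Slurm command lines in the order listed above."""
    steps = [
        # 1. First segmentation training.
        ["bash", "tools/slurm_train.sh", partition, "python",
         f"{CONFIG_PREFIX}_pus.py", "work_dirs/voc12_r2n_pus", str(gpus)],
        # 2. Uncertainty estimation and cyclic pseudo-mask generation.
        ["bash", "tools/slurm_test.sh", partition, "python",
         f"{CONFIG_PREFIX}_uncertainty.py", intermediate_weights, str(gpus)],
        # 3. Segmentation training with reweighting.
        ["bash", "tools/slurm_train.sh", partition, "python",
         f"{CONFIG_PREFIX}_urn.py", "work_dirs/voc12_r2n_urn", str(gpus)],
    ]
    # shlex.join quotes any argument that contains shell-special characters.
    return [shlex.join(step) for step in steps]
```

For example, `pipeline_commands("my_partition", "path/to/weights.pth")` returns three strings that can be printed and pasted into a shell, where both argument values are placeholders.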

License

Please refer to: LICENSE.

Owner

XMed-Lab
