Deep Watershed Transform for Instance Segmentation

Last update: Nov 20, 2022

Overview

Deep Watershed Transform

Performs instance-level segmentation as described in the following paper:

Min Bai and Raquel Urtasun, "Deep Watershed Transform for Instance Segmentation," in CVPR 2017. Available at https://arxiv.org/abs/1611.08303.

This page is still under construction.

Dependencies

Developed and tested on Ubuntu 14.04 and 16.04.

TensorFlow (www.tensorflow.org)
NumPy, SciPy, and scikit-image (sudo apt-get install python-numpy python-scipy python-skimage)
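
A quick way to confirm that the dependencies listed above are importable before running the model is sketched below. The helper name `missing_modules` and the module list are illustrative assumptions, not part of this repository; the module names are the usual import names for the packages above.

```python
import importlib.util

# Import names of the packages listed under Dependencies (assumed, not taken from the repo).
REQUIRED_MODULES = ("tensorflow", "numpy", "scipy", "skimage")


def missing_modules(names=REQUIRED_MODULES):
    """Return the subset of `names` that cannot be found on the import path."""
    return [name for name in names if importlib.util.find_spec(name) is None]
```

Calling `missing_modules()` returns an empty list when everything is installed; otherwise it lists the packages that still need to be installed.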

Inputs

Cityscapes images (www.cityscapes-dataset.com).
Semantic segmentation of the input images. In our case, we used the output of PSPNet (by H. Zhao et al., https://github.com/hszhao/PSPNet). These are uint8 images with pixel-wise semantic labels encoded as the 'trainIDs' defined by Cityscapes. For more information, see https://github.com/mcordts/cityscapesScripts/blob/master/cityscapesscripts/helpers/labels.py
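
As a rough illustration of the expected semantic input format, the sketch below checks that a label map is a 2-D uint8 array containing only Cityscapes trainIds. The function name `check_semantic_input` is hypothetical and not part of this repository. The valid range (0 to 18, plus 255 for ignored pixels) follows the Cityscapes labels.py definitions; whether this model tolerates 255 in its input is an assumption.

```python
import numpy as np

# Cityscapes trainIds: 0..18 for evaluated classes, 255 for ignored pixels.
VALID_TRAIN_IDS = np.array(list(range(19)) + [255], dtype=np.uint8)


def check_semantic_input(semantic):
    """Raise ValueError unless `semantic` is a 2-D uint8 map of trainIds."""
    if semantic.dtype != np.uint8:
        raise ValueError("expected uint8, got %s" % semantic.dtype)
    if semantic.ndim != 2:
        raise ValueError("expected a 2-D (height x width) label map")
    # Collect any label values that are not valid trainIds.
    unexpected = np.setdiff1d(np.unique(semantic), VALID_TRAIN_IDS)
    if unexpected.size > 0:
        raise ValueError("unexpected label values: %s" % unexpected.tolist())
    return semantic
```

A check like this catches the common mistake of feeding label maps encoded with the full Cityscapes 'id' values instead of 'trainIDs'.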

Outputs

The model produces pixel-wise instance labels as a uint16 image with the same formatting as the Cityscapes instance segmentation challenge ground truth. In particular, each pixel is labeled as 'id' * 1000 + instance_id, where 'id' is as defined by Cityscapes (for more information, consult labels.py in the above link), and instance_id is an integer indexing the object instance.
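
The encoding described above can be sketched with NumPy as follows. The function names `encode_instances` and `decode_instances` are illustrative and not part of this repository; only the 'id' * 1000 + instance_id formula comes from the description above.

```python
import numpy as np


def encode_instances(class_id, instance_id):
    """Combine per-pixel Cityscapes 'id' and instance index into uint16 labels."""
    class_id = np.asarray(class_id, dtype=np.uint16)
    instance_id = np.asarray(instance_id, dtype=np.uint16)
    # Cityscapes ids are below 34, so id * 1000 + instance_id fits in uint16.
    return class_id * np.uint16(1000) + instance_id


def decode_instances(label):
    """Split encoded labels back into (class 'id', instance index)."""
    label = np.asarray(label)
    return label // 1000, label % 1000
```

For example, the third instance of class 'car' (Cityscapes 'id' 26) is stored as 26 * 1000 + 3 = 26003, and decoding 26003 gives back the pair (26, 3).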

Testing the Model

Clone the repository into dwt/.
Download the model from www.cs.toronto.edu/~mbai/dwt_cityscapes_pspnet.mat and place it in the "dwt/model" directory.
Run "cd E2E".
Run "python main.py".
The results will be available in "dwt/example/output".

Training the Model

Training code will be available soon.
