Weakly-Supervised Semantic Segmentation Network with Deep Seeded Region Growing (CVPR 2018).

Last update: Dec 13, 2022

Overview

Weakly-Supervised Semantic Segmentation Network with Deep Seeded Region Growing (CVPR2018)

By Zilong Huang, Xinggang Wang, Jiasi Wang, Wenyu Liu and Jingdong Wang.

This code is a implementation of the weakly-supervised semantic segmentation experiments in the paper DSRG. The code is developed based on the Caffe framework.

Introduction

Overview of the proposed approach. The Deep Seeded Region Growing module takes the seed cues and segmentation map as input to produces latent pixel-wise supervision which is more accurate and more complete than seed cues. Our method iterates between reﬁning pixel-wise supervision and optimizing the parameters of a segmentation network.

License

DSRG is released under the MIT License (refer to the LICENSE file for details).

Citing DSRG

If you find DSRG useful in your research, please consider citing:

@inproceedings{huang2018dsrg,
    title={Weakly-Supervised Semantic Segmentation Network with Deep Seeded Region Growing},
    author={Huang, Zilong and Wang, Xinggang and Wang, Jiasi and Liu, Wenyu and Wang, Jingdong},
    booktitle={Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition},
    pages={7014--7023},
    year={2018}
}

Installing dependencies

Python packages:

      $ pip install -r python-dependencies.txt

caffe (deeplabv2 version): deeplabv2 caffe installation instructions are available at https://bitbucket.org/aquariusjay/deeplab-public-ver2. Note, you need to compile caffe with python wrapper and support for python layers. Then add the caffe python path into training/tools/findcaffe.py.
Fully connected CRF wrapper (requires the Eigen3 package).

      $ pip install CRF/

Training the DSRG model

Go into the training directory:

      $ cd training
      $ mkdir localization_cues

Download the initial VGG16 model pretrained on Imagenet and put it in training/ folder.
Download CAM seed and put it in training/localization_cues folder. We use CAM for localizing the foreground seed classes and utilize the saliency detection technology DRFI for localizing background seed. We provide the python interface to DRFI here for convenience if you want to generate the seed by yourself.

      $ cd training/experiment/seed_mc
      $ mkdir models

Set root_folder parameter in train-s.prototxt, train-f.prototxt and PASCAL_DIR in run-s.sh to the directory with PASCAL VOC 2012 images
Run:

      $ bash run.sh

The trained model will be created in models

Acknowledgment

This code is heavily borrowed from SEC.

Weakly-Supervised Semantic Segmentation Network with Deep Seeded Region Growing (CVPR 2018).

Related tags

Overview

Weakly-Supervised Semantic Segmentation Network with Deep Seeded Region Growing (CVPR2018)

Introduction

License

Citing DSRG

Installing dependencies

Training the DSRG model

Acknowledgment

Owner

Zilong Huang

Transformers4Rec is a flexible and efficient library for sequential and session-based recommendation, available for both PyTorch and Tensorflow.

Parallel and High-Fidelity Text-to-Lip Generation; AAAI 2022 ; Official code

Fast Neural Style for Image Style Transform by Pytorch

git《Self-Attention Attribution: Interpreting Information Interactions Inside Transformer》(AAAI 2021) GitHub:

Deep Inertial Prediction (DIPr)

RRxIO - Robust Radar Visual/Thermal Inertial Odometry: Robust and accurate state estimation even in challenging visual conditions.

Real-time face detection and emotion/gender classification using fer2013/imdb datasets with a keras CNN model and openCV.

arxiv-sanity, but very lite, simply providing the core value proposition of the ability to tag arxiv papers of interest and have the program recommend similar papers.

A dataset for online Arabic calligraphy

Source code for PairNorm (ICLR 2020)

Official Repository for the paper "Improving Baselines in the Wild".

3D cascade RCNN for object detection on point cloud

Official tensorflow implementation for CVPR2020 paper “Learning to Cartoonize Using White-box Cartoon Representations”

Graph Transformer Architecture. Source code for

Implementation of the paper titled "Using Sampling to Estimate and Improve Performance of Automated Scoring Systems with Guarantees"

TLDR: Twin Learning for Dimensionality Reduction

CS506-Spring2022 - Code and Slides for Boston University CS 506

Fast, flexible and easy to use probabilistic modelling in Python.

Package to compute Mauve, a similarity score between neural text and human text. Install with `pip install mauve-text`.

Node for thenewboston digital currency network.