Weakly Supervised Learning of Instance Segmentation with Inter-pixel Relations, CVPR 2019 (Oral)

Last update: Dec 29, 2022

Overview

Weakly Supervised Learning of Instance Segmentation with Inter-pixel Relations

The code of:

Weakly Supervised Learning of Instance Segmentation with Inter-pixel Relations, Jiwoon Ahn, Sunghyun Cho, and Suha Kwak, CVPR 2019 [Paper]

This repository contains a framework for learning instance segmentation with image-level class labels as supervision. The key component of our approach is Inter-pixel Relation Network (IRNet) that estimates two types of information: a displacement vector field and a class boundary map, both of which are in turn used to generate pseudo instance masks from CAMs.

Citation

If you find the code useful, please consider citing our paper using the following BibTeX entry.

@InProceedings{Ahn_2019_CVPR,
author = {Ahn, Jiwoon and Cho, Sunghyun and Kwak, Suha},
title = {Weakly Supervised Learning of Instance Segmentation with Inter-pixel Relations},
booktitle = {The IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
month = {June},
year = {2019}
}

Prerequisite

Python 3.7, PyTorch 1.1.0, and more in requirements.txt
PASCAL VOC 2012 devkit
NVIDIA GPU with more than 1024MB of memory

Usage

Install python dependencies

pip install -r requirements.txt

Download PASCAL VOC 2012 devkit

Follow instructions in http://host.robots.ox.ac.uk/pascal/VOC/voc2012/#devkit

Run run_sample.py or make your own script

python run_sample.py

You can either mannually edit the file, or specify commandline arguments.

Train Mask R-CNN or DeepLab with the generated pseudo labels

For the reports, we used Detectron.
- Run step/make_cocoann.py to create COCO-style annotations.
- Note: Do not employ https://storage.googleapis.com/coco-dataset/external/PASCAL_VOC.zip to measure the performance of the Mask R-CNN! It only contains bounding box annotations.
TorchVision now supports Mask R-CNN and DeepLab. I personally recommend to use this.

TO DO

Training code for MS-COCO
Code refactoring
IRNet v2

Weakly Supervised Learning of Instance Segmentation with Inter-pixel Relations, CVPR 2019 (Oral)

Related tags

Overview

Weakly Supervised Learning of Instance Segmentation with Inter-pixel Relations

Citation

Prerequisite

Usage

Install python dependencies

Download PASCAL VOC 2012 devkit

Run run_sample.py or make your own script

Train Mask R-CNN or DeepLab with the generated pseudo labels

TO DO

Owner

Jiwoon Ahn

Cross Quality LFW: A database for Analyzing Cross-Resolution Image Face Recognition in Unconstrained Environments

Tensorflow implementation of MIRNet for Low-light image enhancement

Source code for paper: Knowledge Inheritance for Pre-trained Language Models

PyTorch implementation of GLOM

Reinforcement learning for self-driving in a 3D simulation

A PyTorch implementation for Unsupervised Domain Adaptation by Backpropagation(DANN), support Office-31 and Office-Home dataset

official implementation for the paper "Simplifying Graph Convolutional Networks"

MMGeneration is a powerful toolkit for generative models, based on PyTorch and MMCV.

Pgn2tex - Scripts to convert pgn files to latex document. Useful to build books or pdf from pgn studies

Runtime type annotations for the shape, dtype etc. of PyTorch Tensors.

Official Pytorch implementation for Deep Contextual Video Compression, NeurIPS 2021

RCD: Relation Map Driven Cognitive Diagnosis for Intelligent Education Systems

We present a framework for training multi-modal deep learning models on unlabelled video data by forcing the network to learn invariances to transformations applied to both the audio and video streams.

Official code for Next Check-ins Prediction via History and Friendship on Location-Based Social Networks (MDM 2018)

Package for working with hypernetworks in PyTorch.

Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in Pytorch

MinkLoc3D-SI: 3D LiDAR place recognition with sparse convolutions,spherical coordinates, and intensity

RipsNet: a general architecture for fast and robust estimation of the persistent homology of point clouds

Code release for The Devil is in the Channels: Mutual-Channel Loss for Fine-Grained Image Classification (TIP 2020)

Implementations of LSTM: A Search Space Odyssey variants and their training results on the PTB dataset.