code for `Look Closer to Segment Better: Boundary Patch Refinement for Instance Segmentation`

Last update: Jan 05, 2023

Related tags

Overview

Look Closer to Segment Better: Boundary Patch Refinement for Instance Segmentation (CVPR 2021)

Introduction

PBR is a conceptually simple yet effective post-processing refinement framework to improve the boundary quality of instance segmentation. Following the idea of looking closer to segment boundaries better, BPR extracts and refines a series of small boundary patches along the predicted instance boundaries. The proposed BPR framework (as shown below) yields significant improvements over the Mask R-CNN baseline on the Cityscapes benchmark, especially on the boundary-aware metrics.

For more details, please refer to our paper.

Installation

Please refer to INSTALL.md.

Training

Prepare patches dataset [optional]

First, you need to generate the instance segmentation results on the Cityscapes training and validation set, as the following format:

maskrcnn_train
- aachen_000000_000019_leftImg8bit_pred.txt
- aachen_000001_000019_leftImg8bit_0_person.png
- aachen_000001_000019_leftImg8bit_10_car.png
- ...

maskrcnn_val
- frankfurt_000001_064130_leftImg8bit_pred.txt
- frankfurt_000001_064305_leftImg8bit_0_person.png
- frankfurt_000001_064305_leftImg8bit_10_motorcycle.png
- ...

The content of the txt file is the same as the standard format required by cityscape script, e.g.:

frankfurt_000000_000294_leftImg8bit_0_person.png 24 0.9990299940109253
frankfurt_000000_000294_leftImg8bit_1_person.png 24 0.9810258746147156
...

Then use the provided script to generate the training set:

sh tools/prepare_dataset.sh \
  maskrcnn_train \
  maskrcnn_val \
  maskrcnn_r50

Note that this step can take about 2 hours. Feel free to skip it by downloading the processed training set.

Train the network

Point DATA_ROOT to the patches dataset and run the training script

DATA_ROOT=maskrcnn_r50/patches \
bash tools/dist_train.sh \
  configs/bpr/hrnet18s_128.py \
  4

Inference

Suppose you have some instance segmentation results of Cityscapes dataset, as the following format:

maskrcnn_val
- frankfurt_000001_064130_leftImg8bit_pred.txt
- frankfurt_000001_064305_leftImg8bit_0_person.png
- frankfurt_000001_064305_leftImg8bit_10_motorcycle.png
- ...

We provide a script (tools/inference.sh) to perform refinement operation, usage:

IOU_THRESH=0.55 \
IMG_DIR=data/cityscapes/leftImg8bit/val \
GT_JSON=data/cityscapes/annotations/instancesonly_filtered_gtFine_val.json \
BPR_ROOT=. \
GPUS=4 \
sh tools/inference.sh configs/bpr/hrnet48_256.py ckpts/hrnet48_256.pth maskrcnn_val maskrcnn_val_refined

The refinement results will be saved in maskrcnn_val_refined/refined.

For COCO model, use tools/inference_coco.sh instead.

Models

Backbone	Dataset	Checkpoint
HRNet-18s	Cityscapes	Tsinghua Cloud
HRNet-48	Cityscapes	Tsinghua Cloud
HRNet-18s	COCO	Tsinghua Cloud

Acknowledgement

This project is based on mmsegmentation code base.

Citation

If you find this project useful in your research, please consider citing:

@article{tang2021look,
  title={Look Closer to Segment Better: Boundary Patch Refinement for Instance Segmentation},
  author={Chufeng Tang and Hang Chen and Xiao Li and Jianmin Li and Zhaoxiang Zhang and Xiaolin Hu},
  journal={arXiv preprint arXiv:2104.05239},
  year={2021}
}

code for `Look Closer to Segment Better: Boundary Patch Refinement for Instance Segmentation`

Related tags

Overview

Look Closer to Segment Better: Boundary Patch Refinement for Instance Segmentation (CVPR 2021)

Introduction

Installation

Training

Prepare patches dataset [optional]

Train the network

Inference

Models

Acknowledgement

Citation

Owner

H.Chen

Fast SHAP value computation for interpreting tree-based models

MultiMix: Sparingly Supervised, Extreme Multitask Learning From Medical Images (ISBI 2021, MELBA 2021)

This is a pytorch implementation for the BST model from Alibaba https://arxiv.org/pdf/1905.06874.pdf

FTIR-Deep Learning - FTIR Deep Learning With Python

From the basics to slightly more interesting applications of Tensorflow

ML course - EPFL Machine Learning Course, Fall 2021

This repository contains the source code and data for reproducing results of Deep Continuous Clustering paper

Implémentation en pyhton de l'article Depixelizing pixel art de Johannes Kopf et Dani Lischinski

VM3000 Microphones

Image-to-Image Translation in PyTorch

Next-gen Rowhammer fuzzer that uses non-uniform, frequency-based patterns.

Code for our NeurIPS 2021 paper Mining the Benefits of Two-stage and One-stage HOI Detection

Self-Supervised Monocular DepthEstimation with Internal Feature Fusion(arXiv), BMVC2021

StyleGAN-Human: A Data-Centric Odyssey of Human Generation

Invasive Plant Species Identification

Roach: End-to-End Urban Driving by Imitating a Reinforcement Learning Coach

SMORE: Knowledge Graph Completion and Multi-hop Reasoning in Massive Knowledge Graphs

Implementation supporting the ICCV 2017 paper "GANs for Biological Image Synthesis"

A python library for highly configurable transformers - easing model architecture search and experimentation.

Predicting Auction Sale Price using the kaggle bulldozer auction sales data: Modeling with Ensembles vs Neural Network