ICNet for Real-Time Semantic Segmentation on High-Resolution Images, ECCV2018

Last update: Dec 31, 2022

Related tags

Deep Learning ICNet

Overview

ICNet for Real-Time Semantic Segmentation on High-Resolution Images

by Hengshuang Zhao, Xiaojuan Qi, Xiaoyong Shen, Jianping Shi, Jiaya Jia, details are in project page.

Introduction

Based on PSPNet, this repository is build for evaluation in ICNet. For installation, please follow the description in PSPNet repository (support CUDA 7.0/7.5 + cuDNN v4).

Usage

Clone the repository recursively:

git clone --recursive https://github.com/hszhao/ICNet.git

Build Caffe and matcaffe:

cd $ICNET_ROOT/PSPNet
cp Makefile.config.example Makefile.config
vim Makefile.config
make -j8 && make matcaffe
cd ..

Evaluation mIoU:
- Evaluation code is in folder 'evaluation'.
- Download trained models and put them in folder 'evaluation/model':
  - icnet_cityscapes_train_30k.caffemodel: GoogleDrive
    
    (31M, md5: c7038630c4b6c869afaaadd811bdb539; train on trainset for 30k)
  - icnet_cityscapes_trainval_90k.caffemodel: GoogleDrive
    
    (31M, md5: 4f4dd9eecd465dd8de7e4cf88ba5d5d5; train on trainvalset for 90k)
- Modify the related paths in 'eval_all.m':
  - Mainly variables 'data_root' and 'eval_list', and your image list for evaluation should be similar to that in folder 'evaluation/samplelist' if you use this evaluation code structure.
```
cd evaluation
vim eval_all.m
```
- Run the evaluation scripts:
```
./run.sh
```
Evaluation time:
- To get inference time as accurate as possible, it's suggested to make sure the GPU card with specified ID in script 'test_time.sh' is empty (without other processes executing)
- Run the evaluation scripts:
```
./test_time.sh
```
Results:
- Prediction results will show in folder 'evaluation/mc_result' and the expected scores are:
  - ICNet train on trainset for 30K, evaluated on valset (mIoU/pAcc): 67.7/94.5
  - ICNet train on trainvalset for 90K, evaluated on testset (mIoU): 69.5
- Log information of inference time will be in file 'time.log', approximately 33~36ms on TitanX.
Demo video:
- Video processed by ICNet on cityscapes dataset:
  - Alpha blending with value as 0.5: Video

Citation

If ICNet is useful for your research, please consider citing:

@inproceedings{zhao2018icnet,
  title={ICNet for Real-Time Semantic Segmentation on High-Resolution Images},
  author={Zhao, Hengshuang and Qi, Xiaojuan and Shen, Xiaoyong and Shi, Jianping and Jia, Jiaya},
  booktitle={ECCV},
  year={2018}
}

Questions

Please contact '[email protected]'

ICNet for Real-Time Semantic Segmentation on High-Resolution Images, ECCV2018

Related tags

Overview

ICNet for Real-Time Semantic Segmentation on High-Resolution Images

Introduction

Usage

Citation

Questions

Owner

Hengshuang Zhao

Parris, the automated infrastructure setup tool for machine learning algorithms.

Structured Data Gradient Pruning (SDGP)

Fibonacci Method Gradient Descent

MISSFormer: An Effective Medical Image Segmentation Transformer

Code for the Population-Based Bandits Algorithm, presented at NeurIPS 2020.

[NeurIPS 2021] Source code for the paper "Qu-ANTI-zation: Exploiting Neural Network Quantization for Achieving Adversarial Outcomes"

Towards Improving Embedding Based Models of Social Network Alignment via Pseudo Anchors

a basic code repository for basic task in CV(classification,detection,segmentation)

A toolkit for document-level event extraction, containing some SOTA model implementations

Official implementation of FCL-taco2: Fast, Controllable and Lightweight version of Tacotron2 @ ICASSP 2021

DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code

The official implementation of paper Siamese Transformer Pyramid Networks for Real-Time UAV Tracking, accepted by WACV22

Supervised Contrastive Learning for Downstream Optimized Sequence Representations

Implementation of the paper "Language-agnostic representation learning of source code from structure and context".

Repository for GNSS-based position estimation using a Deep Neural Network

Stereo Radiance Fields (SRF): Learning View Synthesis for Sparse Views of Novel Scenes

Social Network Ads Prediction

Gated-Shape CNN for Semantic Segmentation (ICCV 2019)

3D-aware GANs based on NeRF (arXiv).

Learning Correspondence from the Cycle-consistency of Time (CVPR 2019)