DeepLab is a state-of-art deep learning system for semantic image segmentation built on top of Caffe.

Last update: Nov 14, 2022

Related tags

Deep Learning DeepLab-Context

Overview

DeepLab

Introduction

DeepLab is a state-of-art deep learning system for semantic image segmentation built on top of Caffe.

It combines densely-computed deep convolutional neural network (CNN) responses with densely connected conditional random fields (CRF).

This distribution provides a publicly available implementation for the key model ingredients first reported in an arXiv paper, accepted in revised form as conference publication to the ICLR-2015 conference. It also contains implementations for methods supporting model learning using only weakly labeled examples, described in a second follow-up arXiv paper. Please consult and consider citing the following papers:

@inproceedings{chen14semantic,
  title={Semantic Image Segmentation with Deep Convolutional Nets and Fully Connected CRFs},
  author={Liang-Chieh Chen and George Papandreou and Iasonas Kokkinos and Kevin Murphy and Alan L Yuille},
  booktitle={ICLR},
  url={http://arxiv.org/abs/1412.7062},
  year={2015}
}

@article{papandreou15weak,
  title={Weakly- and Semi-Supervised Learning of a DCNN for Semantic Image Segmentation},
  author={George Papandreou and Liang-Chieh Chen and Kevin Murphy and Alan L Yuille},
  journal={arxiv:1502.02734},
  year={2015}
}

Note that if you use the densecrf implementation, please consult and cite the following paper:

@inproceedings{KrahenbuhlK11,
  title={Efficient Inference in Fully Connected CRFs with Gaussian Edge Potentials},
  author={Philipp Kr{\"{a}}henb{\"{u}}hl and Vladlen Koltun},
  booktitle={NIPS},      
  year={2011}
}

Performance

DeepLab currently achieves 73.9% on the challenging PASCAL VOC 2012 image segmentation task -- see the leaderboard.

Pre-trained models

We have released several trained models and corresponding prototxt files at here. Please check it for more model details.

The best model among the released ones yields 73.6% on PASCAL VOC 2012 test set.

Python wrapper requirements

Install wget library for python

sudo pip install wget

Change DATA_ROOT to point to the PASCAL images
To use the mat_read_layer and mat_write_layer, please download and install matio.

Running the code

python run.py

FAQ

Check FAQ if you have some problems while using the code.

DeepLab is a state-of-art deep learning system for semantic image segmentation built on top of Caffe.

Related tags

Overview

DeepLab

Introduction

Performance

Pre-trained models

Python wrapper requirements

Running the code

FAQ

Owner

Ali

The Multi-Mission Maximum Likelihood framework (3ML)

My 1st place solution at Kaggle Hotel-ID 2021

Fast and simple implementation of RL algorithms, designed to run fully on GPU.

A tutorial on DataFrames.jl prepared for JuliaCon2021

Running AlphaFold2 (from ColabFold) in Azure Machine Learning

Is RobustBench/AutoAttack a suitable Benchmark for Adversarial Robustness?

An implementation for Neural Architecture Search with Random Labels (CVPR 2021 poster) on Pytorch.

This is the dataset for testing the robustness of various VO/VIO methods

An LSTM based GAN for Human motion synthesis

This is the official Pytorch-version code of FlatGCN (Flattened Graph Convolutional Networks for Recommendation).

Advanced Signal Processing Notebooks and Tutorials

M3DSSD: Monocular 3D Single Stage Object Detector

A Keras implementation of YOLOv4 (Tensorflow backend)

Official PyTorch implementation of BlobGAN: Spatially Disentangled Scene Representations

Real-time VIBE: Frame by Frame Inference of VIBE (Video Inference for Human Body Pose and Shape Estimation)

OpenCVのGrabCut()を利用したセマンティックセグメンテーション向けアノテーションツール(Annotation tool using GrabCut() of OpenCV. It can be used to create datasets for semantic segmentation.)

Harmonious Textual Layout Generation over Natural Images via Deep Aesthetics Learning

Code for "On Memorization in Probabilistic Deep Generative Models"

Dense matching library based on PyTorch

Code for the paper "Adversarial Generator-Encoder Networks"