A Strong Baseline for Image Semantic Segmentation

Introduction

This project is an open source semantic segmentation toolbox based on PyTorch. It is based on the codes of our Tianchi competition in 2021 (https://tianchi.aliyun.com/competition/entrance/531860/introduction).
In the competition, our team won the third place (please see Tianchi_README.md).

Overview

The master branch works with PyTorch 1.6+.The project now supports popular and contemporary semantic segmentation frameworks, e.g. UNet, DeepLabV3+, HR-Net etc.

Requirements

Support

Backbone

ResNet (CVPR'2016)
SeNet (CVPR'2018)
IBN-Net (CVPR'2018)
EfficientNet (CVPR'2020)

Methods

Tricks

Tools

large image inference (cut and merge)
post process (crf/superpixels)

Quick Start

Train a model

python train.py --config_file ${CONFIG_FILE}

CONFIG_FILE: File of training config about model

Examples:
We trained our model in Tianchi competition according to the following script:
Stage 1 (160e)

python train.py --config_file configs/tc_seg/tc_seg_res_unet_r34_ibn_a_160e.yml

Stage 2 (swa 24e)

python train.py --config_file configs/tc_seg/tc_seg_res_unet_r34_ibn_a_swa.yml

Inference with pretrained models

python inference.py --config_file ${CONFIG_FILE}

CONFIG_FILE: File of inference config about model

Predict large image with pretrained models

python predict_demo.py --config_file ${CONFIG_FILE} --rs_img_file ${IMAGE_FILE_PATH} --temp_img_save_path ${TEMP_CUT_PATH} -temp_seg_map_save_path ${TEMP_SAVE_PATH} --save_seg_map_file ${SAVE_SEG_FILE}

CONFIG_FILE: File of inference config about model
IMAGE_FILE_PATH: File of large input image to predict
TEMP_CUT_PATH: Temp folder of small cutting samples
TEMP_SAVE_PATH: Temp folder of predict results of cutting samples
SAVE_SEG_FILE: Predict result of the large image

A Strong Baseline for Image Semantic Segmentation

Related tags

Overview

A Strong Baseline for Image Semantic Segmentation

Introduction

Overview

Requirements

Support

Backbone

Methods

Tricks

Tools

Quick Start

Train a model

Inference with pretrained models

Predict large image with pretrained models

Owner

Clark He

How the Deep Q-learning method works and discuss the new ideas that makes the algorithm work

Pytorch modules for paralel models with same architecture. Ideal for multi agent-based systems

AI Toolkit for Healthcare Imaging

Referring Video Object Segmentation

This code is a toolbox that uses Torch library for training and evaluating the ERFNet architecture for semantic segmentation.

Motion planning algorithms commonly used on autonomous vehicles. (path planning + path tracking)

High performance distributed framework for training deep learning recommendation models based on PyTorch.

A benchmark for the task of translation suggestion

Python/Rust implementations and notes from Proofs Arguments and Zero Knowledge

Automatically erase objects in the video, such as logo, text, etc.

A unified 3D Transformer Pipeline for visual synthesis

以孤立语假设和宽度优先搜索为基础，构建了一种多通道堆叠注意力Transformer结构的斗地主ai

Not All Points Are Equal: Learning Highly Efficient Point-based Detectors for 3D LiDAR Point Clouds (CVPR 2022, Oral)

CondenseNet: Light weighted CNN for mobile devices

Official repository for "On Generating Transferable Targeted Perturbations" (ICCV 2021)

DTCN SMP Challenge - Sequential prediction learning framework and algorithm

FishNet: One Stage to Detect, Segmentation and Pose Estimation

Finding all things on-prem Microsoft for password spraying and enumeration.

Synthesizing and manipulating 2048x1024 images with conditional GANs

This repository contains the map content ontology used in narrative cartography