Official repository of Semantic Image Matting

Last update: Dec 29, 2022

Related tags

Overview

Semantic Image Matting

This is the official repository of Semantic Image Matting (CVPR2021).

Overview

Natural image matting separates the foreground from background in fractional occupancy which can be caused by highly transparent objects, complex foreground (e.g., net or tree), and/or objects containing very fine details (e.g., hairs). Although conventional matting formulation can be applied to all of the above cases, no previous work has attempted to reason the underlying causes of matting due to various foreground semantics.

We show how to obtain better alpha mattes by incorporating into our framework semantic classification of matting regions. Specifically, we consider and learn 20 classes of matting patterns, and propose to extend the conventional trimap to semantic trimap. The proposed semantic trimap can be obtained automatically through patch structure analysis within trimap regions. Meanwhile, we learn a multi-class discriminator to regularize the alpha prediction at semantic level, and content-sensitive weights to balance different regularization losses.

Dataset

Download our semantic image matting dataset (SIMD) here. SIMD is composed self-collected images and a subset of adobe images. To obtain the complete dataset, please contact Brian Price ([email protected]) for the Adobe Image Matting dataset first and follow the instructions within SIMD.zip.

Requirements

The codes are tested in the following environment:

Python 3.7
Pytorch 1.9.0
CUDA 10.2 & CuDNN 7.6.5

Performance

Some pretrained models are listed below with their performance.

Methods	SAD	MSE	Grad	Conn	Link
SIMD	27.9	4.7	11.6	20.8	model
Composition-1K (paper)	28.0	5.8	10.8	24.8
Composition-1K (repo)	27.7	5.6	10.7	24.4	model

Run

Download the model and put it under checkpoints/DIM or checkpoints/Adobe in the root directory. Download the classifier here and put it under checkpoints. Run the inference and evaluation by

python scripts/main.py -c config/CONFIG.yaml

Results

Reference

If you find our work useful in your research, please consider citing:

@inproceedings{sun2021sim,
  author    = {Yanan Sun and Chi-Keung Tang and Yu-Wing Tai}
  title     = {Semantic Image Matting},
  booktitle = {Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition},
  year      = {2021},
}

Acknowledgment

This repo borrows code from several repos, like GCA and FBA.

Official repository of Semantic Image Matting

Related tags

Overview

Semantic Image Matting

Overview

Dataset

Requirements

Performance

Run

Results

Reference

Acknowledgment

Owner

This library provides an abstraction to perform Model Versioning using Weight & Biases.

Building a real-time environment using webcam frame division in OpenCV and classify cropped images using a fine-tuned vision transformers on hybryd datasets samples for facial emotion recognition.

Segmentation models with pretrained backbones. PyTorch.

Deployment of PyTorch chatbot with Flask

Real-world Anomaly Detection in Surveillance Videos- pytorch Re-implementation

Offical code for the paper: "Growing 3D Artefacts and Functional Machines with Neural Cellular Automata" https://arxiv.org/abs/2103.08737

Code and models for ICCV2021 paper "Robust Object Detection via Instance-Level Temporal Cycle Confusion".

DeepCO3: Deep Instance Co-segmentation by Co-peak Search and Co-saliency

Depression Asisstant GDSC Challenge Solution

UMich 500-Level Mobile Robotics Course

Improving adversarial robustness by a coupling rejection strategy

Beyond a Gaussian Denoiser: Residual Learning of Deep CNN for Image Denoising

Open source code for the paper of Neural Sparse Voxel Fields.

Implementation of CVPR'21: RfD-Net: Point Scene Understanding by Semantic Instance Reconstruction

Chess reinforcement learning by AlphaGo Zero methods.

Neural Magic Eye: Learning to See and Understand the Scene Behind an Autostereogram, arXiv:2012.15692.

[MedIA2021]MIDeepSeg: Minimally Interactive Segmentation of Unseen Objects from Medical Images Using Deep Learning

Projecting interval uncertainty through the discrete Fourier transform

QuanTaichi evaluation suite

pyspark🍒🥭 is delicious，just eat it!😋😋