[IJCAI'21] Deep Automatic Natural Image Matting

Last update: Jan 06, 2023

Overview

Deep Automatic Natural Image Matting [IJCAI-21]

This is the official repository of the paper Deep Automatic Natural Image Matting.

Introduction | Network | AIM-500 | Results | Statement

📆 News
The training code, inference code and the pretrained models will be released soon.
[2021-07-16]: Publish the validation dataset AIM-500. Please follow the readme.txt for details.

Introduction

Different from previous methods only focusing on images with salient opaque foregrounds such as humans and animals, in this paper, we investigate the difficulties when extending the automatic matting methods to natural images with salient transparent/meticulous foregrounds or non-salient foregrounds.

To address the problem, we propose a novel end-to-end matting network, which can predict a generalized trimap for any image of the above types as a unified semantic representation. Simultaneously, the learned semantic features guide the matting network to focus on the transition areas via an attention mechanism.

We also construct a test set AIM-500 that contains 500 diverse natural images covering all types along with manually labeled alpha mattes, making it feasible to benchmark the generalization ability of AIM models. Results of the experiments demonstrate that our network trained on available composite matting datasets outperforms existing methods both objectively and subjectively.

Network

We propose the methods consist of:

Improved Backbone for Matting: an advanced max-pooling version of ResNet-34, serves as the backbone for the matting network, pretrained on ImageNet;
Unified Semantic Representation: a type-wise semantic representation to replace the traditional trimaps;
Guided Matting Process: an attention based mechanism to guide the matting process by leveraging the learned semantic features from the semantic decoder to focus on extracting details only within transition area.

The backbone pretrained on ImageNet and the model pretrained on synthetic matting dataset will be released soon.

Pretrained-backbone	Pretrained-model
coming soon	coming soon

AIM-500

We propose AIM-500 (Automatic Image Matting-500), the first natural image matting test set, which contains 500 high-resolution real-world natural images from all three types (SO, STM, NS), many categories, and the manually labeled alpha mattes. Some examples and the amount of each category are shown below. The AIM-500 dataset is published now, can be downloaded directly from this link. Please follow the readme.txt for more details.

Portrait	Animal	Transparent	Plant	Furniture	Toy	Fruit
100	200	34	75	45	36	10

Results

We test our network on different types of images in AIM-500 and compare with previous SOTA methods, the results are shown below.

Statement

If you are interested in our work, please consider citing the following:

@inproceedings{ijcai2021-danim,
  title     = {Deep Automatic Natural Image Matting},
  author    = {Li, Jizhizi and Zhang, Jing and Tao, Dacheng},
  publisher = {International Joint Conferences on Artificial Intelligence Organization},
  year      = {2021},
}

This project is under the MIT license. For further questions, please contact [email protected].

Relevant Projects

End-to-end Animal Image Matting
Jizhizi Li, Jing Zhang, Stephen J. Maybank, Dacheng Tao

[IJCAI'21] Deep Automatic Natural Image Matting

Related tags

Overview

Deep Automatic Natural Image Matting [IJCAI-21]

This is the official repository of the paper Deep Automatic Natural Image Matting.

📆 News

Introduction

Network

AIM-500

Results

Statement

Relevant Projects

Owner

Jizhizi_Li

Official repository of "Investigating Tradeoffs in Real-World Video Super-Resolution"

Revisiting, benchmarking, and refining Heterogeneous Graph Neural Networks.

Official implementation of ACMMM'20 paper 'Self-supervised Video Representation Learning Using Inter-intra Contrastive Framework'

The implementation of ICASSP 2020 paper "Pixel-level self-paced learning for super-resolution"

An implementation of a discriminant function over a normal distribution to help classify datasets.

Detection of PCBA defect

FedCV: A Federated Learning Framework for Diverse Computer Vision Tasks

[ICCV 2021] Official Pytorch implementation for Discriminative Region-based Multi-Label Zero-Shot Learning SOTA results on NUS-WIDE and OpenImages

TensorFlow implementation for Bayesian Modeling and Uncertainty Quantification for Learning to Optimize: What, Why, and How

Codes for building and training the neural network model described in Domain-informed neural networks for interaction localization within astroparticle experiments.

A repo to show how to use custom dataset to train s2anet, and change backbone to resnext101

Framework for estimating the structures and parameters of Bayesian networks (DAGs) at per-sample resolution

AdaSpeech 2: Adaptive Text to Speech with Untranscribed Data

UniLM AI - Large-scale Self-supervised Pre-training across Tasks, Languages, and Modalities

CSAC - Collaborative Semantic Aggregation and Calibration for Separated Domain Generalization

Official PyTorch implementation of the preprint paper "Stylized Neural Painting", accepted to CVPR 2021.

dualFace: Two-Stage Drawing Guidance for Freehand Portrait Sketching (CVMJ)

Implementation for On Provable Benefits of Depth in Training Graph Convolutional Networks

Code and data of the Fine-Grained R2R Dataset proposed in paper Sub-Instruction Aware Vision-and-Language Navigation

Breaching - Breaching privacy in federated learning scenarios for vision and text