Code for the ICCV2021 paper "Personalized Image Semantic Segmentation"

Last update: Jul 09, 2022

PSS: Personalized Image Semantic Segmentation

Paper

PSS: Personalized Image Semantic Segmentation
Yu Zhang, Chang-Bin Zhang, Peng-Tao Jiang, Ming-Ming Cheng, Feng Mao. International Conference on Computer Vision (ICCV), 2021

If you find this code useful for your research, please cite our paper:

@inproceedings{zhang2021pss,
  title={Personalized Image Semantic Segmentation},
  author={Zhang, Yu and Zhang, Chang-Bin and Jiang, Peng-Tao and Cheng, Ming-Ming and Mao, Feng},
  booktitle={ICCV},
  year={2021}
}

Abstract

Semantic segmentation models trained on public datasets have achieved great success in recent years. However, these models didn't consider the personalization issue of segmentation though it is important in practice. In this paper, we address the problem of personalized image segmentation. The objective is to generate more accurate segmentation results on unlabeled personalized images by investigating the data's personalized traits. To open up future research in this area, we collect a large dataset containing various users' personalized images called PIS (Personalized Image Semantic Segmentation). We also survey some recent researches related to this problem and report their performance on our dataset. Furthermore, by observing the correlation among a user's personalized images, we propose a baseline method that incorporates the inter-image context when segmenting certain images. Extensive experiments show that our method outperforms the existing methods on the proposed dataset. The code and the PIS dataset will be made publicly available.

Test code

Preparation

Our code is built on ADVENT (https://github.com/valeoai/ADVENT). After cloning our repo, install ADVENT:

$ conda install -c menpo opencv  # install opencv
$ pip install -e <root_dir>  # install advent

Make a new directory for the datasets and results:

mkdir ./data

Dataset

Download our PSS dataset and put it under ./data/personal.

Dataset License:

Our dataset is available for academic research only. Although we have obtained the copyright of the personalized photos, the users' privacy remains important. To request access, please send me an email from your school or company address stating the purpose for which you will use the dataset. Thank you for your understanding. (pt.jiang AT mail.nankai.edu.cn)

Pre-trained models

Our pretrained models can be downloaded here. We provide the step-2 models, which are fine-tuned with pseudo labels and reported as OURS-S2 in the paper. Download them and put them under ./data/final_res50_step2.

After preparing the dataset and pretrained models, the directory structure should look like this:

./data/personal/
               id1
               id2
               ...
               id15
      /final_res50_step2/
                         id1.pth
                         id2.pth
                         ...
                         id15.pth
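The layout above can be checked before running the test. The following is a minimal sketch, not part of the repository: the function name missing_entries is hypothetical, and it assumes the folders and weight files are numbered consecutively from id1 to id15, as the listing suggests.

```python
from pathlib import Path


def missing_entries(data_root, num_ids=15):
    """Return the expected paths that do not exist under data_root.

    Assumption: user folders are named id1..id{num_ids} and each has a
    matching weight file id<N>.pth, as in the directory listing above.
    """
    root = Path(data_root)
    missing = []
    for i in range(1, num_ids + 1):
        image_dir = root / "personal" / f"id{i}"
        weights = root / "final_res50_step2" / f"id{i}.pth"
        if not image_dir.is_dir():
            missing.append(image_dir)
        if not weights.is_file():
            missing.append(weights)
    return missing


if __name__ == "__main__":
    # Report every expected folder or weight file that is absent.
    for path in missing_entries("./data"):
        print(f"missing: {path}")
```

If the script prints nothing, every expected folder and weight file was found.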

Run test

Run:

bash ./PSS_test.sh

You should then find the segmentation results for each user's images under ./data/final_res50_step2. The test code runs inference for all 15 user IDs in one pass. To test only certain user IDs, modify line 153 of ./test.py.
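The contents of line 153 of ./test.py are not shown here, so the following is only a sketch of one way to restrict a run to a subset of user IDs. The helper select_user_ids is hypothetical and does not exist in the repository; it assumes IDs are named id1 through id15.

```python
def select_user_ids(requested=None, num_ids=15):
    """Return the user IDs to test, in canonical order.

    Hypothetical helper, not taken from test.py. Assumes IDs are named
    id1..id{num_ids}. With no argument, all IDs are returned.
    """
    all_ids = [f"id{i}" for i in range(1, num_ids + 1)]
    if requested is None:
        return all_ids
    unknown = [uid for uid in requested if uid not in all_ids]
    if unknown:
        raise ValueError(f"unknown user IDs: {unknown}")
    # Keep the canonical order and drop duplicates.
    wanted = set(requested)
    return [uid for uid in all_ids if uid in wanted]


if __name__ == "__main__":
    print(select_user_ids(["id3", "id1"]))
```

A list produced this way could replace whatever collection of IDs the test loop iterates over, assuming the loop is driven by ID names.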

License

PSS code is released under the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International Public License for NonCommercial use only. Any commercial use should get formal permission first.

Owner

张宇
