Learning from Synthetic Data with Fine-grained Attributes for Person Re-Identification

Related tags

Deep LearningFineGPR
Overview

Less is More: Learning from Synthetic Data with Fine-grained Attributes for Person Re-Identification

Suncheng Xiang

Shanghai Jiao Tong University

Overview

In this paper, we construct and label a large-scale synthetic person dataset named FineGPR with fine-grained attribute distribution. Moreover, aiming to fully exploit the potential of FineGPR and promote the efficient training from millions of synthetic data, we propose an attribute analysis pipeline AOST to learn attribute distribution in target domain, then apply style transfer network to eliminate the gap between synthetic and real-world data and thus is freely deployed to new scenarios. Experiments conducted on benchmarks demonstrate that FineGPR with AOST outperforms (or is on par with) existing real and synthetic datasets, which suggests its feasibility for re-ID and proves the proverbial less-is-more principle. We hope this fine-grained dataset could advance research towards re-ID in real scenarios.


[Paper] [Video Sample] [Related Project]


๐Ÿ”ฅ NEWS ๐Ÿ”ฅ

  • [10/2021] ๐Ÿ“ฃ The first FineGPR-C caption dataset involving human describing event is coming !

  • [09/2021] ๐Ÿ“ฃ The large-scale synthetic person dataset FineGPR with fine-grained attribute distribution is released !


Table of Contents ๐Ÿ‘€


FineGPR Introduction

The FineGPR dataset is generated by a popular GTA5 game engine that can synthesise images under controllable viewpoints, weathers,illuminations and backgrounds, as well as 13 fine-grained attributes at the identity level ๐Ÿ‘ .

Our FineGPR dataset provides fine-grained and accurately configurable annotations, including 36 different viewpoints, 7 different kinds of weathers, 7 different kinds of illuminations, and 9 different kinds of backgrounds.

Viewpoint ๐Ÿ“ท

Definition of different viewpoints. Viewpoints of one identity are sampled at an interval of 10ยฐ, e.g. 0ยฐ-80ยฐ denotes that a person has 9 different angles in total.

Weather ๐ŸŒจ and Illumination ๐ŸŽ‡

The exemplars of different weather distribution (left) and illumination distribution (right) from the proposed FineGPR dataset.

Attributes at the Identity Level โ›น๏ธโ€โ™€๏ธ

The distributions of attributes at the identity level on FineGPR. The left figure shows the numbers of IDs for each attribute. The middle and right pies illustrate the distribution of the colors of upper-body and low-body clothes respectively.

Some visual exemplars with ID-level pedestrian attributes in the proposed FineGPR dataset, such as Wear short sleeve , Wear dress, Wear hat, Carry bag, etc.


Comparison with existing datasets

Some Mainstream Datasets for Person Re-Identification

For related FineGPR dataset (details of the previous related work, please refer to the our homepage GPR ๐Ÿ”Ž :

dataset IDs (ID-Attributes) boxs cams weathers illumination scene resolution
Market-1501 1,501 ( โœ”๏ธ ) 32,668 6 - - - low
CUHK03 1,467 ( โŒ ) 14,096 2 - - - low
DukeMTMC-reID 1,404 ( โœ”๏ธ ) 36,411 8 - - - low
MSMT17 4,101 ( โŒ ) 126,441 15 - - - vary
SOMAset 50 ( โŒ ) 100,000 250 - - - -
SyRI 100 ( โŒ ) 1,680,000 100 - 140 - -
PersonX 1,266 ( โŒ ) 273,456 6 - - 1 vary
Unreal 3,000 ( โŒ ) 120,000 34 - - 1 low
RandPerson 8,000 ( โŒ ) 1,801,816 19 - - 4 low
FineGPR 1150 ( โœ”๏ธ ) 2,028,600 36 7 7 9 high

Link of the Dataset

Data of FineGPR for Viewpoint Analysis

A small subset of FineGPR can be downloaded from the following links:

Directories & Files of images

FineGPR_Dataset 
โ”œโ”€โ”€ FineGPR/   # This file is our original dataset, we provide the samples of ID=0001 and ID=0003 in this file folder.
โ”‚   โ”œโ”€โ”€ 0001
โ”‚   โ”‚   โ”œโ”€โ”€ 0001_c01_w01_l01_p01.jpg 
โ”‚   โ”‚	โ”œโ”€โ”€ 0001_c01_w01_l02_p01.jpg  
โ”‚   โ”‚   โ”œโ”€โ”€ 0001_c01_w01_l03_p01.jpg
โ”‚   โ”‚   โ””โ”€โ”€ ...
โ”‚   โ”œโ”€โ”€ 0003/
โ”‚   โ”‚   โ”œโ”€โ”€ 0003_c01_w01_l01_p06.jpg  
โ”‚   โ”‚   โ”œโ”€โ”€ 0003_c01_w01_l02_p06.jpg
โ”‚   โ”‚   โ”œโ”€โ”€ 0003_c01_w01_l03_p06.jpg	   
โ”‚   โ”‚   โ””โ”€โ”€ ...
โ”‚   โ””โ”€โ”€ ...
โ”œโ”€โ”€ FineGPR_subset   # This file is the subset of FineGPR dataset, each Identity contains 4 images. 
โ”‚   โ”œโ”€โ”€ 0001_c01_w03_l05_p03.jpg 
โ”‚   โ”œโ”€โ”€ 0001_c10_w03_l05_p03.jpg
โ”‚   โ”œโ”€โ”€ 0001_c19_w03_l05_p03.jpg
โ”‚   โ”œโ”€โ”€ 0001_c28_w03_l05_p03.jpg
โ”‚   โ”œโ”€โ”€ 0003_c01_w03_l05_p08.jpg 
โ”‚   โ”œโ”€โ”€ 0003_c10_w03_l05_p08.jpg
โ”‚   โ”œโ”€โ”€ 0003_c19_w03_l05_p08.jpg
โ”‚   โ”œโ”€โ”€ 0003_c28_w03_l05_p08.jpg  
โ”‚   โ””โ”€โ”€ ...
โ””โ”€โ”€ README.md   # Readme file

Name of the image

Taking "0001_c01_w01_l01_p01.jpg" as an example:

  • 0001 is the id of the person
  • c01 is the id of the camera
  • w01 is the id of the weather
  • l01 is the id of the illumination
  • p01 is the id of the background

Viewpoint annotations

FineGPR
โ”œโ”€โ”€ c01๏ผš90ยฐ      โ”œโ”€โ”€ c10๏ผš180ยฐ      โ”œโ”€โ”€ c19๏ผš270ยฐ      โ”œโ”€โ”€ c28๏ผš0ยฐ
โ”œโ”€โ”€ c02๏ผš100ยฐ     โ”œโ”€โ”€ c11๏ผš190ยฐ      โ”œโ”€โ”€ c20๏ผš280ยฐ      โ”œโ”€โ”€ c29๏ผš10ยฐ
โ”œโ”€โ”€ c03๏ผš110ยฐ     โ”œโ”€โ”€ c12๏ผš200ยฐ      โ”œโ”€โ”€ c21๏ผš290ยฐ      โ”œโ”€โ”€ c30๏ผš20ยฐ
โ”œโ”€โ”€ c04๏ผš120ยฐ     โ”œโ”€โ”€ c13๏ผš210ยฐ      โ”œโ”€โ”€ c22๏ผš300ยฐ      โ”œโ”€โ”€ c31๏ผš30ยฐ
โ”œโ”€โ”€ c05๏ผš130ยฐ     โ”œโ”€โ”€ c14๏ผš220ยฐ      โ”œโ”€โ”€ c23๏ผš310ยฐ      โ”œโ”€โ”€ c32๏ผš40ยฐ
โ”œโ”€โ”€ c06๏ผš140ยฐ     โ”œโ”€โ”€ c15๏ผš230ยฐ      โ”œโ”€โ”€ c24๏ผš320ยฐ      โ”œโ”€โ”€ c33๏ผš50ยฐ
โ”œโ”€โ”€ c07๏ผš150ยฐ     โ”œโ”€โ”€ c16๏ผš240ยฐ      โ”œโ”€โ”€ c25๏ผš330ยฐ      โ”œโ”€โ”€ c34๏ผš60ยฐ
โ”œโ”€โ”€ c08๏ผš160ยฐ     โ”œโ”€โ”€ c17๏ผš250ยฐ      โ”œโ”€โ”€ c26๏ผš340ยฐ      โ”œโ”€โ”€ c35๏ผš70ยฐ
โ””โ”€โ”€ c09๏ผš170ยฐ     โ””โ”€โ”€ c18๏ผš260ยฐ      โ””โ”€โ”€ c27๏ผš350ยฐ      โ””โ”€โ”€ c36๏ผš80ยฐ

Weather annotations

FineGPR
โ”œโ”€โ”€ w01๏ผšSunny
โ”œโ”€โ”€ w02๏ผšClouds    
โ”œโ”€โ”€ w03๏ผšOvercast
โ”œโ”€โ”€ w04๏ผšFoggy   
โ”œโ”€โ”€ w05๏ผšNeutral
โ”œโ”€โ”€ w06๏ผšBlizzard 
โ””โ”€โ”€ w07๏ผšSnowlight 	   

Illumination annotations

FineGPR
โ”œโ”€โ”€ l01๏ผšMidnight
โ”œโ”€โ”€ l02๏ผšDawn    
โ”œโ”€โ”€ l03๏ผšForenoon
โ”œโ”€โ”€ l04๏ผšNoon   
โ”œโ”€โ”€ l05๏ผšAfternoon
โ”œโ”€โ”€ l06๏ผšDusk 
โ””โ”€โ”€ l07๏ผšNight 	   

Scene annotations

FineGPR
โ”œโ”€โ”€ p01๏ผšUrban
โ”œโ”€โ”€ p02๏ผšUrban   
โ”œโ”€โ”€ p03๏ผšWild
โ”œโ”€โ”€ p04๏ผšUrban   
โ”œโ”€โ”€ p05๏ผšWild
โ”œโ”€โ”€ p06๏ผšUrban
โ”œโ”€โ”€ p07๏ผšUrban
โ”œโ”€โ”€ p08๏ผšWild 
โ””โ”€โ”€ p09๏ผšUrban 	   

Method

๐Ÿ’ก The two-stage pipeline AOST to learn attribute distribution of target domain. Firstly, we learn attribute distribution of real domain on the basis of XGBoost & PSO learning system. Secondly, we perform style transfer to enhance the reality of optimal dataset. Finally, the transferred data are adopted for downstream re-ID task.


Results

Performance comparison with existing Real and Synthetic datasets on Market-1501, DukeMTMC-reID and CUHK03, respectively.

References

  • [1] Image-image domain adaptation with preserved self-similarity and domain-dissimilarity for person re-identification. CVPR 2018.
  • [2] Bag of tricks and a strong baseline for deep person re-identification. CVPRW 2019.

Extendibility

Accompanied with our FineGPR, we also provide some human body masks (Middle) and keypoint locations (Bottom) of all characters during the annotation. We hope that our synthetic dataset FineGPR can not only contribute a lot to the development of generalizable person re-ID, but also advance the research of other computer vision tasks, such as human part segmentation and pose estimation.

FineGPR-C caption dataset

On the basis of FineGPR dafaset, we introduce a dynamic strategy to generate high-quality captions with fine-grained attribute annotations for semantic-based pretraining. To be more specific, we rearrange the different attributes as word embeddings into caption formula in the different position, and then generate semantically dense caption with high-quality description, which gives rise to our newly constructed FineGPR-C caption dataset.

A small subset of FineGPR-C caption dataset can be downloaded from the following links:

Citation

If you use our FineGPR dataset for your research, please cite our Paper.

@article{xiang2021less,
  title={Less is More: Learning from Synthetic Data with Fine-grained Attributes for Person Re-Identification},
  author={Xiang, Suncheng and You, Guanjie and Guan, Mengyuan and Chen, Hao and Wang, Feng and Liu, Ting and Fu, Yuzhuo},
  journal={arXiv preprint arXiv:2109.10498},
  year={2021}
}

If you do think this FineGPR-C caption dataset is useful and have used it in your research, please cite our Paper.

@article{xiang2021vtbr,
  title={VTBR: Semantic-based Pretraining for Person Re-Identification},
  author={Xiang, Suncheng and Zhang, Zirui and Guan, Mengyuan and Chen, Hao and Yan, Binjie and Liu, Ting and Fu, Yuzhuo},
  journal={arXiv preprint arXiv:2110.05074},
  year={2021}
}

Ethical Considerations

Our task and dataset were created with careful attention to ethical questions, which we encountered throughout our work. Access to our dataset will be provided for research purposes only and with restrictions on redistribution. Additionally, as we filtered out the sensitive attribute name in our fine-grained attribute annotation, our dataset cannot be easily repurposed for unintended tasks. Importantly, we are very cautious of human-annotation procedure of large scale datasets towards the social and ethical implications. Furthermore, we do not consider the datasets for developing non-research systems without further processing or augmentation. We hope this fine-grained dataset will shed light into potential tasks for the research community to move forward.


LICENSE

  • The FineGPR Dataset and FineGPR-C caption is made available for non-commercial purposes only.
  • You will not, directly or indirectly, reproduce, use, or convey the FineGPR dataset and FineGPR-C caption dataset or any Content, or any work product or data derived therefrom, for commercial purposes.

Permissions of this strong copyleft license (GNU General Public License v3.0) are conditioned on making available complete source code of licensed works and modifications, which include larger works using a licensed work, under the same license. Copyright and license notices must be preserved. Contributors provide an express grant of patent rights.


Acknowledgements

This research was supported by the National Natural Science Foundation of China under Project (Grant No. 61977045). We would like to thank authors of FineGPR, and FineGPR-Caption dataset for their work. They provide tremendous efforts in these dataset to advance the research in this field. We also appreciate Zefang Yu, Mingye Xie and Guanjie You for insightful feedback and discussion.


For further questions and suggestions about our datasets and methods, please feel free to contact Suncheng Xiang: [email protected]

Owner
SunchengXiang
SunchengXiang
KITTI-360 Annotation Tool is a framework that developed based on python(cherrypy + jinja2 + sqlite3) as the server end and javascript + WebGL as the front end.

KITTI-360 Annotation Tool is a framework that developed based on python(cherrypy + jinja2 + sqlite3) as the server end and javascript + WebGL as the front end.

86 Dec 12, 2022
A simple algorithm for extracting tree height in sparse scene from point cloud data.

TREE HEIGHT EXTRACTION IN SPARSE SCENES BASED ON UAV REMOTE SENSING This is the offical python implementation of the paper "Tree Height Extraction in

6 Oct 28, 2022
Official code for UnICORNN (ICML 2021)

UnICORNN (Undamped Independent Controlled Oscillatory RNN) [ICML 2021] This repository contains the implementation to reproduce the numerical experime

Konstantin Rusch 21 Dec 22, 2022
Face Depixelizer based on "PULSE: Self-Supervised Photo Upsampling via Latent Space Exploration of Generative Models" repository.

NOTE We have noticed a lot of concern that PULSE will be used to identify individuals whose faces have been blurred out. We want to emphasize that thi

Denis Malimonov 2k Dec 29, 2022
Gradient Step Denoiser for convergent Plug-and-Play

Source code for the paper "Gradient Step Denoiser for convergent Plug-and-Play"

Samuel Hurault 11 Sep 17, 2022
Square Root Bundle Adjustment for Large-Scale Reconstruction

RootBA: Square Root Bundle Adjustment Project Page | Paper | Poster | Video | Code Table of Contents Citation Dependencies Installing dependencies on

Nikolaus Demmel 205 Dec 20, 2022
Diagnostic tests for linguistic capacities in language models

LM diagnostics This repository contains the diagnostic datasets and experimental code for What BERT is not: Lessons from a new suite of psycholinguist

61 Jan 02, 2023
Julia and Matlab codes to simulated all problems in El-Hachem, McCue and Simpson (2021)

Substrate_Mediated_Invasion Julia and Matlab codes to simulated all problems in El-Hachem, McCue and Simpson (2021) 2DSolver.jl reproduces the simulat

Matthew Simpson 0 Nov 09, 2021
RetinaFace: Deep Face Detection Library in TensorFlow for Python

RetinaFace is a deep learning based cutting-edge facial detector for Python coming with facial landmarks.

Sefik Ilkin Serengil 512 Dec 29, 2022
Code for the paper SphereRPN: Learning Spheres for High-Quality Region Proposals on 3D Point Clouds Object Detection, ICIP 2021.

SphereRPN Code for the paper SphereRPN: Learning Spheres for High-Quality Region Proposals on 3D Point Clouds Object Detection, ICIP 2021. Authors: Th

Thang Vu 15 Dec 02, 2022
Code for our paper "SimCLS: A Simple Framework for Contrastive Learning of Abstractive Summarization", ACL 2021

SimCLS Code for our paper: "SimCLS: A Simple Framework for Contrastive Learning of Abstractive Summarization", ACL 2021 1. How to Install Requirements

Yixin Liu 150 Dec 12, 2022
A universal memory dumper using Frida

Fridump Fridump (v0.1) is an open source memory dumping tool, primarily aimed to penetration testers and developers. Fridump is using the Frida framew

551 Jan 07, 2023
Retrieve and analysis data from SDSS (Sloan Digital Sky Survey)

Author: Behrouz Safari License: MIT sdss A python package for retrieving and analysing data from SDSS (Sloan Digital Sky Survey) Installation Install

Behrouz 3 Oct 28, 2022
A pytorch implementation of Detectron. Both training from scratch and inferring directly from pretrained Detectron weights are available.

Use this instead: https://github.com/facebookresearch/maskrcnn-benchmark A Pytorch Implementation of Detectron Example output of e2e_mask_rcnn-R-101-F

Roy 2.8k Dec 29, 2022
Prototype-based Incremental Few-Shot Semantic Segmentation

Prototype-based Incremental Few-Shot Semantic Segmentation Fabio Cermelli, Massimiliano Mancini, Yongqin Xian, Zeynep Akata, Barbara Caputo -- BMVC 20

Fabio Cermelli 21 Dec 29, 2022
BARF: Bundle-Adjusting Neural Radiance Fields ๐Ÿคฎ (ICCV 2021 oral)

BARF ๐Ÿคฎ : Bundle-Adjusting Neural Radiance Fields Chen-Hsuan Lin, Wei-Chiu Ma, Antonio Torralba, and Simon Lucey IEEE International Conference on Comp

Chen-Hsuan Lin 539 Dec 28, 2022
Back to Event Basics: SSL of Image Reconstruction for Event Cameras

Back to Event Basics: SSL of Image Reconstruction for Event Cameras Minimal code for Back to Event Basics: Self-Supervised Learning of Image Reconstru

TU Delft 42 Dec 26, 2022
Our CIKM21 Paper "Incorporating Query Reformulating Behavior into Web Search Evaluation"

Reformulation-Aware-Metrics Introduction This codebase contains source-code of the Python-based implementation of our CIKM 2021 paper. Chen, Jia, et a

xuanyuan14 5 Mar 05, 2022
[PyTorch] Official implementation of CVPR2021 paper "PointDSC: Robust Point Cloud Registration using Deep Spatial Consistency". https://arxiv.org/abs/2103.05465

PointDSC repository PyTorch implementation of PointDSC for CVPR'2021 paper "PointDSC: Robust Point Cloud Registration using Deep Spatial Consistency",

153 Dec 14, 2022
TorchOk - The toolkit for fast Deep Learning experiments in Computer Vision

TorchOk - The toolkit for fast Deep Learning experiments in Computer Vision

52 Dec 23, 2022