Exploring the Dual-task Correlation for Pose Guided Person Image Generation

Last update: Dec 15, 2022

Overview

Dual-task Pose Transformer Network

The source code for our paper "Exploring Dual-task Correlation for Pose Guided Person Image Generation“ (CVPR2022)

Get Start

1) Requirement

Python 3.7.9
Pytorch 1.7.1
torchvision 0.8.2
CUDA 11.1
NVIDIA A100 40GB PCIe

2) Data Preperation

Following PATN, the dataset split files and extracted keypoints files can be obtained as follows:

DeepFashion

Download the DeepFashion dataset in-shop clothes retrival benchmark, and put them under the ./dataset/fashion directory.
Download train/test pairs and train/test keypoints annotations from Google Drive, including fasion-resize-pairs-train.csv, fasion-resize-pairs-test.csv, fasion-resize-annotation-train.csv, fasion-resize-annotation-train.csv, train.lst, test.lst, and put them under the ./dataset/fashion directory.
Split the raw image into the training set (./dataset/fashion/train) and test set (./dataset/fashion/test):

python data/generate_fashion_datasets.py

Market1501

Download the Market1501 dataset from here. Rename bounding_box_train and bounding_box_test as train and test, and put them under the ./dataset/market directory.
Download train/test key points annotations from Google Drive including market-pairs-train.csv, market-pairs-test.csv, market-annotation-train.csv, market-annotation-train.csv. Put these files under the ./dataset/market directory.

3) Train a model

DeepFashion

python train.py --name=DPTN_fashion --model=DPTN --dataset_mode=fashion --dataroot=./dataset/fashion --batchSize 32 --gpu_id=0

Market1501

python train.py --name=DPTN_market --model=DPTN --dataset_mode=market --dataroot=./dataset/market --dis_layer=3 --lambda_g=5 --lambda_rec 2 --t_s_ratio=0.8 --save_latest_freq=10400 --batchSize 32 --gpu_id=0

4) Test the model

You can directly download our test results from Google Drive: Deepfashion, Market1501.

DeepFashion

python test.py --name=DPTN_fashion --model=DPTN --dataset_mode=fashion --dataroot=./dataset/fashion --which_epoch latest --results_dir ./results/DPTN_fashion --batchSize 1 --gpu_id=0

Market1501

python test.py --name=DPTN_market --model=DPTN --dataset_mode=market --dataroot=./dataset/market --which_epoch latest --results_dir=./results/DPTN_market  --batchSize 1 --gpu_id=0

5) Evaluation

We adopt SSIM, PSNR, FID and LPIPS for the evaluation.

DeepFashion

python -m  metrics.metrics --gt_path=./dataset/fashion/test --distorated_path=./results/DPTN_fashion --fid_real_path=./dataset/fashion/train --name=./fashion

Market1501

python -m  metrics.metrics --gt_path=./dataset/market/test --distorated_path=./results/DPTN_market --fid_real_path=./dataset/market/train --name=./market --market

6) Pre-trained Model

Our pre-trained model can be downloaded from Google Drive: Deepfashion, Market1501.

Citation

Acknowledgement

We build our project based on pix2pix. Some dataset preprocessing methods are derived from PATN.

Exploring the Dual-task Correlation for Pose Guided Person Image Generation

Related tags

Overview

Dual-task Pose Transformer Network

Get Start

1) Requirement

2) Data Preperation

3) Train a model

4) Test the model

5) Evaluation

6) Pre-trained Model

Citation

Acknowledgement

Owner

Implementation of the method described in the Speech Resynthesis from Discrete Disentangled Self-Supervised Representations.

Rule Extraction Methods for Interactive eXplainability

SenseNet is a sensorimotor and touch simulator for deep reinforcement learning research

PyTorch Implementation of NCSOFT's FastPitchFormant: Source-filter based Decomposed Modeling for Speech Synthesis

Attendance Monitoring with Face Recognition using Python

A Python library for adversarial machine learning focusing on benchmarking adversarial robustness.

PyTorch version of the paper 'Enhanced Deep Residual Networks for Single Image Super-Resolution' (CVPRW 2017)

My personal Home Assistant configuration.

Source code and data in paper "MDFEND: Multi-domain Fake News Detection (CIKM'21)"

68 keypoint annotations for COFW test data

This repository contains a pytorch implementation of "StereoPIFu: Depth Aware Clothed Human Digitization via Stereo Vision".

Companion code for the paper Theoretical characterization of uncertainty in high-dimensional linear classification

A graph adversarial learning toolbox based on PyTorch and DGL.

Spiking Neural Network for Computer Vision using SpikingJelly framework and Pytorch-Lightning

QQ Browser 2021 AI Algorithm Competition Track 1 1st Place Program

Single-step adversarial training (AT) has received wide attention as it proved to be both efficient and robust.

DeepRec is a recommendation engine based on TensorFlow.

A Pytorch Implementation for Compact Bilinear Pooling.

Code for our EMNLP 2021 paper “Heterogeneous Graph Neural Networks for Keyphrase Generation”

A stable algorithm for GAN training