Official Implementation and Dataset of "PPR10K: A Large-Scale Portrait Photo Retouching Dataset with Human-Region Mask and Group-Level Consistency", CVPR 2021

Last update: Dec 11, 2022

Related tags

Deep Learning PPR10K

Overview

Portrait Photo Retouching with PPR10K

Paper | Supplementary Material

PPR10K: A Large-Scale Portrait Photo Retouching Dataset with Human-Region Mask and Group-Level Consistency
Jie Liang*, Hui Zeng*, Miaomiao Cui, Xuansong Xie and Lei Zhang.
In CVPR 2021.

The proposed Portrait Photo Retouching dataset (PPR10K) is a large-scale and diverse dataset that contains:

11,161 high-quality raw portrait photos (resolutions from 4K to 8K) in 1,681 groups;
3 versions of manual retouched targets of all photos given by 3 expert retouchers;
full resolution human-region masks of all photos.

Samples

Two example groups of photos from the PPR10K dataset. Top: the raw photos; Bottom: the retouched results from expert-a and the human-region masks. The raw photos exhibit poor visual quality and large variance in subject views, background contexts, lighting conditions and camera settings. In contrast, the retouched results demonstrate both good visual quality (with human-region priority) and group-level consistency.

This dataset is first of its kind to consider the two special and practical requirements of portrait photo retouching task, i.e., Human-Region Priority and Group-Level Consistency. Three main challenges are expected to be tackled in the follow-up researches:

Flexible and content-adaptive models for such a diverse task regarding both image contents and lighting conditions;
Highly efficient models to process practical resolution from 4K to 8K;
Robust and stable models to meet the requirement of group-level consistency.

Agreement

All files in the PPR10K dataset are available for non-commercial research purposes only.
You agree not to reproduce, duplicate, copy, sell, trade, resell or exploit for any commercial purposes, any portion of the images and any portion of derived data.

Overview

All data is hosted on GoogleDrive, OneDrive and 百度网盘 (验证码: mrwn):

Path	Size	Files	Format	Description
PPR10K-dataset	406 GB	176,072		Main folder
├ raw	313 GB	11,161	RAW	All photos in raw format (.CR2, .NEF, .ARW, etc)
├ xmp_source	130 MB	11,161	XMP	Default meta-file of the raw photos in CameraRaw, used in our data augmentation
├ xmp_target_a	130 MB	11,161	XMP	CameraRaw meta-file of the raw photos recoding the full adjustments by expert a
├ xmp_target_b	130 MB	11,161	XMP	CameraRaw meta-file of the raw photos recoding the full adjustments by expert b
├ xmp_target_c	130 MB	11,161	XMP	CameraRaw meta-file of the raw photos recoding the full adjustments by expert c
├ masks_full	697 MB	11,161	PNG	Full-resolution human-region masks in binary format
├ masks_360p	56 MB	11,161	PNG	360p human-region masks for fast training and validation
├ train_val_images_tif_360p	91 GB	97894	TIF	360p Source (16 bit tiff, with 5 versions of augmented images) and target (8 bit tiff) images for fast training and validation
├ pretrained_models	268 MB	12	PTH	pretrained models for all 3 versions
└ hists	624KB	39	PNG	Overall statistics of the dataset

One can directly use the 360p (of 540x360 or 360x540 resolution in sRGB color space) training and validation files (photos, 5 versions of augmented photos and the corresponding human-region masks) we have provided following the settings in our paper (train with the first 8,875 files and validate with the last 2286 files).
Also, see the instructions to customize your data (e.g., augment the training samples regarding illuminations and colors, get photos with higher or full resolutions).

Training and Validating the PPR using 3DLUT

Installation

Clone this repo.

git clone https://github.com/csjliang/PPR10K
cd PPR10K/code_3DLUT/

Install dependencies.

pip install -r requirements.txt

Build. Modify the CUDA path in trilinear_cpp/setup.sh adaptively and

cd trilinear_cpp
sh trilinear_cpp/setup.sh

Training

Training without HRP and GLC strategy, save models:

python train.py --data_path [path_to_dataset] --gpu_id [gpu_id] --use_mask False --output_dir [path_to_save_models]

Training with HRP and without GLC strategy, save models:

python train.py --data_path [path_to_dataset] --gpu_id [gpu_id] --use_mask True --output_dir [path_to_save_models]

Training without HRP and with GLC strategy, save models:

python train_GLC.py --data_path [path_to_dataset] --gpu_id [gpu_id] --use_mask False --output_dir [path_to_save_models]

Training with both HRP and GLC strategy, save models:

python train_GLC.py --data_path [path_to_dataset] --gpu_id [gpu_id] --use_mask True --output_dir [path_to_save_models]

Evaluation

Generate the retouched results:

python validation.py --data_path [path_to_dataset] --gpu_id [gpu_id] --model_dir [path_to_models]

Use matlab to calculate the measures in our paper:

calculate_metrics(source_dir, target_dir, mask_dir)

Pretrained Models

Download the pretrained models from GoogleDrive, OneDrive or 百度网盘, and move them to the directory saved_models:

mv your/path/to/pretrained_models/* saved_models/

specify the --model_dir and --epoch (-1) to validate or initialize the training using the pretrained models, e.g.,

python validation.py --data_path [path_to_dataset] --gpu_id [gpu_id] --model_dir mask_noglc_a --epoch -1
python train.py --data_path [path_to_dataset] --gpu_id [gpu_id] --use_mask True --output_dir mask_noglc_a --epoch -1

Citation

If you use this dataset or code for your research, please cite our paper.

@inproceedings{jie2021PPR10K,
  title={PPR10K: A Large-Scale Portrait Photo Retouching Dataset with Human-Region Mask and Group-Level Consistency},
  author={Liang, Jie and Zeng, Hui and Cui, Miaomiao and Xie, Xuansong and Zhang, Lei},
  booktitle={Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition},
  year={2021}
}

Related Projects

3D LUT

Contact

Should you have any questions, please contact me via [email protected].

Official Implementation and Dataset of "PPR10K: A Large-Scale Portrait Photo Retouching Dataset with Human-Region Mask and Group-Level Consistency", CVPR 2021

Related tags

Overview

Portrait Photo Retouching with PPR10K

Paper | Supplementary Material

Samples

Agreement

Overview

Training and Validating the PPR using 3DLUT

Installation

Training

Evaluation

Pretrained Models

Citation

Related Projects

Contact

Owner

CDGAN: Cyclic Discriminative Generative Adversarial Networks for Image-to-Image Transformation

The original implementation of TNDM used in the NeurIPS 2021 paper (no longer being updated)

Testbed of AI Systems Quality Management

PCAM: Product of Cross-Attention Matrices for Rigid Registration of Point Clouds

Official Implementation for HyperStyle: StyleGAN Inversion with HyperNetworks for Real Image Editing

TensorFlow 2 implementation of the Yahoo Open-NSFW model

A curated list of awesome Machine Learning frameworks, libraries and software.

This repository contains answers of the Shopify Summer 2022 Data Science Intern Challenge.

SafePicking: Learning Safe Object Extraction via Object-Level Mapping, ICRA 2022

The official PyTorch implementation of recent paper - SAINT: Improved Neural Networks for Tabular Data via Row Attention and Contrastive Pre-Training

Time series annotation library.

Sound-guided Semantic Image Manipulation - Official Pytorch Code (CVPR 2022)

An unofficial styleguide and best practices summary for PyTorch

PyTorch code accompanying the paper "Landmark-Guided Subgoal Generation in Hierarchical Reinforcement Learning" (NeurIPS 2021).

PyTorch implementation of Masked Autoencoders Are Scalable Vision Learners for self-supervised ViT.

HODEmu, is both an executable and a python library that is based on Ragagnin 2021 in prep.

Official implementation of "SegFormer: Simple and Efficient Design for Semantic Segmentation with Transformers"

This project is the official implementation of our accepted ICLR 2021 paper BiPointNet: Binary Neural Network for Point Clouds.

The offcial repository for 'CharacterBERT and Self-Teaching for Improving the Robustness of Dense Retrievers on Queries with Typos', SIGIR2022

The official implementation of Equalization Loss for Long-Tailed Object Recognition (CVPR 2020) based on Detectron2