Commonality in Natural Images Rescues GANs: Pretraining GANs with Generic and Privacy-free Synthetic Data - Official PyTorch Implementation (CVPR 2022)

Last update: Sep 27, 2022

Related tags

Deep Learning Primitives-PS

Overview

Commonality in Natural Images Rescues GANs: Pretraining GANs with Generic and Privacy-free Synthetic Data
(CVPR 2022)

Potentials of primitive shapes for representing things. We only use a line, ellipse, and rectangle to express a cat and a temple. These examples motivate us to develop Primitives, which generates the data by a simple composition of the shapes.

Official pytorch implementation of "Commonality in Natural Images Rescues GANs: Pretraining GANs with Generic and Privacy-free Synthetic Data"

Commonality in Natural Images Rescues GANs: Pretraining GANs with Generic and Privacy-free Synthetic Data
Kyungjune Baek and Hyunjung Shim

Yonsei University

Absract Transfer learning for GANs successfully improves generation performance under low-shot regimes. However, existing studies show that the pretrained model using a single benchmark dataset is not generalized to various target datasets. More importantly, the pretrained model can be vulnerable to copyright or privacy risks as membership inference attack advances. To resolve both issues, we propose an effective and unbiased data synthesizer, namely Primitives-PS, inspired by the generic characteristics of natural images. Specifically, we utilize 1) the generic statistics on the frequency magnitude spectrum, 2) the elementary shape (i.e., image composition via elementary shapes) for representing the structure information, and 3) the existence of saliency as prior. Since our synthesizer only considers the generic properties of natural images, the single model pretrained on our dataset can be consistently transferred to various target datasets, and even outperforms the previous methods pretrained with the natural images in terms of Fr'echet inception distance. Extensive analysis, ablation study, and evaluations demonstrate that each component of our data synthesizer is effective, and provide insights on the desirable nature of the pretrained model for the transferability of GANs.

Requirement

Environment

For the easy construction of environment, please use the docker image.

Replace $DOCKER_CONTAINER_NAME, $LOCAL_MAPPING_DIRECTORY, and $DOCKER_MAPPING_DIRECTORY to your own name and directories.

nvidia-docker run -it --entrypoint /bin/bash --shm-size 96g --name $DOCKER_CONTAINER_NAME -v $LOCAL_MAPPING_DIRECTORY:$DOCKER_MAPPING_DIRECTORY bkjbkj12/stylegan2_ada-pytorch1.8:1.0

nvidia-docker start $DOCKER_CONTAINER_NAME
nvidia-docker exec -it $DOCKER_CONTAINER_NAME bash

Then, go to the directory containing the source code

Dataset

The low-shot datasets are from DiffAug repository.

Pretrained checkpoint

Please download the source model (pretrained model) below. (Mainly used Primitives-PS)

Hardware

Mainly tested on Titan XP (12GB), V100 (32GB) and A6000 (48GB).

How to Run (Quick Start)

Pretraining To change the type of the pretraining dataset, comment out ant in these lines.

The file "noise.zip" is not required. (Just running the script will work well.)

CUDA_VISIBLE_DEVICES=$GPU_NUMBER python train.py --outdir=$OUTPUT_DIR --data=./data/noise.zip --gpus=1

Finetuning Change or locate the pretrained pkl file into the directory specified at the code.

CUDA_VISIBLE_DEVICES=$GPU_NUMBER python train.py --outdir=$OUTPUT_DIR --gpus=1 --data $DATA_DIR --kimg 400 --resume $PKL_NAME_TO_RESUME

Examples

Pretraining:
CUDA_VISIBLE_DEVICES=0 python train.py --outdir=Primitives-PS-Pretraining --data=./data/noise.zip --gpus=1

Finetuning:
CUDA_VISIBLE_DEVICES=0 python train.py --outdir=Primitives-PS-to-Obama --gpus=1 --data ../data/obama.zip --kimg 400 --resume Primitives-PS

Pretrained Model

Download

Google Drive


PinkNoise	Primitives	Primitives-S	Primitives-PS
Obama	Grumpy Cat	Panda	Bridge of Sigh
Medici fountain	Temple of heaven	Wuzhen	Buildings

Synthetic Datasets

Results

Generating images from the same latent vector

GIF

Because of the limitation on the file size, the model dose not fully converge (total 400K but .gif contains 120K iterations).

Low-shot generation

CIFAR

Note

This repository is built upon DiffAug.

Citation

If you find this work useful for your research, please cite our paper:

@InProceedings{Baek2022Commonality,
    author    = {Baek, Kyungjune and Shim, Hyunjung},
    title     = {Commonality in Natural Images Rescues GANs: Pretraining GANs with Generic and Privacy-free Synthetic Data},
    booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
    year      = {2022}
}

Commonality in Natural Images Rescues GANs: Pretraining GANs with Generic and Privacy-free Synthetic Data - Official PyTorch Implementation (CVPR 2022)

Related tags

Overview

Commonality in Natural Images Rescues GANs: Pretraining GANs with Generic and Privacy-free Synthetic Data
(CVPR 2022)

Requirement

How to Run (Quick Start)

Pretrained Model

Synthetic Datasets

Results

Generating images from the same latent vector

GIF

Low-shot generation

CIFAR

Note

Citation

Owner

Read number plates with https://platerecognizer.com/

Attention-guided gan for synthesizing IR images

This is the official implementation of TrivialAugment and a mini-library for the application of multiple image augmentation strategies including RandAugment and TrivialAugment.

Chainer Implementation of Fully Convolutional Networks. (Training code to reproduce the original result is available.)

Let Python optimize the best stop loss and take profits for your TradingView strategy.

Official pytorch implementation of Active Learning for deep object detection via probabilistic modeling (ICCV 2021)

This is the repository for our paper Ditch the Gold Standard: Re-evaluating Conversational Question Answering

Official code repository for the work: "The Implicit Values of A Good Hand Shake: Handheld Multi-Frame Neural Depth Refinement"

Code for KHGT model, AAAI2021

Generative Autoregressive, Normalized Flows, VAEs, Score-based models (GANVAS)

This repo. is an implementation of ACFFNet, which is accepted for in Image and Vision Computing.

Train a deep learning net with OpenStreetMap features and satellite imagery.

Locally Constrained Self-Attentive Sequential Recommendation

LLVM-based compiler for LightGBM gradient-boosted trees. Speeds up prediction by ≥10x.

[IROS2021] NYU-VPR: Long-Term Visual Place Recognition Benchmark with View Direction and Data Anonymization Influences

Deep Reinforcement Learning for mobile robot navigation in ROS Gazebo simulator

Personal thermal comfort models using digital twins: Preference prediction with BIM-extracted spatial-temporal proximity data from Build2Vec

A library for efficient similarity search and clustering of dense vectors.

This is the official implementation for "Do Transformers Really Perform Bad for Graph Representation?".

Finite-temperature variational Monte Carlo calculation of uniform electron gas using neural canonical transformation.

Commonality in Natural Images Rescues GANs: Pretraining GANs with Generic and Privacy-free Synthetic Data - Official PyTorch Implementation (CVPR 2022)

Related tags

Overview

Commonality in Natural Images Rescues GANs: Pretraining GANs with Generic and Privacy-free Synthetic Data(CVPR 2022)

Requirement

How to Run (Quick Start)

Pretrained Model

Synthetic Datasets

Results

Generating images from the same latent vector

GIF

Low-shot generation

CIFAR

Note

Citation

Owner

Read number plates with https://platerecognizer.com/

Attention-guided gan for synthesizing IR images

This is the official implementation of TrivialAugment and a mini-library for the application of multiple image augmentation strategies including RandAugment and TrivialAugment.

Chainer Implementation of Fully Convolutional Networks. (Training code to reproduce the original result is available.)

Let Python optimize the best stop loss and take profits for your TradingView strategy.

Official pytorch implementation of Active Learning for deep object detection via probabilistic modeling (ICCV 2021)

This is the repository for our paper Ditch the Gold Standard: Re-evaluating Conversational Question Answering

Official code repository for the work: "The Implicit Values of A Good Hand Shake: Handheld Multi-Frame Neural Depth Refinement"

Code for KHGT model, AAAI2021

Generative Autoregressive, Normalized Flows, VAEs, Score-based models (GANVAS)

This repo. is an implementation of ACFFNet, which is accepted for in Image and Vision Computing.

Train a deep learning net with OpenStreetMap features and satellite imagery.

Locally Constrained Self-Attentive Sequential Recommendation

LLVM-based compiler for LightGBM gradient-boosted trees. Speeds up prediction by ≥10x.

[IROS2021] NYU-VPR: Long-Term Visual Place Recognition Benchmark with View Direction and Data Anonymization Influences

Deep Reinforcement Learning for mobile robot navigation in ROS Gazebo simulator

Personal thermal comfort models using digital twins: Preference prediction with BIM-extracted spatial-temporal proximity data from Build2Vec

A library for efficient similarity search and clustering of dense vectors.

This is the official implementation for "Do Transformers Really Perform Bad for Graph Representation?".

Finite-temperature variational Monte Carlo calculation of uniform electron gas using neural canonical transformation.

Commonality in Natural Images Rescues GANs: Pretraining GANs with Generic and Privacy-free Synthetic Data
(CVPR 2022)