PyTorch implementation for reproducing StackGAN_v2 results in the paper StackGAN++: Realistic Image Synthesis with Stacked Generative Adversarial Networks

Last update: Dec 16, 2022

Overview

StackGAN-v2

PyTorch implementation for reproducing StackGAN_v2 results in the paper StackGAN++: Realistic Image Synthesis with Stacked Generative Adversarial Networks by Han Zhang*, Tao Xu*, Hongsheng Li, Shaoting Zhang, Xiaogang Wang, Xiaolei Huang, Dimitris Metaxas.

Dependencies

Python 2.7

PyTorch

In addition, please add the project folder to PYTHONPATH and pip install the following packages:

tensorboard
python-dateutil
easydict
pandas
torchfile
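
Before training, it can help to confirm that the packages above are importable in the active environment. The following is a minimal sketch and not part of the repository: the helper name `missing_packages` is hypothetical, and the import names differ from the package names in two cases (`torch` for PyTorch, `dateutil` for python-dateutil). It avoids Python 3 only syntax so that it should also run under Python 2.7.

```python
import importlib

# Import names for the dependencies listed above.
# PyTorch installs as "torch" and python-dateutil as "dateutil".
REQUIRED_MODULES = ["torch", "tensorboard", "dateutil", "easydict", "pandas", "torchfile"]


def missing_packages(module_names):
    """Return the subset of module_names that cannot be imported."""
    missing = []
    for name in module_names:
        try:
            importlib.import_module(name)
        except ImportError:
            # The module is not installed (or not on PYTHONPATH).
            missing.append(name)
    return missing
```

Calling `missing_packages(REQUIRED_MODULES)` returns an empty list when every listed dependency can be imported, and otherwise the names that still need to be installed.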

Data

Download our preprocessed char-CNN-RNN text embeddings for birds and save them to data/

[Optional] Follow the instructions in reedscot/icml2016 to download the pretrained char-CNN-RNN text encoders and extract text embeddings.

Download the birds image data and extract it to data/birds/
Download the ImageNet dataset and extract the images to data/imagenet/
Download the LSUN dataset and save the images to data/lsun
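
The directory layout described above can be checked with a short script before training. This is a sketch under stated assumptions: the helper name `missing_data_dirs` is hypothetical, and it only checks the three image directories named in the text, not the embedding files, whose exact names are not given here.

```python
import os

# Directories named in the data instructions above, relative to the project root.
EXPECTED_DATA_DIRS = ["data/birds", "data/imagenet", "data/lsun"]


def missing_data_dirs(project_root):
    """Return the expected data directories that do not exist under project_root."""
    missing = []
    for rel_path in EXPECTED_DATA_DIRS:
        if not os.path.isdir(os.path.join(project_root, rel_path)):
            missing.append(rel_path)
    return missing
```

Only the datasets you intend to train on need to be present, so a non-empty result is not necessarily an error.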

Training

Train a StackGAN-v2 model on the bird (CUB) dataset using our preprocessed embeddings:
- python main.py --cfg cfg/birds_3stages.yml --gpu 0
Train a StackGAN-v2 model on the ImageNet dog subset:
- python main.py --cfg cfg/dog_3stages_color.yml --gpu 0
Train a StackGAN-v2 model on the ImageNet cat subset:
- python main.py --cfg cfg/cat_3stages_color.yml --gpu 0
Train a StackGAN-v2 model on the LSUN bedroom subset:
- python main.py --cfg cfg/bedroom_3stages_color.yml --gpu 0
Train a StackGAN-v2 model on the LSUN church subset:
- python main.py --cfg cfg/church_3stages_color.yml --gpu 0
The *.yml files are example configuration files for training and evaluating our models.
If you want to try your own datasets, here are some good tips about how to train GANs. We also encourage you to try different hyper-parameters and architectures, especially for more complex datasets.
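
The training commands above differ only in the configuration file, so they can be generated from a small lookup table. This is a minimal sketch: the dataset labels and the function name `build_train_command` are hypothetical, while the config paths and the `--cfg` and `--gpu` flags are taken from the commands listed in this section.

```python
# Config files named in the training commands above, keyed by a short dataset label.
# The labels themselves are hypothetical; only the file paths come from the text.
TRAIN_CONFIGS = {
    "birds": "cfg/birds_3stages.yml",
    "dog": "cfg/dog_3stages_color.yml",
    "cat": "cfg/cat_3stages_color.yml",
    "bedroom": "cfg/bedroom_3stages_color.yml",
    "church": "cfg/church_3stages_color.yml",
}


def build_train_command(dataset, gpu_id=0):
    """Return the argument list for training on the given dataset."""
    if dataset not in TRAIN_CONFIGS:
        raise ValueError("Unknown dataset: {}".format(dataset))
    return ["python", "main.py", "--cfg", TRAIN_CONFIGS[dataset], "--gpu", str(gpu_id)]
```

The returned list can be passed to `subprocess.call` from the project folder, for example `subprocess.call(build_train_command("birds"))`.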

Pretrained Model

StackGAN-v2 for bird. Download and save it to models/ (the inception score for this model is 4.04±0.05)
StackGAN-v2 for dog. Download and save it to models/ (the inception score for this model is 9.55±0.11)
StackGAN-v2 for cat. Download and save it to models/
StackGAN-v2 for bedroom. Download and save it to models/
StackGAN-v2 for church. Download and save it to models/
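
For readers unfamiliar with the metric quoted above, the inception score is commonly defined as exp(E_x[KL(p(y|x) || p(y))]), where p(y|x) is a classifier's label distribution for one generated image and p(y) is the average of those distributions over all images. The following pure-Python sketch applies that formula to a small matrix of class probabilities. It is an illustration only, not the evaluation code used to obtain the scores above; the function name is hypothetical, and in practice the probabilities would come from an Inception network.

```python
import math


def inception_score(probs):
    """Compute exp(mean KL(p(y|x) || p(y))) for rows of class probabilities."""
    n = float(len(probs))
    num_classes = len(probs[0])
    # Marginal label distribution p(y): the average of the per-image rows.
    marginal = [sum(row[j] for row in probs) / n for j in range(num_classes)]
    total_kl = 0.0
    for row in probs:
        for p, m in zip(row, marginal):
            if p > 0:  # terms with p == 0 contribute nothing to the KL sum
                total_kl += p * math.log(p / m)
    return math.exp(total_kl / n)
```

Under this definition, identical predictions for every image give a score of 1, and confident predictions spread evenly over k classes give a score of k.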

Evaluating

Run python main.py --cfg cfg/eval_birds.yml --gpu 1 to generate samples from captions in birds validation set.
Change the eval_*.yml files to generate images from other pre-trained models.

Examples generated by StackGAN-v2

t-SNE visualization of randomly generated birds, dogs, cats, churches and bedrooms

Citing StackGAN++

If you find StackGAN useful in your research, please consider citing:

@article{Han17stackgan2,
  author    = {Han Zhang and Tao Xu and Hongsheng Li and Shaoting Zhang and Xiaogang Wang and Xiaolei Huang and Dimitris Metaxas},
  title     = {StackGAN++: Realistic Image Synthesis with Stacked Generative Adversarial Networks},
  journal   = {arXiv: 1710.10916},
  year      = {2017},
}

@inproceedings{han2017stackgan,
  author    = {Han Zhang and Tao Xu and Hongsheng Li and Shaoting Zhang and Xiaogang Wang and Xiaolei Huang and Dimitris Metaxas},
  title     = {StackGAN: Text to Photo-realistic Image Synthesis with Stacked Generative Adversarial Networks},
  year      = {2017},
  booktitle = {{ICCV}},
}

Our follow-up work

AttnGAN: Fine-Grained Text to Image Generation with Attentional Generative Adversarial Networks [Supplementary][code]

References

Generative Adversarial Text-to-Image Synthesis [Paper] [Code]
Learning Deep Representations of Fine-grained Visual Descriptions [Paper] [Code]

Owner

Han Zhang
