The code for the NeurIPS 2021 paper "A Unified View of cGANs with and without Classifiers".

Last update: May 28, 2022

Overview

Energy-based Conditional Generative Adversarial Network (ECGAN)

This is the code for the NeurIPS 2021 paper "A Unified View of cGANs with and without Classifiers". The repository is modified from StudioGAN. If you find our work useful, please consider citing the following paper:

@inproceedings{chen2021ECGAN,
  title   = {A Unified View of cGANs with and without Classifiers},
  author  = {Si-An Chen and Chun-Liang Li and Hsuan-Tien Lin},
  booktitle = {Advances in Neural Information Processing Systems},
  year    = {2021}
}

Please feel free to contact Si-An Chen if you have any questions about the code/paper.

Introduction

We propose a new Conditional Generative Adversarial Network (cGAN) framework called Energy-based Conditional Generative Adversarial Network (ECGAN) which provides a unified view of cGANs and achieves state-of-the-art results. We use the decomposition of the joint probability distribution to connect the goals of cGANs and classification as a unified framework. The framework, along with a classic energy model to parameterize distributions, justifies the use of classifiers for cGANs in a principled manner. It explains several popular cGAN variants, such as ACGAN, ProjGAN, and ContraGAN, as special cases with different levels of approximations. An illustration of the framework is shown below.

Requirements

Anaconda
Python >= 3.6
6.0.0 <= Pillow <= 7.0.0
scipy == 1.1.0 (Recommended for fast loading of Inception Network)
sklearn
seaborn
h5py
tqdm
torch >= 1.6.0 (Recommended for mixed precision training and knn analysis)
torchvision >= 0.7.0
tensorboard
5.4.0 <= gcc <= 7.4.0 (Recommended for proper use of adaptive discriminator augmentation module)

You can install the recommended environment as follows:

conda env create -f environment.yml -n studiogan

With docker, you can use:

docker pull mgkang/studiogan:0.1

Quick Start

Train (-t) and evaluate (-e) the model defined in CONFIG_PATH using GPU 0

CUDA_VISIBLE_DEVICES=0 python3 src/main.py -t -e -c CONFIG_PATH

Train (-t) and evaluate (-e) the model defined in CONFIG_PATH using GPUs (0, 1, 2, 3) and DataParallel

CUDA_VISIBLE_DEVICES=0,1,2,3 python3 src/main.py -t -e -c CONFIG_PATH

Try python3 src/main.py to see available options.

Dataset

CIFAR10: StudioGAN will automatically download the dataset once you execute main.py.
Tiny Imagenet, Imagenet, or a custom dataset:
1. download Tiny Imagenet and Imagenet. Prepare your own dataset.
2. make the folder structure of the dataset as follows:

┌── docs
├── src
└── data
    └── ILSVRC2012 or TINY_ILSVRC2012 or CUSTOM
        ├── train
        │   ├── cls0
        │   │   ├── train0.png
        │   │   ├── train1.png
        │   │   └── ...
        │   ├── cls1
        │   └── ...
        └── valid
            ├── cls0
            │   ├── valid0.png
            │   ├── valid1.png
            │   └── ...
            ├── cls1
            └── ...

Examples and Results

The src/configs directory contains config files used in our experiments.

CIFAR10 (3x32x32)

To train and evaluate ECGAN-UC on CIFAR10:

python3 src/main.py -t -e -c src/configs/CIFAR10/ecgan_v2_none_0_0p01.json

Method	Reference	IS(⭡)	FID(⭣)	F_1/8(⭡)	F_8(⭡)	Cfg	Log	Weights
BigGAN-Mod	StudioGAN	9.746	8.034	0.995	0.994	-	-	-
ContraGAN	StudioGAN	9.729	8.065	0.993	0.992	-	-	-
Ours	-	10.078	7.936	0.990	0.988	Cfg	Log	Link

Tiny ImageNet (3x64x64)

To train and evaluate ECGAN-UC on Tiny ImageNet:

python3 src/main.py -t -e -c src/configs/TINY_ILSVRC2012/ecgan_v2_none_0_0p01.json --eval_type valid

Method	Reference	IS(⭡)	FID(⭣)	F_1/8(⭡)	F_8(⭡)	Cfg	Log	Weights
BigGAN-Mod	StudioGAN	11.998	31.92	0.956	0.879	-	-	-
ContraGAN	StudioGAN	13.494	27.027	0.975	0.902	-	-	-
Ours	-	18.445	18.319	0.977	0.973	Cfg	Log	Link

ImageNet (3x128x128)

To train and evaluate ECGAN-UCE on ImageNet (~12 days on 8 NVIDIA V100 GPUs):

python3 src/main.py -t -e -l -sync_bn -c src/configs/ILSVRC2012/imagenet_ecgan_v2_contra_1_0p05.json --eval_type valid

Method	Reference	IS(⭡)	FID(⭣)	F_1/8(⭡)	F_8(⭡)	Cfg	Log	Weights
BigGAN	StudioGAN	28.633	24.684	0.941	0.921	-	-	-
ContraGAN	StudioGAN	25.249	25.161	0.947	0.855	-	-	-
Ours	-	80.685	8.491	0.984	0.985	Cfg	Log	Link

Generated Images

Here are some selected images generated by ECGAN.

The code for the NeurIPS 2021 paper "A Unified View of cGANs with and without Classifiers".

Related tags

Overview

Energy-based Conditional Generative Adversarial Network (ECGAN)

Introduction

Requirements

Quick Start

Dataset

Examples and Results

CIFAR10 (3x32x32)

Tiny ImageNet (3x64x64)

ImageNet (3x128x128)

Generated Images

Owner

sianchen

Image restoration with neural networks but without learning.

Bayesian Image Reconstruction using Deep Generative Models

Colossal-AI: A Unified Deep Learning System for Large-Scale Parallel Training

A Python library for Deep Graph Networks

ObjDetApp deploys a pytorch model for object detection

A PyTorch implementation of Multi-digit Number Recognition from Street View Imagery using Deep Convolutional Neural Networks

Code for unmixing audio signals in four different stems "drums, bass, vocals, others". The code is adapted from "Jukebox: A Generative Model for Music"

ActNN: Reducing Training Memory Footprint via 2-Bit Activation Compressed Training

MicroNet: Improving Image Recognition with Extremely Low FLOPs (ICCV 2021)

PyTorch implementation of Convolutional Neural Fabrics http://arxiv.org/abs/1606.02492

A curated list of awesome projects and resources related fastai

Official implementation of the paper Vision Transformer with Progressive Sampling, ICCV 2021.

The source code of the ICCV2021 paper "PIRenderer: Controllable Portrait Image Generation via Semantic Neural Rendering"

PyTorch implementations of neural network models for keyword spotting

Tensorflow-Project-Template - A best practice for tensorflow project template architecture.

A naive ROS interface for visualDet3D.

ML-PersonalWork - Big assignment PersonalWork in Machine Learning, 2021 autumn BUAA.

FedMM: Saddle Point Optimization for Federated Adversarial Domain Adaptation

Cross-lingual Transfer for Speech Processing using Acoustic Language Similarity

Neural Scene Flow Fields for Space-Time View Synthesis of Dynamic Scenes

The code for the NeurIPS 2021 paper "A Unified View of cGANs with and without Classifiers".

Related tags

Overview

Energy-based Conditional Generative Adversarial Network (ECGAN)

Introduction

Requirements

Quick Start

Dataset

Examples and Results

CIFAR10 (3x32x32)

Tiny ImageNet (3x64x64)

ImageNet (3x128x128)

Generated Images

Owner

sianchen

Image restoration with neural networks but without learning.

Bayesian Image Reconstruction using Deep Generative Models

Colossal-AI: A Unified Deep Learning System for Large-Scale Parallel Training

A Python library for Deep Graph Networks

*ObjDetApp* deploys a pytorch model for object detection

A PyTorch implementation of Multi-digit Number Recognition from Street View Imagery using Deep Convolutional Neural Networks

Code for unmixing audio signals in four different stems "drums, bass, vocals, others". The code is adapted from "Jukebox: A Generative Model for Music"

ActNN: Reducing Training Memory Footprint via 2-Bit Activation Compressed Training

MicroNet: Improving Image Recognition with Extremely Low FLOPs (ICCV 2021)

PyTorch implementation of Convolutional Neural Fabrics http://arxiv.org/abs/1606.02492

A curated list of awesome projects and resources related fastai

Official implementation of the paper Vision Transformer with Progressive Sampling, ICCV 2021.

The source code of the ICCV2021 paper "PIRenderer: Controllable Portrait Image Generation via Semantic Neural Rendering"

PyTorch implementations of neural network models for keyword spotting

Tensorflow-Project-Template - A best practice for tensorflow project template architecture.

A naive ROS interface for visualDet3D.

ML-PersonalWork - Big assignment PersonalWork in Machine Learning, 2021 autumn BUAA.

FedMM: Saddle Point Optimization for Federated Adversarial Domain Adaptation

Cross-lingual Transfer for Speech Processing using Acoustic Language Similarity

Neural Scene Flow Fields for Space-Time View Synthesis of Dynamic Scenes

ObjDetApp deploys a pytorch model for object detection