Simple Tensorflow implementation of Toward Spatially Unbiased Generative Models (ICCV 2021)

Last update: Apr 15, 2022

Overview

Spatial unbiased GANs — Simple TensorFlow Implementation [Paper]

: Toward Spatially Unbiased Generative Models (ICCV 2021)

Abstract Recent image generation models show remarkable generation performance. However, they mirror strong location preference in datasets, which we call spatial bias. Therefore, generators render poor samples at unseen locations and scales. We argue that the generators rely on their implicit positional encoding to render spatial content. From our observations, the generator’s implicit positional encoding is translation-variant, making the generator spatially biased. To address this issue, we propose injecting explicit positional encoding at each scale of the generator. By learning the spatially unbiased generator, we facilitate the robust use of generators in multiple tasks, such as GAN inversion, multi-scale generation, generation of arbitrary sizes and aspect ratios. Furthermore, we show that our method can also be applied to denoising diffusion probabilistic models.

Requirements

Tensorflow >= 2.x

Usage

├── dataset
   └── YOUR_DATASET_NAME
       ├── 000001.jpg 
       ├── 000002.png
       └── ...

Train

> python main.py --dataset FFHQ --phase train --img_size 256 --batch_size 4 --n_total_image 6400

Generate Video

> python generate_video.py

Results

FID: 3.81 (6.4M images(200k iterations), 8GPU, each 4 batch size)

Video

Uncuratd

Style mixing

It's worse than stylegan2.

Truncation trick

Reference

Author

Junho Kim

Simple Tensorflow implementation of Toward Spatially Unbiased Generative Models (ICCV 2021)

Related tags

Overview

Spatial unbiased GANs — Simple TensorFlow Implementation [Paper]

: Toward Spatially Unbiased Generative Models (ICCV 2021)

Requirements

Usage

Train

Generate Video

Results

Video

Uncuratd

Style mixing

Truncation trick

Reference

Author

Owner

Junho Kim

ImageNet Adversarial Image Evaluation

Code for Phase diagram of Stochastic Gradient Descent in high-dimensional two-layer neural networks

WORD: Revisiting Organs Segmentation in the Whole Abdominal Region

Decoding the Protein-ligand Interactions Using Parallel Graph Neural Networks

Advantage Actor Critic (A2C): jax + flax implementation

Pytorch implementation of Masked Auto-Encoder

Multiple Object Tracking with Yolov5!

Towards uncontrained hand-object reconstruction from RGB videos

【ACMMM 2021】DSANet: Dynamic Segment Aggregation Network for Video-Level Representation Learning

Label Studio is a multi-type data labeling and annotation tool with standardized output format

Python scripts for performing stereo depth estimation using the MobileStereoNet model in ONNX

Introduction to CPM

Jax/Flax implementation of Variational-DiffWave.

Code for our paper at ECCV 2020: Post-Training Piecewise Linear Quantization for Deep Neural Networks

PyTorch implementation of MICCAI 2018 paper "Liver Lesion Detection from Weakly-labeled Multi-phase CT Volumes with a Grouped Single Shot MultiBox Detector"

Official implementation of "Learning Forward Dynamics Model and Informed Trajectory Sampler for Safe Quadruped Navigation" (RSS 2022)

torchbearer: A model fitting library for PyTorch

Pytorch reimplementation of the Vision Transformer (An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale)

HALO: A Skeleton-Driven Neural Occupancy Representation for Articulated Hands

ACV is a python library that provides explanations for any machine learning model or data.