The implementation of 'Image synthesis via semantic composition'.

Last update: Jan 06, 2023

Overview

Image synthesis via semantic composition [Project Page]

by Yi Wang, Lu Qi, Ying-Cong Chen, Xiangyu Zhang, Jiaya Jia.

Introduction

This repository provides the implementation of the semantic image synthesis method from our ICCV 2021 paper, 'Image synthesis via semantic composition'.

Our framework

Usage

git clone https://github.com/dvlab-research/SCGAN.git
cd SCGAN/code

To use this code, please install PyTorch 1.0 and Python 3+. Other dependencies can be installed with

pip install -r requirements.txt

Dataset Preparation

Please refer to SPADE for the detailed dataset preparation steps.

Testing

Download the pretrained models, then put the folder containing the model weights in the folder ./checkpoints.
Produce images with the pretrained models:

python test.py --gpu_ids 0,1,2,3 --dataset_mode [dataset] --config config/scgan_[dataset]_test.yml --fid --gt [gt_path] --visual_n 1

For example,

python test.py --gpu_ids 0,1,2,3 --dataset_mode celeba --config config/scgan_celeba-test.yml --fid --gt /data/datasets/celeba --visual_n 1

Visual results are stored at ./results/scgan_[dataset]/ by default.

Pretrained Models (to be updated)

Dataset	Download link
CelebAMask-HQ	Baidu Disk (Code: face)

Training

Use train.sh to train new models. Alternatively, you can specify training options in config/[config_file].yml.

Key operators

Our proposed dynamic computation units (spatial conditional convolution and normalization) are extended from conditionally parameterized convolutions [1]. We generalize the scalar condition into a spatial one and also apply these techniques to normalization.
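The step from a scalar condition to a spatial one can be illustrated with a small sketch. It is not the code in this repository: it is written in plain Python rather than PyTorch, uses a 1x1 convolution only, and the names `mix_kernels`, `condconv_scalar`, `condconv_spatial`, and `alpha_map` are hypothetical. The mixing weights are assumed to be given; how they are predicted from the semantic input is not shown here.

```python
def mix_kernels(experts, alphas):
    """Return sum_k alphas[k] * experts[k] for kernels shaped [C_out][C_in]."""
    c_out, c_in = len(experts[0]), len(experts[0][0])
    return [[sum(a * w[o][i] for a, w in zip(alphas, experts))
             for i in range(c_in)] for o in range(c_out)]


def pointwise_conv(kernel, pixel):
    """Apply a 1x1 convolution kernel [C_out][C_in] to one pixel [C_in]."""
    return [sum(w * x for w, x in zip(row, pixel)) for row in kernel]


def condconv_scalar(feature, experts, alphas):
    """CondConv-style: one mixing vector shared by the whole feature map."""
    kernel = mix_kernels(experts, alphas)
    return [[pointwise_conv(kernel, px) for px in row] for row in feature]


def condconv_spatial(feature, experts, alpha_map):
    """Spatial variant (assumed form): alpha_map[y][x] is a mixing vector per pixel."""
    return [[pointwise_conv(mix_kernels(experts, alpha_map[y][x]), px)
             for x, px in enumerate(row)]
            for y, row in enumerate(feature)]


# Toy data: 2 expert kernels, 2 input channels, 1 output channel, a 1x2 feature map.
experts = [[[1.0, 0.0]], [[0.0, 1.0]]]   # expert 0 reads channel 0, expert 1 reads channel 1
feature = [[[3.0, 5.0], [7.0, 11.0]]]    # feature[y][x] is a channel vector
alpha_map = [[[1.0, 0.0], [0.0, 1.0]]]   # left pixel uses expert 0, right pixel uses expert 1

scalar_out = condconv_scalar(feature, experts, [1.0, 0.0])
spatial_out = condconv_spatial(feature, experts, alpha_map)
print(scalar_out)   # [[[3.0], [7.0]]]
print(spatial_out)  # [[[3.0], [11.0]]]
```

With a scalar condition every pixel is filtered by the same mixed kernel, while the spatial condition lets each location select its own mixture. When `alpha_map` holds the same vector at every pixel, the spatial variant reduces to the scalar one.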

Citation

If our research is useful for you, please consider citing:

@inproceedings{wang2021image,
  title={Image Synthesis via Semantic Composition},
  author={Wang, Yi and Qi, Lu and Chen, Ying-Cong and Zhang, Xiangyu and Jia, Jiaya},
  booktitle={ICCV},
  year={2021}
}

Acknowledgements

This code is built upon SPADE, Imaginaire, and PyTorch-FID.

Reference

[1] Brandon Yang, Gabriel Bender, Quoc V. Le, and Jiquan Ngiam. CondConv: Conditionally parameterized convolutions for efficient inference. In NeurIPS, 2019.

Contact

Please send email to [email protected].

