SMIS - Semantically Multi-modal Image Synthesis (CVPR 2020)

Last update: Dec 01, 2022

Overview

Semantically Multi-modal Image Synthesis

Project page / Paper / Demo

Semantically Multi-modal Image Synthesis (CVPR 2020).
Zhen Zhu, Zhiliang Xu, Ansheng You, Xiang Bai

Requirements

torch>=1.0.0
torchvision
dominate
dill
scikit-image
tqdm
opencv-python
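
To confirm that the packages above are available before running anything, a small check like the following may help. It is only a sketch: the helper name missing_modules is made up for illustration, the import names (skimage for scikit-image, cv2 for opencv-python) are the usual ones for those packages, and the torch>=1.0.0 version constraint is not verified here.

```python
import importlib.util

# Import names for the packages listed under Requirements.
# Note: scikit-image is imported as "skimage", opencv-python as "cv2".
REQUIRED = ["torch", "torchvision", "dominate", "dill", "skimage", "tqdm", "cv2"]


def missing_modules(names):
    """Return the subset of top-level module names that cannot be found."""
    return [name for name in names if importlib.util.find_spec(name) is None]


# Prints an empty list when everything is installed.
print("Missing modules:", missing_modules(REQUIRED))
```

Any name that is printed can be installed with pip using the package name from the list above.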

Getting Started

Data Preparation

DeepFashion
Note: We provide an example of the DeepFashion dataset. It differs slightly from the DeepFashion dataset used in our paper because of the impact of COVID-19.

Cityscapes
The Cityscapes dataset can be downloaded here.

ADE20K
The ADE20K dataset can be downloaded here.

Test/Train the models

Download the tar archive of the pretrained models from the Google Drive folder, save it in checkpoints/, and extract it. The scripts folder contains deepfashion.sh, cityscapes.sh and ade20k.sh. Adjust parameters such as --dataroot for your setup, then comment or uncomment the relevant lines in the script to test or train a model. For the SMIS test, you can also specify --test_mask.
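
If you prefer to extract the archive from Python rather than from the shell, something along these lines should work. This is a minimal sketch, not part of the repository: the function name extract_checkpoints and the archive file name in the usage note are placeholders, so substitute the name of the file you actually downloaded.

```python
import tarfile
from pathlib import Path


def extract_checkpoints(archive_path, dest="checkpoints"):
    """Extract a tar archive into dest and return the member names it contains."""
    dest_dir = Path(dest)
    dest_dir.mkdir(parents=True, exist_ok=True)
    # tarfile.open detects compression (plain tar, gzip, ...) automatically.
    with tarfile.open(archive_path) as tar:
        names = tar.getnames()
        # Use the safer "data" extraction filter where this Python version has it.
        kwargs = {"filter": "data"} if hasattr(tarfile, "data_filter") else {}
        tar.extractall(dest_dir, **kwargs)
    return names
```

For example, extract_checkpoints("checkpoints/pretrained_models.tar") would unpack a file of that (hypothetical) name into checkpoints/ and return the list of extracted paths.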

Acknowledgments

Our code is based on the popular SPADE.
