Curated list of awesome GAN applications and demo

Overview

banner

gans-awesome-applications

Curated list of awesome GAN applications and demonstrations.

Note: General GAN papers targeting simple image generation such as DCGAN, BEGAN etc. are not included in the list. I mainly care about applications.

The landmark papers that I respect.

  • Generative Adversarial Networks, [paper], [github]
  • Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks, [paper], [github]
  • Improved Techniques for Training GANs, [paper], [github]
  • BEGAN: Boundary Equilibrium Generative Adversarial Networks, [paper], [github]

Contents

Use this contents list or simply press command + F to search for a keyword


Applications using GANs

Font generation

  • Learning Chinese Character style with conditional GAN, [blog], [github]
  • Artistic Glyph Image Synthesis via One-Stage Few-Shot Learning, [paper], [github]
  • Attribute2Font: Creating Fonts You Want From Attributes, [paper], [github]

Anime character generation

  • Towards the Automatic Anime Characters Creation with Generative Adversarial Networks, [paper]
  • [Project] A simple PyTorch Implementation of Generative Adversarial Networks, focusing on anime face drawing, [github]
  • [Project] A simple, clean TensorFlow implementation of Generative Adversarial Networks with a focus on modeling illustrations, [github]
  • [Project] Keras-GAN-Animeface-Character, [github]
  • [Project] A DCGAN to generate anime faces using custom mined dataset, [github]

Interactive Image generation

  • Generative Visual Manipulation on the Natural Image Manifold, [paper], [github]
  • Neural Photo Editing with Introspective Adversarial Networks, [paper], [github]

Text2Image (text to image)

  • TAC-GAN – Text Conditioned Auxiliary Classifier Generative Adversarial Network, [paper], [github]
  • StackGAN: Text to Photo-realistic Image Synthesis with Stacked Generative Adversarial Networks, [paper], [github]
  • Generative Adversarial Text to Image Synthesis, [paper], [github], [github]
  • Learning What and Where to Draw, [paper], [github]

3D Object generation

  • Parametric 3D Exploration with Stacked Adversarial Networks, [github], [youtube]
  • Learning a Probabilistic Latent Space of Object Shapes via 3D Generative-Adversarial Modeling, [paper], [github], [youtube]
  • 3D Shape Induction from 2D Views of Multiple Objects, [paper]
  • Fully Convolutional Refined Auto-Encoding Generative Adversarial Networks for 3D Multi Object Scenes, [github], [blog]

Image Editing

  • Invertible Conditional GANs for image editing, [paper], [github]
  • Image De-raining Using a Conditional Generative Adversarial Network, [paper], [github]

Face Aging

  • Age Progression/Regression by Conditional Adversarial Autoencoder, [paper], [github]
  • CAN: Creative Adversarial Networks Generating “Art” by Learning About Styles and Deviating from Style Norms, [paper]
  • FACE AGING WITH CONDITIONAL GENERATIVE ADVERSARIAL NETWORKS, [paper]

Human Pose Estimation

  • Joint Discriminative and Generative Learning for Person Re-identification, [paper], [github], [video]
  • Pose Guided Person Image Generation, [paper]

Domain-transfer (e.g. style-transfer, pix2pix, sketch2image)

  • Image-to-Image Translation with Conditional Adversarial Networks, [paper], [github], [youtube]
  • Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks, [paper], [github], [youtube]
  • Learning to Discover Cross-Domain Relations with Generative Adversarial Networks, [paper], [github]
  • Unsupervised Creation of Parameterized Avatars, [paper]
  • UNSUPERVISED CROSS-DOMAIN IMAGE GENERATION, [paper]
  • Precomputed Real-Time Texture Synthesis with Markovian Generative Adversarial Networks, [paper], [github]
  • Pixel-Level Domain Transfer [paper], [github]
  • TextureGAN: Controlling Deep Image Synthesis with Texture Patches, [paper], [demo]
  • Vincent AI Sketch Demo Draws In Throngs at GTC Europe, [blog], [youtube]
  • Deep Photo Style Transfer, [paper], [github]

Image Inpainting (hole filling)

  • Context Encoders: Feature Learning by Inpainting, [paper], [github]
  • Semantic Image Inpainting with Perceptual and Contextual Losses, [paper], [github]
  • SEMI-SUPERVISED LEARNING WITH CONTEXT-CONDITIONAL GENERATIVE ADVERSARIAL NETWORKS, [paper]
  • Generative Face Completion, [paper], [github]

Super-resolution

  • Image super-resolution through deep learning, [github]
  • Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network, [paper], [github]
  • High-Quality Face Image Super-Resolution Using Conditional Generative Adversarial Networks, [paper]
  • Analyzing Perception-Distortion Tradeoff using Enhanced Perceptual Super-resolution Network, [paper], [github]

Image Blending

  • GP-GAN: Towards Realistic High-Resolution Image Blending, [paper], [github]

High-resolution image generation (large-scale image)

  • Generating Large Images from Latent Vectors, [blog], [github]
  • PROGRESSIVE GROWING OF GANS FOR IMPROVED QUALITY, STABILITY, AND VARIATION, [paper], [github]

Adversarial Examples (Defense vs Attack)

  • SafetyNet: Detecting and Rejecting Adversarial Examples Robustly, [paper]
  • ADVERSARIAL EXAMPLES FOR GENERATIVE MODELS, [paper]
  • Adversarial Examples Generation and Defense Based on Generative Adversarial Network, [paper]

Visual Saliency Prediction (attention prediction)

  • SalGAN: Visual Saliency Prediction with Generative Adversarial Networks, [paper], [github]

Object Detection/Recognition

  • Perceptual Generative Adversarial Networks for Small Object Detection, [paper]
  • Adversarial Generation of Training Examples for Vehicle License Plate Recognition, [paper]

Robotics

  • Unsupervised Pixel–Level Domain Adaptation with Generative Adversarial Networks, [paper], [github]

Video (generation/prediction)

  • DEEP MULTI-SCALE VIDEO PREDICTION BEYOND MEAN SQUARE ERROR, [paper], [github]

Synthetic Data Generation

  • Learning from Simulated and Unsupervised Images through Adversarial Training, [paper], [github]

Others

  • (Physics) Learning Particle Physics by Example: Location-Aware Generative Adversarial Networks for Physics Synthesis, [paper], [github]
  • (Games) STYLE TRANSFER GENERATIVE ADVERSARIAL NETWORKS: LEARNING TO PLAY CHESS DIFFERENTLY, [paper], [github]
  • (General) Spectral Normalization for Generative Adversarial Networks, [paper], [github]

Did not use GAN, but still interesting applications.

Real-time face reconstruction

  • Model-based Deep Convolutional Face Autoencoder for Unsupervised Monocular Reconstruction, [paper], [github], [youtube]

Super-resolution

Photorealistic Image generation (e.g. pix2pix, sketch2image)

Human Pose Estimation

  • Knowledge-Guided Deep Fractal Neural Networks for Human Pose Estimation, [paper], [github]

3D Object generation

  • 3D-R2N2: A Unified Approach for Single and Multi-view 3D Object Reconstruction, [paper], [github]

GAN tutorials with easy and simple example code for starters


Implementations of various types of GANs collection


Trendy AI-application Articles

Author

Minchul Shin, @nashory

Any recommendations to add to the list are welcome :)
Feel free to make pull requests!

Owner
Minchul Shin
Deep Learning, Computer Vision | Research Scientist at kakaobrain (2021-present) | ex-SWE at NAVER (2017-2021)
Minchul Shin
AI grand challenge 2020 Repo (Speech Recognition Track)

KorBERT를 활용한 한국어 텍스트 기반 위협 상황인지(2020 인공지능 그랜드 챌린지) 본 프로젝트는 ETRI에서 제공된 한국어 korBERT 모델을 활용하여 폭력 기반 한국어 텍스트를 분류하는 다양한 분류 모델들을 제공합니다. 본 개발자들이 참여한 2020 인공지

Young-Seok Choi 23 Jan 25, 2022
CAPITAL: Optimal Subgroup Identification via Constrained Policy Tree Search

CAPITAL: Optimal Subgroup Identification via Constrained Policy Tree Search This repository is the official implementation of CAPITAL: Optimal Subgrou

Hengrui Cai 0 Oct 19, 2021
An Agnostic Computer Vision Framework - Pluggable to any Training Library: Fastai, Pytorch-Lightning with more to come

IceVision is the first agnostic computer vision framework to offer a curated collection with hundreds of high-quality pre-trained models from torchvision, MMLabs, and soon Pytorch Image Models. It or

airctic 789 Dec 29, 2022
A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more

Alpha Zero General (any game, any framework!) A simplified, highly flexible, commented and (hopefully) easy to understand implementation of self-play

Surag Nair 3.1k Jan 05, 2023
Scalable Optical Flow-based Image Montaging and Alignment

SOFIMA SOFIMA (Scalable Optical Flow-based Image Montaging and Alignment) is a tool for stitching, aligning and warping large 2d, 3d and 4d microscopy

Google Research 16 Dec 21, 2022
RM Operation can equivalently convert ResNet to VGG, which is better for pruning; and can help RepVGG perform better when the depth is large.

RM Operation can equivalently convert ResNet to VGG, which is better for pruning; and can help RepVGG perform better when the depth is large.

184 Jan 04, 2023
This application is the basic of automated online-class-joiner(for YıldızEdu) within the right time. Gets the ZOOM link by scheduled date and time.

This application is the basic of automated online-class-joiner(for YıldızEdu) within the right time. Gets the ZOOM link by scheduled date and time.

215355 1 Dec 16, 2021
Composable transformations of Python+NumPy programsComposable transformations of Python+NumPy programs

Chex Chex is a library of utilities for helping to write reliable JAX code. This includes utils to help: Instrument your code (e.g. assertions) Debug

DeepMind 506 Jan 08, 2023
The pyrelational package offers a flexible workflow to enable active learning with as little change to the models and datasets as possible

pyrelational is a python active learning library developed by Relation Therapeutics for rapidly implementing active learning pipelines from data management, model development (and Bayesian approximat

Relation Therapeutics 95 Dec 27, 2022
Unsupervised Attributed Multiplex Network Embedding (AAAI 2020)

Unsupervised Attributed Multiplex Network Embedding (DMGI) Overview Nodes in a multiplex network are connected by multiple types of relations. However

Chanyoung Park 114 Dec 06, 2022
Repo for 2021 SDD assessment task 2, by Felix, Anna, and James.

SoftwareTask2 Repo for 2021 SDD assessment task 2, by Felix, Anna, and James. File/folder structure: helloworld.py - demonstrates various map backgrou

3 Dec 13, 2022
Bling's Object detection tool

BriVL for Building Applications This repo is used for illustrating how to build applications by using BriVL model. This repo is re-implemented from fo

chuhaojin 47 Nov 01, 2022
A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation

Segnet is deep fully convolutional neural network architecture for semantic pixel-wise segmentation. This is implementation of http://arxiv.org/pdf/15

Pradyumna Reddy Chinthala 190 Dec 15, 2022
performing moving objects segmentation using image processing techniques with opencv and numpy

Moving Objects Segmentation On this project I tried to perform moving objects segmentation using background subtraction technique. the introduced meth

Mohamed Magdy 15 Dec 12, 2022
HNN: Human (Hollywood) Neural Network

HNN: Human (Hollywood) Neural Network Learn the top 1000 actors on IMDB with your very own low cost, highly parallel, CUDAless biological neural netwo

Madhava Jay 0 Dec 21, 2021
Official implementation of Long-Short Transformer in PyTorch.

Long-Short Transformer (Transformer-LS) This repository hosts the code and models for the paper: Long-Short Transformer: Efficient Transformers for La

NVIDIA Corporation 198 Dec 29, 2022
A few stylization coreML models that I've trained with CreateML

CoreML-StyleTransfer A few stylization coreML models that I've trained with CreateML You can open and use the .mlmodel files in the "models" folder in

Doron Adler 8 Aug 18, 2022
Robocop is your personal mini voice assistant made using Python.

Robocop-VoiceAssistant To use this project, you should have python installed in your system. If you don't have python installed, install it beforehand

Sohil Khanduja 3 Feb 26, 2022
Trading Gym is an open source project for the development of reinforcement learning algorithms in the context of trading.

Trading Gym Trading Gym is an open-source project for the development of reinforcement learning algorithms in the context of trading. It is currently

Dimitry Foures 535 Nov 15, 2022
Keras-1D-NN-Classifier

Keras-1D-NN-Classifier This code is based on the reference codes linked below. reference 1, reference 2 This code is for 1-D array data classification

Jae-Hoon Shim 6 May 18, 2021