gans-awesome-applications

Curated list of awesome GAN applications and demonstrations.

Note: General GAN papers that target simple image generation, such as DCGAN and BEGAN, are not included in the list. I mainly care about applications.

Landmark papers that I respect:

  • Generative Adversarial Networks, [paper], [github]
  • Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks, [paper], [github]
  • Improved Techniques for Training GANs, [paper], [github]
  • BEGAN: Boundary Equilibrium Generative Adversarial Networks, [paper], [github]

Contents

Use this contents list, or simply press Command + F to search for a keyword.


Applications using GANs

Font generation

  • Learning Chinese Character style with conditional GAN, [blog], [github]
  • Artistic Glyph Image Synthesis via One-Stage Few-Shot Learning, [paper], [github]
  • Attribute2Font: Creating Fonts You Want From Attributes, [paper], [github]

Anime character generation

  • Towards the Automatic Anime Characters Creation with Generative Adversarial Networks, [paper]
  • [Project] A simple PyTorch Implementation of Generative Adversarial Networks, focusing on anime face drawing, [github]
  • [Project] A simple, clean TensorFlow implementation of Generative Adversarial Networks with a focus on modeling illustrations, [github]
  • [Project] Keras-GAN-Animeface-Character, [github]
  • [Project] A DCGAN to generate anime faces using custom mined dataset, [github]

Interactive Image generation

  • Generative Visual Manipulation on the Natural Image Manifold, [paper], [github]
  • Neural Photo Editing with Introspective Adversarial Networks, [paper], [github]

Text2Image (text to image)

  • TAC-GAN – Text Conditioned Auxiliary Classifier Generative Adversarial Network, [paper], [github]
  • StackGAN: Text to Photo-realistic Image Synthesis with Stacked Generative Adversarial Networks, [paper], [github]
  • Generative Adversarial Text to Image Synthesis, [paper], [github], [github]
  • Learning What and Where to Draw, [paper], [github]

3D Object generation

  • Parametric 3D Exploration with Stacked Adversarial Networks, [github], [youtube]
  • Learning a Probabilistic Latent Space of Object Shapes via 3D Generative-Adversarial Modeling, [paper], [github], [youtube]
  • 3D Shape Induction from 2D Views of Multiple Objects, [paper]
  • Fully Convolutional Refined Auto-Encoding Generative Adversarial Networks for 3D Multi Object Scenes, [github], [blog]

Image Editing

  • Invertible Conditional GANs for image editing, [paper], [github]
  • Image De-raining Using a Conditional Generative Adversarial Network, [paper], [github]

Face Aging

  • Age Progression/Regression by Conditional Adversarial Autoencoder, [paper], [github]
  • CAN: Creative Adversarial Networks Generating “Art” by Learning About Styles and Deviating from Style Norms, [paper]
  • Face Aging with Conditional Generative Adversarial Networks, [paper]

Human Pose Estimation

  • Joint Discriminative and Generative Learning for Person Re-identification, [paper], [github], [video]
  • Pose Guided Person Image Generation, [paper]

Domain-transfer (e.g. style-transfer, pix2pix, sketch2image)

  • Image-to-Image Translation with Conditional Adversarial Networks, [paper], [github], [youtube]
  • Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks, [paper], [github], [youtube]
  • Learning to Discover Cross-Domain Relations with Generative Adversarial Networks, [paper], [github]
  • Unsupervised Creation of Parameterized Avatars, [paper]
  • Unsupervised Cross-Domain Image Generation, [paper]
  • Precomputed Real-Time Texture Synthesis with Markovian Generative Adversarial Networks, [paper], [github]
  • Pixel-Level Domain Transfer, [paper], [github]
  • TextureGAN: Controlling Deep Image Synthesis with Texture Patches, [paper], [demo]
  • Vincent AI Sketch Demo Draws In Throngs at GTC Europe, [blog], [youtube]
  • Deep Photo Style Transfer, [paper], [github]

Image Inpainting (hole filling)

  • Context Encoders: Feature Learning by Inpainting, [paper], [github]
  • Semantic Image Inpainting with Perceptual and Contextual Losses, [paper], [github]
  • Semi-Supervised Learning with Context-Conditional Generative Adversarial Networks, [paper]
  • Generative Face Completion, [paper], [github]

Super-resolution

  • Image super-resolution through deep learning, [github]
  • Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network, [paper], [github]
  • High-Quality Face Image Super-Resolution Using Conditional Generative Adversarial Networks, [paper]
  • Analyzing Perception-Distortion Tradeoff using Enhanced Perceptual Super-resolution Network, [paper], [github]

Image Blending

  • GP-GAN: Towards Realistic High-Resolution Image Blending, [paper], [github]

High-resolution image generation (large-scale image)

  • Generating Large Images from Latent Vectors, [blog], [github]
  • Progressive Growing of GANs for Improved Quality, Stability, and Variation, [paper], [github]

Adversarial Examples (Defense vs Attack)

  • SafetyNet: Detecting and Rejecting Adversarial Examples Robustly, [paper]
  • Adversarial Examples for Generative Models, [paper]
  • Adversarial Examples Generation and Defense Based on Generative Adversarial Network, [paper]

Visual Saliency Prediction (attention prediction)

  • SalGAN: Visual Saliency Prediction with Generative Adversarial Networks, [paper], [github]

Object Detection/Recognition

  • Perceptual Generative Adversarial Networks for Small Object Detection, [paper]
  • Adversarial Generation of Training Examples for Vehicle License Plate Recognition, [paper]

Robotics

  • Unsupervised Pixel-Level Domain Adaptation with Generative Adversarial Networks, [paper], [github]

Video (generation/prediction)

  • Deep Multi-Scale Video Prediction Beyond Mean Square Error, [paper], [github]

Synthetic Data Generation

  • Learning from Simulated and Unsupervised Images through Adversarial Training, [paper], [github]

Others

  • (Physics) Learning Particle Physics by Example: Location-Aware Generative Adversarial Networks for Physics Synthesis, [paper], [github]
  • (Games) Style Transfer Generative Adversarial Networks: Learning to Play Chess Differently, [paper], [github]
  • (General) Spectral Normalization for Generative Adversarial Networks, [paper], [github]

Did not use GAN, but still interesting applications.

Real-time face reconstruction

  • Model-based Deep Convolutional Face Autoencoder for Unsupervised Monocular Reconstruction, [paper], [github], [youtube]

Super-resolution

Photorealistic Image generation (e.g. pix2pix, sketch2image)

Human Pose Estimation

  • Knowledge-Guided Deep Fractal Neural Networks for Human Pose Estimation, [paper], [github]

3D Object generation

  • 3D-R2N2: A Unified Approach for Single and Multi-view 3D Object Reconstruction, [paper], [github]

GAN tutorials with easy and simple example code for starters


Implementations of various types of GANs collection


Trendy AI-application Articles

Author

Minchul Shin, @nashory

Any recommendations to add to the list are welcome :)
Feel free to make pull requests!
