Can We Find Neurons that Cause Unrealistic Images in Deep Generative Networks?

Last update: Dec 20, 2022

Overview

Can We Find Neurons that Cause Unrealistic Images in Deep Generative Networks?

Artifact Detection/Correction - Offcial PyTorch Implementation

This repo provides the official PyTorch implementation of the following paper:

Can We Find Neurons that Cause Unrealistic Images in Deep Generative Networks?
Hwanil Choi, Wonjoon Chang, Jaesik Choi*
Korea Advanced Institute of Science and Technology, KAIST

Abstract
Even though image generation with Generative Adversarial Networks (GANs) has been showing remarkable ability to generate high-quality images, GANs do not always guarantee photorealistic images will be generated. Sometimes they generate images that have defective or unnatural objects, which are referred to as 'artifacts'. Research to determine why the artifacts emerge and how they can be detected and removed has not been sufficiently carried out. To analyze this, we first hypothesize that rarely activated neurons and frequently activated neurons have different purposes and responsibilities for the progress of generating images. By analyzing the statistics and the roles for those neurons, we empirically show that rarely activated neurons are related to failed results of making diverse objects and lead to artifacts. In addition, we suggest a correction method, called 'sequential ablation', to repair the defective part of the generated images without complex computational cost and manual efforts.
https://arxiv.org/abs/1812.04948

Dependencies

PyTorch 1.4.0
python 3.6
cuda 10.0.x
cudnn 7.6.3

Pre-Trained Models (Official) - GenForce

Dataset \ Model	PGGAN	StyleGAN2
CelebA-HQ (Official)	1024 x 1024	X
FFHQ (Official)	X	1024 X 1024
LSUN-Church (Official)	256 x 256	256 x 256
LSUN-CAT (Official)	256 x 256	256 x 256

For following implementation, download StyleGAN2 FFHQ weights in current directory. Otherwise, you should change the '--weight_path' options to your directory.

More pre-trained weights are available in genforce-model-zoo

optional : StyleGAN3

Implementation

Options

optional arguments:
  -h, --help                show this help message and exit
  --gpu GPU                 gpu index numper
  --batch_size BATCH_SIZE
                            batch size for pre processing and generating process
  --sample_size SAMPLE_SIZE
                            sample size for statistics
  --freq_path FREQ_PATH
                            loading saved frequencies of neurons
  --model MODEL             pggan, styelgan2
  --dataset DATASET         ffhq, cat, church, etc
  --resolution RESOLUTION
                            dataset resolution
  --weight_path WEIGHT_PATH
                            pre-trained weight path
  --detection DETECTION
                            implement normal/artifact detection
  --correction CORRECTION
                            implement correction task

Usage

python main.py --gpu 0 --batch_size 30 --sample_size 30000 --freq_pth ./stats \
               --model stylegan2 --dataset ffhq --resolution 1024 --weight_path ./ \
               --detection True --correction True

If you are on remote server, then to show the results, you should do the following. (X11 forwarding).

X11 forwarding

You can also implement our codes in 'Jupyter Notebook' that has more degree of freedom. Use the 'notebook.ipynb' file.

Can We Find Neurons that Cause Unrealistic Images in Deep Generative Networks?

Related tags

Overview

Can We Find Neurons that Cause Unrealistic Images in Deep Generative Networks?

Artifact Detection/Correction - Offcial PyTorch Implementation

Dependencies

Pre-Trained Models (Official) - GenForce

Implementation

Detection results for 50K samples

Bottom 60 images

Top 60 images

Correction results

Owner

CHOI HWAN IL

Python Computer Vision Aim Bot for Roblox's Phantom Forces

A Joint Video and Image Encoder for End-to-End Retrieval

Official code for "Bridging Video-text Retrieval with Multiple Choice Questions", CVPR 2022 (Oral).

Awesome Spectral Indices in Python.

Using computer vision method to recognize and calcutate the features of the architecture.

Rubik's Cube in pygame with OpenGL

(CVPR 2021) ST3D: Self-training for Unsupervised Domain Adaptation on 3D Object Detection

Deep learning based page layout analysis

📷 This repository is focused on having various feature implementation of OpenCV in Python.

In this project we will be using the live feed coming from the webcam to create a virtual mouse with complete functionalities.

With the virtual keyboard, you can write on the real time images by combining the thumb and index fingers on the letter you want.

This is the open source implementation of the ICLR2022 paper "StyleNeRF: A Style-based 3D-Aware Generator for High-resolution Image Synthesis"

Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML.

Roboflow makes managing, preprocessing, augmenting, and versioning datasets for computer vision seamless.

Virtual Zoom Gesture using OpenCV

Multi-Oriented Scene Text Detection via Corner Localization and Region Segmentation

Reference Code for AAAI-20 paper "Multi-Stage Self-Supervised Learning for Graph Convolutional Networks on Graphs with Few Labels"

Let's explore how we can extract text from forms

Image processing in Python

Text modding tools for FF7R (Final Fantasy VII Remake)