TensorFlow implementation of "Learning from Simulated and Unsupervised Images through Adversarial Training"

Last update: Dec 29, 2022

Overview

Simulated+Unsupervised (S+U) Learning in TensorFlow

TensorFlow implementation of Learning from Simulated and Unsupervised Images through Adversarial Training.

Requirements

Python 2.7
TensorFlow 0.12.1
SciPy
pillow
tqdm

Usage

To generate the synthetic dataset:

Run UnityEyes with the resolution set to 640x480 and the camera parameters set to [0, 0, 20, 40].
Move the generated images and JSON files into data/gaze/UnityEyes.

The data directory should look like:

data
├── gaze
│   ├── MPIIGaze
│   │   └── Data
│   │       └── Normalized
│   │           ├── p00
│   │           ├── p01
│   │           └── ...
│   └── UnityEyes # contains images of UnityEyes
│       ├── 1.jpg
│       ├── 1.json
│       ├── 2.jpg
│       ├── 2.json
│       └── ...
├── __init__.py
├── gaze_data.py
├── hand_data.py
└── utils.py
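
One way to sanity-check this layout before training is to confirm that every UnityEyes image has a matching JSON file. The sketch below is not part of this repository: the function name `find_unpaired_samples` is hypothetical, and it assumes the flat `N.jpg` / `N.json` naming shown in the tree above.

```python
import os
import shutil
import tempfile


def find_unpaired_samples(unityeyes_dir):
    """Return sorted base names that have a .jpg without a .json, or vice versa.

    Hypothetical helper, assuming files are named like 1.jpg / 1.json.
    """
    images, annotations = set(), set()
    for name in os.listdir(unityeyes_dir):
        base, ext = os.path.splitext(name)
        if ext.lower() == ".jpg":
            images.add(base)
        elif ext.lower() == ".json":
            annotations.add(base)
    # Symmetric difference: names present in exactly one of the two sets.
    return sorted(images ^ annotations)


if __name__ == "__main__":
    # Small demo on a throwaway directory; 2.jpg deliberately lacks its JSON file.
    demo_dir = tempfile.mkdtemp()
    try:
        for name in ("1.jpg", "1.json", "2.jpg"):
            open(os.path.join(demo_dir, name), "w").close()
        print(find_unpaired_samples(demo_dir))
    finally:
        shutil.rmtree(demo_dir)
```

Pointing the function at data/gaze/UnityEyes and getting an empty list back would suggest the images and JSON files were moved over together.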

To train a model (samples will be generated in the samples directory):

$ python main.py
$ tensorboard --logdir=logs --host=0.0.0.0

To refine all synthetic images with a pretrained model:

$ python main.py --is_train=False --synthetic_image_dir="./data/gaze/UnityEyes/"

Training results

Differences with the paper

Used the Adam and Stochastic Gradient Descent optimizers.
Used only 83K synthetic images from UnityEyes (about 7% of the 1.2M used in the paper).
Manually chose the hyperparameters B and lambda because they are not specified in the paper.
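
For context, in the paper lambda weights an L1 self-regularization term that is added to the refiner's adversarial loss, and B is the size of a buffer of previously refined images from which half of each discriminator mini-batch is drawn. The following is a minimal, framework-free sketch of those two ideas, not the code used in this repository. All names (`refiner_loss`, `HistoryBuffer`, `discriminator_batch`) are hypothetical, the feature transform is assumed to be the identity, images are plain lists of floats, and the discriminator output is read as the probability that an image is refined rather than real.

```python
import math
import random


def refiner_loss(d_refined_probs, refined, synthetic, reg_scale):
    """Adversarial term plus reg_scale (lambda) times an L1 self-regularization term.

    d_refined_probs: discriminator outputs in (0, 1) for each refined image.
    refined, synthetic: lists of flattened images (lists of floats).
    """
    # The refiner wants the discriminator to assign low "refined" probability.
    adversarial = -sum(math.log(1.0 - p) for p in d_refined_probs)
    # L1 distance keeps each refined image close to its synthetic input.
    l1 = sum(
        abs(r - s)
        for refined_img, synthetic_img in zip(refined, synthetic)
        for r, s in zip(refined_img, synthetic_img)
    )
    return adversarial + reg_scale * l1


class HistoryBuffer(object):
    """Holds up to `capacity` (B, assumed >= 1) previously refined images."""

    def __init__(self, capacity, rng=None):
        self.capacity = capacity
        self.images = []
        self.rng = rng or random.Random()

    def sample(self, n):
        """Draw up to n stored images without replacement."""
        return self.rng.sample(self.images, min(n, len(self.images)))

    def push(self, new_images):
        """Append while there is room, then overwrite randomly chosen slots."""
        for image in new_images:
            if len(self.images) < self.capacity:
                self.images.append(image)
            else:
                self.images[self.rng.randrange(self.capacity)] = image


def discriminator_batch(buffer, current_refined):
    """Mix current refined images with up to half a batch drawn from the buffer."""
    half = len(current_refined) // 2
    old = buffer.sample(half)
    # While the buffer is still filling up, current images make up the difference.
    batch = current_refined[: len(current_refined) - len(old)] + old
    buffer.push(current_refined[:half])
    return batch
```

In this sketch `reg_scale` plays the role of lambda, matching the `--reg_scale` flag used in the experiments below: a larger value keeps refined images closer to the synthetic inputs, while a smaller value gives the adversarial term more influence.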

Experiments #1

For these synthetic images,

Result of lambda=1.0 with optimizer=sgd after 8,000 steps.

$ python main.py --reg_scale=1.0 --optimizer=sgd

Result of lambda=0.5 with optimizer=sgd after 8,000 steps.

$ python main.py --reg_scale=0.5 --optimizer=sgd

Training loss of discriminator and refiner when lambda is 1.0 (green) and 0.5 (yellow).

Experiments #2

For these synthetic images,

Result of lambda=1.0 with optimizer=adam after 4,000 steps.

$ python main.py --reg_scale=1.0 --optimizer=adam

Result of lambda=0.5 with optimizer=adam after 4,000 steps.

$ python main.py --reg_scale=0.5 --optimizer=adam

Result of lambda=0.1 with optimizer=adam after 4,000 steps.

$ python main.py --reg_scale=0.1 --optimizer=adam

Training loss of discriminator and refiner when lambda is 1.0 (blue), 0.5 (purple) and 0.1 (green).

Author

Taehoon Kim / @carpedm20
