
Merel-MoCap-GAIL

An implementation of Merel et al.'s paper on generative adversarial imitation learning (GAIL) using motion capture (MoCap) data:

Learning human behaviors from motion capture by adversarial imitation
Josh Merel, Yuval Tassa, Dhruva TB, Sriram Srinivasan, Jay Lemmon, Ziyu Wang, Greg Wayne, Nicolas Heess
arXiv preprint arXiv:1707.02201, 2017

Acknowledgements

This code is based on an earlier version developed by Ruben Villegas.

Clone the Repository

This repo contains two submodules (baselines and dm_control), so make sure you clone with --recursive:

git clone --recursive https://github.com/ywchao/merel-mocap-gail.git
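
If you have already cloned without --recursive, the submodules can be fetched afterwards with the standard git command:

git submodule update --init --recursive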

Installation

Make sure the following are installed.

  • Our own branch of baselines, provided as a submodule

    1. Change into the submodule directory:

      cd baselines
    2. Go through the installation steps in the README there, without re-cloning the repo.

  • An old version of dm_control, provided as a submodule

    1. Change into the submodule directory:

      cd dm_control
    2. Go through the installation steps in the README there, without re-cloning the repo. This requires installing MuJoCo. Also make sure to install the cloned version:

      pip install .

    Note that we have only tested against this pinned version. The code might work with newer versions, but this is not guaranteed.

  • Matplotlib
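
To verify the setup, the following imports should all succeed. This is a minimal sanity check, nothing repo-specific; it only confirms that the packages and the MuJoCo bindings load:

  # Confirm that the installed packages and the MuJoCo bindings load.
  import baselines
  import dm_control
  from dm_control import suite  # importing the suite initializes MuJoCo
  import matplotlib

  print('baselines:', baselines.__file__)
  print('dm_control:', dm_control.__file__)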

Training and Visualization

  1. Download the CMU MoCap dataset:

    ./scripts/download_cmu_mocap.sh

    This will populate the data folder with cmu_mocap.

  2. Preprocess the data. We use the walk sequences from subject 8, as described in the paper.

    ./scripts/data_collect.sh

    The output will be saved in data/cmu_mocap.npz.
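
    To check what was written, the file can be inspected with NumPy. A minimal sketch (the exact keys are whatever scripts/data_collect.sh stores):

      import numpy as np

      # List the arrays stored in the preprocessed file and their shapes.
      # allow_pickle=True covers any object arrays the script may store.
      data = np.load('data/cmu_mocap.npz', allow_pickle=True)
      for key in data.files:
          value = data[key]
          print(key, getattr(value, 'shape', type(value)))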

  3. Visualize the processed MoCap sequences in dm_control:

    ./scripts/data_visualize.sh

    The output will be saved in data/cmu_mocap_vis.
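
    The script renders frames through dm_control. For reference, a single frame of the CMU humanoid can also be rendered directly; a minimal sketch, assuming the pinned dm_control version exposes the humanoid_CMU suite domain (headless machines may additionally need an offscreen rendering backend such as OSMesa or EGL):

      # Load the CMU humanoid from the suite and save one rendered frame.
      from dm_control import suite
      import matplotlib.pyplot as plt

      env = suite.load(domain_name='humanoid_CMU', task_name='run')
      env.reset()
      frame = env.physics.render(height=480, width=640, camera_id=0)
      plt.imsave('humanoid_frame.png', frame)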

  4. Start training:

    ./scripts/train.sh 0 1

    Note that:

    • The first argument sets the random seed, and the second sets the number of sequences to use.
    • For now we use only sequence 1; step 7 shows how to train with all sequences.
    • The command above runs training with random seed 0. In practice we recommend running multiple training jobs with different seeds in parallel, as the training outcome is often sensitive to the seed value (see the sketch below).

    The output will be saved in output.
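
    A hypothetical way to launch several seeds in parallel from Python (a plain shell loop or a cluster scheduler works just as well):

      # Launch one training job per seed and wait for all of them.
      # Run from the repository root; adjust the seed list as needed.
      import subprocess

      seeds = [0, 1, 2, 3]
      procs = [subprocess.Popen(['./scripts/train.sh', str(seed), '1'])
               for seed in seeds]
      for proc in procs:
          proc.wait()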

  5. Monitor training with TensorBoard:

    tensorboard --logdir=output --port=6006

    [Training curves of episode length, rewards, and true rewards, obtained with four different random seeds.]

  6. Visualize the trained humanoid:

    ./scripts/visualize.sh \
      output/trpo_gail.obs_only.transition_limitation_1.humanoid_CMU_run.g_step_3.d_step_1.policy_entcoeff_0.adversary_entcoeff_0.001.seed_0.num_timesteps_5.00e+07/checkpoints/model.ckpt-30000 \
      output/trpo_gail.obs_only.transition_limitation_1.humanoid_CMU_run.g_step_3.d_step_1.policy_entcoeff_0.adversary_entcoeff_0.001.seed_0.num_timesteps_5.00e+07/vis_model.ckpt-30000.mp4 \
      0 \
      1

    The arguments are the model path, the output video (mp4) file path, the random seed, and the number of sequences to use.

    [Sample visualization of the trained humanoid.]

  7. To train with all sequences from subject 8, replace 1 with -1 in step 4:

    ./scripts/train.sh 0 -1

    Similarly, for visualization, replace 1 with -1 and update the paths:

    ./scripts/visualize.sh \
      output/trpo_gail.obs_only.transition_limitation_-1.humanoid_CMU_run.g_step_3.d_step_1.policy_entcoeff_0.adversary_entcoeff_0.001.seed_0.num_timesteps_5.00e+07/checkpoints/model.ckpt-50000 \
      output/trpo_gail.obs_only.transition_limitation_-1.humanoid_CMU_run.g_step_3.d_step_1.policy_entcoeff_0.adversary_entcoeff_0.001.seed_0.num_timesteps_5.00e+07/vis_model.ckpt-50000.mp4 \
      0 \
      -1

    Note that training takes longer to converge when using all sequences.

    [Training curves and a sample visualization for the all-sequences run.]
