Code for "MetaMorph: Learning Universal Controllers with Transformers", Gupta et al, ICLR 2022

Last update: Jan 03, 2023

Overview

MetaMorph: Learning Universal Controllers with Transformers

This is the code for the paper

MetaMorph: Learning Universal Controllers with Transformers
Agrim Gupta, Linxi Fan, Surya Ganguli, Fei-Fei Li

Multiple domains like vision, natural language, and audio are witnessing tremendous progress by leveraging Transformers for large-scale pre-training followed by task-specific fine-tuning. In contrast, in robotics we primarily train a single robot for a single task. Modular robot systems now allow for the flexible combination of general-purpose building blocks into task-optimized morphologies, but given the exponentially large number of possible robot morphologies, training a controller for each new design is impractical. In this work, we propose MetaMorph, a Transformer-based approach to learn a universal controller over a modular robot design space. MetaMorph is based on the insight that robot morphology is just another modality on which we can condition the output of a Transformer. Through extensive experiments we demonstrate that large-scale pre-training on a variety of robot morphologies results in policies with combinatorial generalization capabilities, including zero-shot generalization to unseen robot morphologies. We further demonstrate that our pre-trained policy can be used for sample-efficient transfer to completely new robot morphologies and tasks.
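
To make the idea of treating morphology as a modality more concrete, here is a minimal sketch of one way a single attention layer could handle robots with different numbers of limbs: each limb becomes a token, shorter robots are padded to a common length, and a mask keeps the padding out of the attention weights. This is an illustration only, not the repository's implementation. The function names `pad_limbs` and `masked_self_attention` are hypothetical, and the sketch assumes at least one real limb and omits learned projections and positional embeddings.

```python
import math


def pad_limbs(limb_features, max_limbs):
    """Pad a robot's per-limb feature vectors to max_limbs token slots.

    Returns (tokens, pad_mask), where pad_mask[i] is True for padded slots.
    """
    dim = len(limb_features[0])
    n_pad = max_limbs - len(limb_features)
    tokens = [list(f) for f in limb_features] + [[0.0] * dim for _ in range(n_pad)]
    pad_mask = [False] * len(limb_features) + [True] * n_pad
    return tokens, pad_mask


def masked_self_attention(tokens, pad_mask):
    """Single-head scaled dot-product self-attention over limb tokens.

    Queries, keys, and values are the raw tokens (no learned projections).
    Padded slots receive zero attention weight from every query.
    """
    dim = len(tokens[0])
    scale = 1.0 / math.sqrt(dim)
    outputs = []
    for query in tokens:
        # Score against every slot; padded slots are set to -inf.
        scores = [
            -math.inf if is_pad else scale * sum(q * k for q, k in zip(query, key))
            for key, is_pad in zip(tokens, pad_mask)
        ]
        # Numerically stable softmax; exp(-inf) is 0, which drops the padding.
        top = max(scores)
        weights = [math.exp(s - top) for s in scores]
        total = sum(weights)
        weights = [w / total for w in weights]
        # Attention output: weighted sum of the token vectors.
        outputs.append(
            [sum(w * tok[d] for w, tok in zip(weights, tokens)) for d in range(dim)]
        )
    return outputs
```

With this masking, the outputs for a robot's real limbs do not depend on how much padding was added, which is what lets one set of weights serve many morphologies. Outputs computed for the padded slots themselves carry no meaning and should be ignored.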

Code Structure

The code consists of two main components:

Metamorph: Code for joint pre-training of different robots.
Environments and evaluation tasks: Three pre-training environments and two evaluation environments.

Benchmark

We also provide the Unimal-100 benchmark. It consists of 100 training morphologies, 1600 morphologies with dynamics variations, 800 morphologies with kinematics variations, and 100 test morphologies.

# Install gdown
pip install gdown
# Download data
gdown 1LyKYTCevnqWrDle1LTBMlBF58RmCjSzM
# Unzip
unzip unimals_100.zip
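
After unzipping, it can be useful to confirm that the archive extracted completely. The sketch below counts XML files grouped by top-level folder. It assumes that each morphology is stored as one `.xml` file and that the archive extracts to a directory named `unimals_100`; neither is stated above, so adjust the path and pattern to the actual layout. The function name `count_xml_by_folder` is hypothetical.

```python
from collections import Counter
from pathlib import Path


def count_xml_by_folder(root):
    """Count *.xml files under root, grouped by top-level subfolder.

    Files that sit directly in root are counted under the key ".".
    """
    root = Path(root)
    if not root.is_dir():
        raise FileNotFoundError(f"benchmark directory not found: {root}")
    counts = Counter()
    for xml_path in root.rglob("*.xml"):
        rel = xml_path.relative_to(root)
        # First path component below root; "." for files directly in root.
        top = rel.parts[0] if len(rel.parts) > 1 else "."
        counts[top] += 1
    return dict(counts)
```

For example, `count_xml_by_folder("unimals_100")` returns a dictionary of per-folder counts that can be compared against the morphology numbers listed above.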

Setup

We provide a Dockerfile for easy installation and development. If you prefer to work without Docker, please take a look at the Dockerfile and ensure that your local system has all the necessary dependencies installed.

Training

# Build docker container. Ensure that MuJoCo license is present: docker/mjkey.txt
./scripts/build_docker.sh
# Joint pre-training. Please change MOUNT_DIR location inside run_docker_gpu.sh
# Finally ensure that ENV.WALKER_DIR points to benchmark files and is accessible
# from docker.
./scripts/run_docker_gpu.sh python tools/train_ppo.py --cfg ./configs/ft.yaml

The default parameters assume that you are running the code on a machine with at least 1 GPU.
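
The training commands above depend on several files being in place. As a convenience, the following sketch checks for them before anything is built or launched. The file paths are the ones named in the commands above; the function name `missing_prerequisites` and the idea of passing the benchmark directory as an argument are assumptions made for illustration. The check runs on the host, so it cannot confirm that the directory is also mounted inside the container.

```python
from pathlib import Path

# Paths named in the training instructions, relative to the repository root.
REQUIRED_FILES = (
    "docker/mjkey.txt",
    "scripts/build_docker.sh",
    "scripts/run_docker_gpu.sh",
    "tools/train_ppo.py",
    "configs/ft.yaml",
)


def missing_prerequisites(repo_root, walker_dir):
    """Return a list of problems; an empty list means nothing is missing.

    repo_root:  path to the checked-out repository
    walker_dir: host path that ENV.WALKER_DIR is meant to point to
    """
    repo_root = Path(repo_root)
    problems = [
        f"missing file: {rel}"
        for rel in REQUIRED_FILES
        if not (repo_root / rel).is_file()
    ]
    if not Path(walker_dir).is_dir():
        problems.append(f"benchmark directory not found: {walker_dir}")
    return problems
```

Calling `missing_prerequisites(".", "/path/to/benchmark")` from the repository root and printing the result gives a quick list of what still needs to be set up.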

Citation

If you find this code useful, please consider citing:

@inproceedings{
    gupta2022metamorph,
    title={MetaMorph: Learning Universal Controllers with Transformers},
    author={Agrim Gupta and Linxi Fan and Surya Ganguli and Li Fei-Fei},
    booktitle={International Conference on Learning Representations},
    year={2022},
    url={https://openreview.net/forum?id=Opmqtk_GvYL}
}

Credit

This codebase would not have been possible without the following amazing open source codebases:

Owner

Agrim Gupta
