On Effective Scheduling of Model-based Reinforcement Learning

Last update: Oct 07, 2022

Related tags

Deep Learning autombpo

Overview

Code to reproduce the experiments in On Effective Scheduling of Model-based Reinforcement Learning.

Requirements

To install requirements:

pip install -r requirements.txt

A MuJoCo license is required to run the experiments on the MuJoCo environments.
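As a quick way to see which experiments are affected by the license requirement, the environment names can be split by backend. This is only a sketch: it assumes, from the names alone, that environments ending in "bullet" are PyBullet-based and the rest are MuJoCo-based. The helper `needs_mujoco` and the list `ENV_NAMES` are hypothetical and not part of the repository.

```python
# Environment names as listed in the Training section of this README.
ENV_NAMES = [
    "hopper", "ant", "humanoid",
    "hopperbullet", "walker2dbullet", "halfcheetahbullet",
]


def needs_mujoco(env_name):
    """Guess whether an environment needs MuJoCo.

    Assumption (based on the names only): environments whose name ends
    in "bullet" are PyBullet-based; all others are MuJoCo-based.
    """
    if env_name not in ENV_NAMES:
        raise ValueError(f"unknown environment {env_name!r}")
    return not env_name.endswith("bullet")


# Split the list into environments that presumably need the license
# and environments that presumably do not.
mujoco_envs = [name for name in ENV_NAMES if needs_mujoco(name)]
other_envs = [name for name in ENV_NAMES if not needs_mujoco(name)]
```

If the assumption holds, the environments in `other_envs` are the ones to try first on a machine without a MuJoCo license.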

Training

To train the hyper-controller of the paper, run this command:

python train.py --env=<env_name>

The env_name can be selected from [hopper,ant,humanoid,hopperbullet,walker2dbullet,halfcheetahbullet]. For example: python train.py --env=hopper
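To train on several environments in sequence, the training command can be wrapped in a small script. The following is a minimal sketch that assumes only the `train.py --env=...` interface and the environment names listed above. The names `ENV_NAMES`, `build_train_command`, and `train_all` are hypothetical helpers, not part of the repository.

```python
import subprocess
import sys

# Environment names as listed in this README.
ENV_NAMES = [
    "hopper", "ant", "humanoid",
    "hopperbullet", "walker2dbullet", "halfcheetahbullet",
]


def build_train_command(env_name):
    """Return the argument list for `python train.py --env=<env_name>`."""
    if env_name not in ENV_NAMES:
        raise ValueError(
            f"unknown environment {env_name!r}; expected one of {ENV_NAMES}"
        )
    # sys.executable keeps the run inside the current Python environment.
    return [sys.executable, "train.py", f"--env={env_name}"]


def train_all(env_names=ENV_NAMES):
    """Train on each environment in turn, stopping at the first failure."""
    for env_name in env_names:
        # check=True raises CalledProcessError if train.py exits non-zero.
        subprocess.run(build_train_command(env_name), check=True)
```

Calling `train_all(["hopper", "ant"])` from the repository root would run the two training jobs one after the other. Validating the name before launching avoids starting a long run with a mistyped environment.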

The trained hyper-controller will be saved in saved-models/. The computing infrastructure used in our experiments and the approximate computation time needed to train the hyper-controller are provided in Appendix G of the paper.

Evaluation

After training, to evaluate the trained hyper-controller, run:

python eval.py --config=config.<env_name> --model_path=saved-models

The env_name can be selected from [hopper,ant,humanoid,hopperbullet,walker2dbullet,halfcheetahbullet]. For example: python eval.py --config=config.hopper --model_path=saved-models

Note that this command can only be run after the hyper-controller has been trained on the corresponding environment.
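The ordering constraint above can be checked before launching evaluation. The sketch below builds the evaluation command and refuses to do so when the model directory is missing or empty. It assumes only the `eval.py --config=... --model_path=...` interface shown in this README; `build_eval_command` and `ENV_NAMES` are hypothetical names, and the internal layout of saved-models/ is not documented here, so the check does not verify that a model for the specific environment exists.

```python
import sys
from pathlib import Path

# Environment names as listed in this README.
ENV_NAMES = [
    "hopper", "ant", "humanoid",
    "hopperbullet", "walker2dbullet", "halfcheetahbullet",
]


def build_eval_command(env_name, model_path="saved-models"):
    """Return the argument list for eval.py, or raise if no models are present."""
    if env_name not in ENV_NAMES:
        raise ValueError(f"unknown environment {env_name!r}")
    model_dir = Path(model_path)
    # Assumption: a missing or empty directory means training has not run yet.
    if not model_dir.is_dir() or not any(model_dir.iterdir()):
        raise FileNotFoundError(
            f"no trained models found in {model_path!r}; run train.py first"
        )
    return [
        sys.executable,
        "eval.py",
        f"--config=config.{env_name}",
        f"--model_path={model_path}",
    ]
```

The returned list can be passed to `subprocess.run(..., check=True)`. Passing `model_path="pre-trained-models"` would build the equivalent command for the pre-trained hyper-controllers.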

Pre-trained Models

We provide our pre-trained hyper-controllers in pre-trained-models/ to make the experiments easier to reproduce. To evaluate the pre-trained models, run:

python eval.py --config=config.<env_name> --model_path=pre-trained-models

The env_name can be selected from [hopper,ant,humanoid,hopperbullet,walker2dbullet,halfcheetahbullet]. For example: python eval.py --config=config.hopper --model_path=pre-trained-models

Owner

laihang
