Implementation of H-UCRL Algorithm

This repository is an implementation of the H-UCRL algorithm introduced in Curi, S., Berkenkamp, F., & Krause, A. (2020). Efficient Model-Based Reinforcement Learning through Optimistic Policy Search and Planning.

To install create a conda environment:

$ conda create -n hucrl python=3.7
$ conda activate hucrl

$ pip install -e .[test,logging,experiments]

For Mujoco (license required) Run:

$ pip install -e .[mujoco]

Running an experiment.

For the inverted pendulum experiment run

$ python exps/inverted_pendulum/run.py

For the mujoco (license required) experiment run

$ python exps/mujoco/run.py --environment ENV_NAME --agent AGENT_NAME --action

We support MBHalfCheetah-v0, MBPusher-v0, MBReacher-v0, MBAnt-v0, MBCartPole-v0, MBHopper-v0, MBInvertedDoublePendulum-v0, MBInvertedPendulum-v0, MBReacher-v0, MBReacher3D-v0, MBSwimmer-v0, MBWalker2d-v0

Citing H-UCRL

If you this repo for your research please use the following BibTeX entry:

@article{curi2020efficient,
  title={Efficient model-based reinforcement learning through optimistic policy search and planning},
  author={Curi, Sebastian and Berkenkamp, Felix and Krause, Andreas},
  journal={Advances in Neural Information Processing Systems},
  volume={33},
  year={2020}
}

Implementation of H-UCRL Algorithm

Related tags

Overview

Implementation of H-UCRL Algorithm

Running an experiment.

Citing H-UCRL

Owner

Sebastian Curi

Agent-based model simulator for air quality and pandemic risk assessment in architectural spaces

Tutorial for the PERFECTING FACTORY 5.0 WITH EDGE-POWERED AI workshop

Code Impementation for "Mold into a Graph: Efficient Bayesian Optimization over Mixed Spaces"

PyTorch implementation of federated learning framework based on the acceleration of global momentum

Algorithm to texture 3D reconstructions from multi-view stereo images

Pytorch implementation of MaskFlownet

Sequence-to-Sequence learning using PyTorch

Points2Surf: Learning Implicit Surfaces from Point Clouds (ECCV 2020 Spotlight)

Husein pet projects in here!

Image Fusion Transformer

PyTorch code accompanying the paper "Landmark-Guided Subgoal Generation in Hierarchical Reinforcement Learning" (NeurIPS 2021).

Event sourced bank - A wide-and-shallow example using the Python event sourcing library

Finding Donors for CharityML

HW3 ― GAN, ACGAN and UDA

Tool for live presentations using manim

Code in conjunction with the publication 'Contrastive Representation Learning for Hand Shape Estimation'

(IEEE TIP 2021) Regularized Densely-connected Pyramid Network for Salient Instance Segmentation

This repository builds a basic vision transformer from scratch so that one beginner can understand the theory of vision transformer.

Rule Based Classification Project

Read and write layered TIFF ImageSourceData and ImageResources tags