PPO Lagrangian in JAX

Last update: Sep 14, 2022

Related tags

Deep Learning jax-ppo

Overview

PPO Lagrangian in JAX

This repository implements PPO in JAX. Implementation is tested on the safety-gym benchmark.

Usage

Install dependencies using the following-

pip install -r requirements.txt

Install safety-gym (after installing mujoco-py) using the following-

git clone https://github.com/openai/safety-gym.git
cd safety-gym
pip install -e .

Train the PPO agent using the following-

python train.py --env=Safexp-CarGoal1-v0

Results will be stored in the logs folder. To create a plot run the following-

python plot.py

Citation

In case you find the code helpful then please cite the following-

@misc{ppolag,
  author = {Suri, Karush},
  title = {{PPO Lagrangian in JAX.}},
  url = {https://github.com/karush17/jax-ppo},
  year = {2021}
}

Owner

Karush Suri

Deep Learning Researcher at Huawei Noah's Ark Lab, Toronto.

GitHub Repository

Parris, the automated infrastructure setup tool for machine learning algorithms.

README Parris, the automated infrastructure setup tool for machine learning algorithms. What Is This Tool? Parris is a tool for automating the trainin

319 Aug 02, 2022

Forecasting for knowable future events using Bayesian informative priors (forecasting with judgmental-adjustment).

What is judgyprophet? judgyprophet is a Bayesian forecasting algorithm based on Prophet, that enables forecasting while using information known by the

56 Oct 26, 2022

Asymmetric metric learning for knowledge transfer

Asymmetric metric learning This is the official code that enables the reproduction of the results from our paper: Asymmetric metric learning for knowl

20 Dec 06, 2022

This repository focus on Image Captioning & Video Captioning & Seq-to-Seq Learning & NLP

Awesome-Visual-Captioning Table of Contents ACL-2021 CVPR-2021 AAAI-2021 ACMMM-2020 NeurIPS-2020 ECCV-2020 CVPR-2020 ACL-2020 AAAI-2020 ACL-2019 NeurI

362 Jan 03, 2023

[ICCV 2021] Official PyTorch implementation for Deep Relational Metric Learning.

Ranking Models in Unlabeled New Environments Prerequisites This code uses the following libraries Python 3.7 NumPy PyTorch 1.7.0 + torchivision 0.8.1

39 Dec 10, 2022

Minimalistic PyTorch training loop

Backbone for PyTorch training loop Will try to keep it minimalistic. pip install back from back import Bone Features Progress bar Checkpoints saving/l

4 Jan 16, 2020

On the adaptation of recurrent neural networks for system identification

On the adaptation of recurrent neural networks for system identification This repository contains the Python code to reproduce the results of the pape

3 Jan 13, 2022

Event sourced bank - A wide-and-shallow example using the Python event sourcing library

Event Sourced Bank A "wide but shallow" example of using the Python event sourci

3 Mar 09, 2022

PyTorch code for EMNLP 2021 paper: Don't be Contradicted with Anything! CI-ToD: Towards Benchmarking Consistency for Task-oriented Dialogue System

Don’t be Contradicted with Anything!CI-ToD: Towards Benchmarking Consistency for Task-oriented Dialogue System This repository contains the PyTorch im

25 Sep 06, 2022

Coded illumination for improved lensless imaging

CodedCam Coded Illumination for Improved Lensless Imaging Paper | Supplementary results | Data and Code are available. Coded illumination for improved

1 Nov 29, 2021

Annotated, understandable, and visually interpretable PyTorch implementations of: VAE, BIRVAE, NSGAN, MMGAN, WGAN, WGANGP, LSGAN, DRAGAN, BEGAN, RaGAN, InfoGAN, fGAN, FisherGAN

Overview PyTorch 0.4.1 | Python 3.6.5 Annotated implementations with comparative introductions for minimax, non-saturating, wasserstein, wasserstein g

471 Dec 16, 2022

PPO Lagrangian in JAX

Related tags

Overview

PPO Lagrangian in JAX

Usage

Citation

Owner

Karush Suri

Parris, the automated infrastructure setup tool for machine learning algorithms.

Forecasting for knowable future events using Bayesian informative priors (forecasting with judgmental-adjustment).

Asymmetric metric learning for knowledge transfer

This repository focus on Image Captioning & Video Captioning & Seq-to-Seq Learning & NLP

[ICCV 2021] Official PyTorch implementation for Deep Relational Metric Learning.

Minimalistic PyTorch training loop

On the adaptation of recurrent neural networks for system identification

Event sourced bank - A wide-and-shallow example using the Python event sourcing library

PyTorch code for EMNLP 2021 paper: Don't be Contradicted with Anything! CI-ToD: Towards Benchmarking Consistency for Task-oriented Dialogue System

Coded illumination for improved lensless imaging

Annotated, understandable, and visually interpretable PyTorch implementations of: VAE, BIRVAE, NSGAN, MMGAN, WGAN, WGANGP, LSGAN, DRAGAN, BEGAN, RaGAN, InfoGAN, fGAN, FisherGAN

Unoffical reMarkable AddOn for Firefox.

DeepOBS: A Deep Learning Optimizer Benchmark Suite

Implementation of U-Net and SegNet for building segmentation

An expansion for RDKit to read all types of files in one line

Code for the paper "Adapting Monolingual Models: Data can be Scarce when Language Similarity is High"

Anderson Acceleration for Deep Learning

Pytorch Implementation of Zero-Shot Image-to-Text Generation for Visual-Semantic Arithmetic

On the Limits of Pseudo Ground Truth in Visual Camera Re-Localization

A PyTorch Implementation of "Watch Your Step: Learning Node Embeddings via Graph Attention" (NeurIPS 2018).