This is an implementation of the proximal policy optimization (PPO) algorithm for the C++ API of PyTorch

Last update: Dec 09, 2022

Overview

PPO Pytorch C++

This is an implementation of the proximal policy optimization (PPO) algorithm for the C++ API of PyTorch. It uses a simple TestEnvironment to test the algorithm. The figure below shows the environment in which the algorithm is tested.

Fig. 1: The agent in testing mode.
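
For readers new to PPO, the core of the algorithm is the clipped surrogate objective. The sketch below illustrates that objective in plain C++ without LibTorch, so it is not the code used in this repository. The function name clipped_surrogate and the parameter clip_eps are assumptions made for illustration only.

```cpp
#include <algorithm>
#include <cassert>
#include <cmath>
#include <cstddef>
#include <vector>

// Illustrative sketch only (hypothetical helper, not part of this repository).
// Computes the PPO clipped surrogate objective, averaged over a batch:
//   L = mean( min(r * A, clip(r, 1 - eps, 1 + eps) * A) )
// where r = exp(new_log_prob - old_log_prob) is the probability ratio
// between the updated policy and the policy that collected the data,
// and A is the advantage estimate for the sampled action.
double clipped_surrogate(const std::vector<double>& new_log_probs,
                         const std::vector<double>& old_log_probs,
                         const std::vector<double>& advantages,
                         double clip_eps) {
    assert(new_log_probs.size() == old_log_probs.size());
    assert(new_log_probs.size() == advantages.size());
    if (new_log_probs.empty()) {
        return 0.0;  // Nothing to average over.
    }

    double sum = 0.0;
    for (std::size_t i = 0; i < new_log_probs.size(); ++i) {
        const double ratio = std::exp(new_log_probs[i] - old_log_probs[i]);
        // Restrict the ratio to [1 - eps, 1 + eps].
        const double clipped =
            std::max(1.0 - clip_eps, std::min(ratio, 1.0 + clip_eps));
        // Taking the minimum gives a pessimistic bound on the unclipped term.
        sum += std::min(ratio * advantages[i], clipped * advantages[i]);
    }
    return sum / static_cast<double>(new_log_probs.size());
}
```

In training, this quantity is maximized (or its negative is minimized as a loss). With a tensor library such as LibTorch, the same expression would typically be written with element-wise tensor operations instead of an explicit loop.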

Build

You first need to install PyTorch. For a clean installation from Anaconda, check out this short tutorial, or this tutorial to install only the binaries.

mkdir build
cd build
cmake -DCMAKE_PREFIX_PATH=/absolute/path/to/libtorch ..
make

Run

Run the executable with

cd build
./train_ppo

It should produce output similar to that shown below.

Fig. 2: From left to right, the agent for successive epochs in training mode as it takes actions in the environment to reach the goal.

Once trained, the agent can also be run in test mode. To do so, run

cd build
./test_ppo

Visualization

The results are saved to data/data.csv and can be visualized by running python plot.py.

Owner

Martin Huber


