An example project demonstrating how the Autonomous Learning Library can be used to build new reinforcement learning agents.

Last update: Aug 30, 2022

About

This repository shows how the Autonomous Learning Library can be used to build new reinforcement learning agents. In particular, it contains a model-based agent that predicts future frames and uses them to guide decision making.
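To make the idea of "predicting the future to guide decisions" concrete, here is a minimal, dependency-free sketch of one-step lookahead. It is not the agent in this repository: `choose_action`, `toy_model`, and `toy_value` are hypothetical names invented for illustration, and a real agent would replace the toy model with a learned frame predictor and the toy value with a learned value estimate.

```python
from typing import Callable, Sequence, TypeVar

S = TypeVar("S")  # state type
A = TypeVar("A")  # action type


def choose_action(
    state: S,
    actions: Sequence[A],
    predict_next: Callable[[S, A], S],
    value: Callable[[S], float],
) -> A:
    """Return the action whose predicted next state has the highest value."""
    return max(actions, key=lambda a: value(predict_next(state, a)))


# Toy stand-ins (assumptions for illustration only):
# the state is a paddle position and the ball sits at position 5.
def toy_model(state: int, action: int) -> int:
    """Pretend "frame predictor": the paddle moves by `action`."""
    return state + action


def toy_value(state: int) -> float:
    """Higher is better: being closer to the ball at position 5."""
    return -abs(5 - state)


if __name__ == "__main__":
    # From position 2, moving +1 brings the paddle closest to the ball.
    print(choose_action(2, [-1, 0, 1], toy_model, toy_value))
```

The selection logic only sees the model and the value function through their call signatures, so either can be swapped for a neural network without changing `choose_action`.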

Instructions

First, you'll need the latest version of PyTorch. If you wish to view TensorBoard logs, you'll also need to install TensorBoard (it also comes with TensorFlow). Then, you'll need to install the autonomous-learning-library along with the Atari environments:

pip install autonomous-learning-library[atari]
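If you want to confirm that the dependencies are visible to your Python interpreter before starting a long run, a small check along these lines may help. It is only a sketch: `missing_packages` is a hypothetical helper, and the import names in `REQUIRED` are assumptions (in particular, that the autonomous-learning-library is imported as `all`), so adjust them to match your installation.

```python
import importlib.util
from typing import Iterable, List


def missing_packages(names: Iterable[str]) -> List[str]:
    """Return the top-level module names that cannot be found."""
    return [n for n in names if importlib.util.find_spec(n) is None]


# Assumed import names: PyTorch, the autonomous-learning-library, TensorBoard.
REQUIRED = ["torch", "all", "tensorboard"]

if __name__ == "__main__":
    missing = missing_packages(REQUIRED)
    if missing:
        print("Missing packages:", ", ".join(missing))
    else:
        print("All required packages were found.")
```

This only checks that the modules can be located; it does not verify versions or that the Atari ROMs are installed.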

Unfortunately, the current IP holders for the Atari library have made it more difficult to acquire a license and use the ROMs than it used to be. If you have a license to use the ROMs, you can try AutoROM.

Usage

You can run the agent as well as a baseline DQN agent using:

python main.py Pong

You can track progress using:

tensorboard --logdir runs

Once the script has finished (could take a long time, especially if you do not have a fast GPU!), you can see the final results using:

python plot.py

Results

For us, the above instructions produced the following results:

As you can see, this agent isn't very good! On the other hand, the purpose of this agent was not performance, but to demonstrate the utility of the autonomous-learning-library in developing new agents not included in the original library. Maybe you can come up with ways of improving this agent!

Owner

Chris Nota
