A collection of Reinforcement Learning algorithms from Sutton and Barto's book and other research papers implemented in Python.

Last update: Dec 28, 2022

Overview

Reinforcement-Learning-Notebooks

A collection of Reinforcement Learning algorithms from Sutton and Barto's book and other research papers implemented in Python.

I wrote these notebooks in March 2017 while I took the COMP 767: Reinforcement Learning [5] class by Prof. Doina Precup at McGill, Montréal. I highly recommend you to go through the class notes and references of all the papers the intructors have posted on the website.

These notebooks should be used while you read the book and go beyond the same with the referenced papers. I would suggest watching David Silver's videos and reading the book simultaneously. And when you are done with a few chapters, start implementing them. The algorithms follow a pattern and mostly are variants of each other. I have tried my best to explain each notebook's results and possible future directions.

Disclaimer: The code is a little messy. I'd written this when I was not a Pythonista. If you would like to clean them up and want to make it into a nice interface, feel free to contact me. I will be very pleased to collaborate. If you use them then please cite the source and also mention the credits as listed below. Also, email me with ways to improve, let me know if you find any bugs.

Feel free to reach me at [email protected] or see my website here

Special Credits:

[1] Denny Britz

[2] Monica Patel

[3] Sutton and Barto

[4] David Silver

[5] Doina Precup's course

A collection of Reinforcement Learning algorithms from Sutton and Barto's book and other research papers implemented in Python.

Related tags

Overview

Reinforcement-Learning-Notebooks

A collection of Reinforcement Learning algorithms from Sutton and Barto's book and other research papers implemented in Python.

Owner

Pulkit Khandelwal

Official Repo for Ground-aware Monocular 3D Object Detection for Autonomous Driving

An official PyTorch Implementation of Boundary-aware Self-supervised Learning for Video Scene Segmentation (BaSSL)

Net2net - Network-to-Network Translation with Conditional Invertible Neural Networks

Prototype-based Incremental Few-Shot Semantic Segmentation

Code for Understanding Pooling in Graph Neural Networks

Code to produce syntactic representations that can be used to study syntax processing in the human brain

thundernet ncnn

Build tensorflow keras model pipelines in a single line of code. Created by Ram Seshadri. Collaborators welcome. Permission granted upon request.

Convnext-tf - Unofficial tensorflow keras implementation of ConvNeXt

Fairness Metrics: All you need to know

Python Interview Questions

pixelNeRF: Neural Radiance Fields from One or Few Images

Code for "Continuous-Time Meta-Learning with Forward Mode Differentiation" (ICLR 2022)

Data-Driven Operational Space Control for Adaptive and Robust Robot Manipulation

This is the code for HOI Transformer

Creating multimodal multitask models

Pytorch code for "Text-Independent Speaker Verification Using 3D Convolutional Neural Networks".

Official code for "EagerMOT: 3D Multi-Object Tracking via Sensor Fusion" [ICRA 2021]

PyTorch implementation of spectral graph ConvNets, NIPS’16

Code repository of the paper Neural circuit policies enabling auditable autonomy published in Nature Machine Intelligence