Deep Reinforcement Learning Agents

This repository contains a collection of reinforcement learning algorithms written in TensorFlow. The IPython notebooks here were written to accompany a tutorial series, still in progress, that I have been publishing on Medium. If you are new to reinforcement learning, I recommend reading the accompanying post for each algorithm.

The repository currently contains the following algorithms:

Q-Table - An implementation of Q-learning using tables to solve a stochastic environment problem.
Q-Network - A neural network implementation of Q-Learning to solve the same environment as in Q-Table.
Simple-Policy - An implementation of a policy gradient method for stateless environments such as n-armed bandit problems.
Contextual-Policy - An implementation of a policy gradient method for stateful environments such as contextual bandit problems.
Policy-Network - An implementation of a neural network policy-gradient agent that solves full RL problems with states and delayed rewards, and two opposite actions (e.g. CartPole or Pong).
Vanilla-Policy - An implementation of a neural network vanilla-policy-gradient agent that solves full RL problems with states, delayed rewards, and an arbitrary number of actions.
Model-Network - An addition to the Policy-Network algorithm that includes a separate network modeling the environment dynamics.
Double-Dueling-DQN - An implementation of a Deep-Q Network with the Double DQN and Dueling DQN additions to improve stability and performance.
Deep-Recurrent-Q-Network - An implementation of a Deep Recurrent Q-Network which can solve reinforcement learning problems involving partial observability.
Q-Exploration - An implementation of DQN containing multiple action-selection strategies for exploration. Strategies include: greedy, random, e-greedy, Boltzmann, and Bayesian Dropout.
A3C-Doom - An implementation of the Asynchronous Advantage Actor-Critic (A3C) algorithm, which uses multiple agents to collectively improve a policy. This implementation can solve RL problems in 3D environments such as VizDoom challenges.
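
As a rough illustration of two ideas from the list above, the tabular Q-learning update behind Q-Table and a few of the action-selection strategies named under Q-Exploration, here is a minimal sketch in plain Python. It is not the code from the notebooks, which use TensorFlow. The function names, the nested-list Q-table layout, and the default values for alpha and gamma are all assumptions made for this example. Bayesian Dropout is left out because it needs a network, and terminal states are not treated specially.

```python
import math
import random


def greedy(q_values):
    """Pick the action with the highest estimated value."""
    return max(range(len(q_values)), key=lambda a: q_values[a])


def epsilon_greedy(q_values, epsilon, rng=random):
    """With probability epsilon pick a uniformly random action, else the greedy one."""
    if rng.random() < epsilon:
        return rng.randrange(len(q_values))
    return greedy(q_values)


def boltzmann(q_values, temperature, rng=random):
    """Sample an action with probability proportional to exp(Q / temperature)."""
    top = max(q_values)  # subtract the max so exp() cannot overflow
    weights = [math.exp((q - top) / temperature) for q in q_values]
    return rng.choices(range(len(q_values)), weights=weights, k=1)[0]


def q_update(q_table, state, action, reward, next_state, alpha=0.1, gamma=0.99):
    """One tabular Q-learning step: move Q(s, a) toward r + gamma * max Q(s', .).

    q_table is a list of per-state lists of action values (hypothetical layout).
    alpha is the learning rate and gamma the discount factor (assumed defaults).
    """
    target = reward + gamma * max(q_table[next_state])
    q_table[state][action] += alpha * (target - q_table[state][action])
    return q_table[state][action]


if __name__ == "__main__":
    # Two states, two actions, all values initialised to zero.
    q_table = [[0.0, 0.0], [0.0, 0.0]]
    action = epsilon_greedy(q_table[0], epsilon=0.1)
    new_value = q_update(q_table, state=0, action=action, reward=1.0, next_state=1)
    print(action, new_value)
```

Setting epsilon to 0 makes epsilon_greedy purely greedy, and setting it to 1 makes it purely random, so the greedy and random strategies are its two limiting cases. In boltzmann, a high temperature flattens the distribution toward uniform, while a low temperature concentrates it on the highest-valued action.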

Owner

Arthur Juliani
