Deeprl - Standard DQN and dueling network for simple games

Last update: Apr 12, 2020

Overview

DeepRL

This code implements standard deep Q-learning (DQN) and the dueling network architecture, both with experience replay (a memory buffer), for playing simple games.

The DQN algorithm implemented here is from Google DeepMind's paper Playing Atari with Deep Reinforcement Learning [link].

The dueling network is from the paper Dueling Network Architectures for Deep Reinforcement Learning [link].
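
As a rough illustration of the dueling idea, the sketch below combines a state value V(s) and per-action advantages A(s, a) into Q-values using the mean-subtracted aggregation described in the dueling paper. It is plain Python rather than the Lua/Torch code in this repository, and the function name `dueling_q_values` is hypothetical.

```python
from typing import List


def dueling_q_values(state_value: float, advantages: List[float]) -> List[float]:
    """Combine V(s) and A(s, a) into Q(s, a) = V(s) + A(s, a) - mean(A(s, .))."""
    mean_advantage = sum(advantages) / len(advantages)
    # Subtracting the mean keeps V and A identifiable: the advantages
    # are forced to average to zero across actions.
    return [state_value + a - mean_advantage for a in advantages]


# Example: one state with three actions (values chosen for illustration only).
q = dueling_q_values(1.0, [0.5, -0.5, 0.0])
print(q)
```

In a real network the value and the advantages would come from two output heads that share the same lower layers; here they are passed in as plain numbers to keep the aggregation step visible.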

Requirement

DeepRL is implemented with Torch and packages from its ecosystem. The code works well on my Mac Pro with a CPU (I haven't tested it on Linux or on a GPU). Install Torch7 first, then install the following packages with luarocks:

luarocks install nn
luarocks install image
luarocks install qt
luarocks install optim

Running

You can run the code by typing the following command in the project directory:

qlua main.lua

The results look like this:

DQN: I got an accuracy of 93.2% (932 successes out of 1000 epochs).

Dueling: I got an accuracy of 99.2% (992 successes out of 1000 epochs).

Code

envir.lua defines the reinforcement learning environment, which receives an action and produces the resulting state and a reward for the agent.

agent.lua implements the agent, which receives the state and reward and produces an action chosen by the policy network.
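
The interaction between the environment and the agent can be sketched as a simple loop. The example below is a minimal Python sketch, not the repository's Lua code: `CorridorEnv`, `epsilon_greedy`, and `run_episode` are hypothetical names, the corridor game is an assumed toy task, and epsilon-greedy action selection is the exploration rule commonly used with DQN (the README does not state which rule agent.lua uses).

```python
import random
from typing import Callable, List, Tuple


class CorridorEnv:
    """Hypothetical toy environment: walk right along a line to reach the goal."""

    def __init__(self, length: int = 5) -> None:
        self.length = length
        self.position = 0

    def reset(self) -> int:
        self.position = 0
        return self.position

    def step(self, action: int) -> Tuple[int, float, bool]:
        # Action 1 moves right, any other action moves left.
        move = 1 if action == 1 else -1
        self.position = max(0, min(self.length - 1, self.position + move))
        done = self.position == self.length - 1
        reward = 1.0 if done else 0.0
        return self.position, reward, done


def epsilon_greedy(q_values: List[float], epsilon: float, rng: random.Random) -> int:
    """Pick a random action with probability epsilon, else the greedy one."""
    if rng.random() < epsilon:
        return rng.randrange(len(q_values))
    return max(range(len(q_values)), key=lambda a: q_values[a])


def run_episode(env: CorridorEnv, q_function: Callable[[int], List[float]],
                epsilon: float, rng: random.Random, max_steps: int = 100) -> float:
    """Run one episode and return the total reward collected."""
    state = env.reset()
    total_reward = 0.0
    for _ in range(max_steps):
        action = epsilon_greedy(q_function(state), epsilon, rng)
        state, reward, done = env.step(action)
        total_reward += reward
        if done:
            break
    return total_reward


# A stand-in for the policy network: always prefers moving right.
always_right = lambda state: [0.0, 1.0]
print(run_episode(CorridorEnv(), always_right, epsilon=0.0, rng=random.Random(0)))
```

In the actual project the `q_function` role would be played by the trained network, and the environment would be the game defined in envir.lua.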

learner.lua implements the DQN learning algorithm with experience replay.
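
The core of DQN with experience replay can be sketched in a few lines: transitions are stored in a fixed-size buffer, minibatches are sampled at random, and each sampled transition gets the target r + gamma * max Q(s', a'), or just r when the episode ended. The Python below is a minimal sketch under those assumptions; `ReplayBuffer` and `td_targets` are hypothetical names, and the code does not reproduce learner.lua.

```python
import random
from collections import deque
from typing import Callable, Deque, List, Tuple

# (state, action, reward, next_state, done)
Transition = Tuple[int, int, float, int, bool]


class ReplayBuffer:
    """Fixed-size memory; the oldest transitions are dropped when full."""

    def __init__(self, capacity: int) -> None:
        self.memory: Deque[Transition] = deque(maxlen=capacity)

    def push(self, transition: Transition) -> None:
        self.memory.append(transition)

    def sample(self, batch_size: int, rng: random.Random) -> List[Transition]:
        # Sample without replacement, capped at the number of stored items.
        size = min(batch_size, len(self.memory))
        return rng.sample(list(self.memory), size)

    def __len__(self) -> int:
        return len(self.memory)


def td_targets(batch: List[Transition],
               q_function: Callable[[int], List[float]],
               gamma: float) -> List[float]:
    """Compute the Q-learning target for every transition in the batch."""
    targets = []
    for _state, _action, reward, next_state, done in batch:
        if done:
            # Terminal transition: no future reward to bootstrap from.
            targets.append(reward)
        else:
            targets.append(reward + gamma * max(q_function(next_state)))
    return targets


buffer = ReplayBuffer(capacity=100)
buffer.push((0, 1, 0.0, 1, False))
buffer.push((3, 1, 1.0, 4, True))
batch = buffer.sample(2, random.Random(0))
print(td_targets(batch, lambda state: [0.5, 2.0], gamma=0.9))
```

A training step would then move the network's prediction Q(s, a) toward each target, for example by minimizing the squared difference with a gradient-based optimizer.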

MISC

I wrote this code while I was an intern at Horizon Robotics. Many thanks to Andrej Karpathy's article and to other implementations: SeanNaren's code and EderSantana's gist.

LICENSE

MIT

Owner

Yao Zhou
