Tensorflow implementation of Human-Level Control through Deep Reinforcement Learning

Last update: Dec 26, 2022

Related tags

Deep Learning DQN-tensorflow

Overview

Human-Level Control through Deep Reinforcement Learning

Tensorflow implementation of Human-Level Control through Deep Reinforcement Learning.

This implementation contains:

Deep Q-network and Q-learning
Experience replay memory
- to reduce the correlations between consecutive updates
Network for Q-learning targets are fixed for intervals
- to reduce the correlations between target and predicted Q-values

Requirements

Python 2.7 or Python 3.3+
gym
tqdm
SciPy or OpenCV2
TensorFlow 0.12.0

Usage

First, install prerequisites with:

$ pip install tqdm gym[all]

To train a model for Breakout:

$ python main.py --env_name=Breakout-v0 --is_train=True
$ python main.py --env_name=Breakout-v0 --is_train=True --display=True

To test and record the screen with gym:

$ python main.py --is_train=False
$ python main.py --is_train=False --display=True

Results

Result of training for 24 hours using GTX 980 ti.

Simple Results

Details of Breakout with model m2(red) for 30 hours using GTX 980 Ti.

Details of Breakout with model m3(red) for 30 hours using GTX 980 Ti.

Detailed Results

[1] Action-repeat (frame-skip) of 1, 2, and 4 without learning rate decay

[2] Action-repeat (frame-skip) of 1, 2, and 4 with learning rate decay

[1] & [2]

[3] Action-repeat of 4 for DQN (dark blue) Dueling DQN (dark green) DDQN (brown) Dueling DDQN (turquoise)

The current hyper parameters and gradient clipping are not implemented as it is in the paper.

[4] Distributed action-repeat (frame-skip) of 1 without learning rate decay

[5] Distributed action-repeat (frame-skip) of 4 without learning rate decay

References

License

MIT License.

Tensorflow implementation of Human-Level Control through Deep Reinforcement Learning

Related tags

Overview

Human-Level Control through Deep Reinforcement Learning

Requirements

Usage

Results

Simple Results

Detailed Results

References

License

Owner

Devsisters Corp.

PyTorch implementation of Spiking Neural Networks trained on surrogate gradient & BPTT using snntorch.

A collection of educational notebooks on multi-view geometry and computer vision.

Generalized Data Weighting via Class-level Gradient Manipulation

Improving Object Detection by Label Assignment Distillation

Discriminative Condition-Aware PLDA

[NeurIPS 2020] This project provides a strong single-stage baseline for Long-Tailed Classification, Detection, and Instance Segmentation (LVIS).

Block Sparse movement pruning

Deep Reinforcement Learning for mobile robot navigation in ROS Gazebo simulator

STARCH compuets regional extreme storm physical characteristics and moisture balance based on spatiotemporal precipitation data from reanalysis or climate model data.

Optimizing DR with hard negatives and achieving SOTA first-stage retrieval performance on TREC DL Track (SIGIR 2021 Full Paper).

Implementation of CaiT models in TensorFlow and ImageNet-1k checkpoints. Includes code for inference and fine-tuning.

Keras implementation of "One pixel attack for fooling deep neural networks" using differential evolution on Cifar10 and ImageNet

Implementation of Memformer, a Memory-augmented Transformer, in Pytorch

In this work, we will implement some basic but important algorithm of machine learning step by step.

Efficient Speech Processing Tookit for Automatic Speaker Recognition

Video Frame Interpolation with Transformer (CVPR2022)

In the case of your data having only 1 channel while want to use timm models

Domain Generalization with MixStyle, ICLR'21.

A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation

Official implementation of EfficientPose