This is a clean and robust Pytorch implementation of DQN and Double DQN.

Last update: Dec 27, 2022

Related tags

Deep Learning DQN-DDQN-Pytorch

Overview

DQN/DDQN-Pytorch

This is a clean and robust Pytorch implementation of DQN and Double DQN. Here is the training curve:

All the experiments are trained with same hyperparameters.

A quick render here:

Dependencies

gym==0.18.3
numpy==1.21.2
pytorch==1.8.1

How to use my code

Train from scratch

run 'python main.py', where the default enviroment is CartPole-v1.

Play with trained model

run 'python main.py --write False --render True --Loadmodel True --ModelIdex 50000'

Change Enviroment

If you want to train on different enviroments, just run 'python main.py --EnvIdex 1'.
The --EnvIdex can be set to be 0 and 1, where
'--EnvIdex 0' for 'CartPole-v1'
'--EnvIdex 1' for 'LunarLander-v2'

Visualize the training curve

You can use the tensorboard to visualize the training curve. History training curve is saved at '\runs'

Hyperparameter Setting

For more details of Hyperparameter Setting, please check 'main.py'

References

DQN: Mnih V , Kavukcuoglu K , Silver D , et al. Playing Atari with Deep Reinforcement Learning[J]. Computer Science, 2013.

Double DQN: Hasselt H V , Guez A , Silver D . Deep Reinforcement Learning with Double Q-learning[J]. Computer ence, 2015.

This is a clean and robust Pytorch implementation of DQN and Double DQN.

Related tags

Overview

DQN/DDQN-Pytorch

Dependencies

How to use my code

Train from scratch

Play with trained model

Change Enviroment

Visualize the training curve

Hyperparameter Setting

References

Other RL algorithms by Pytorch can be found here.

Owner

XinJingHao

A modular active learning framework for Python

CPF: Learning a Contact Potential Field to Model the Hand-object Interaction

Prototypical Pseudo Label Denoising and Target Structure Learning for Domain Adaptive Semantic Segmentation (CVPR 2021)

Yolox-bytetrack-sample - Python sample of MOT (Multiple Object Tracking) using YOLOX and ByteTrack

A simple and extensible library to create Bayesian Neural Network layers on PyTorch.

Real time Human Detection Counting

🚀 An end-to-end ML applications using PyTorch, W&B, FastAPI, Docker, Streamlit and Heroku

Voxel-based Network for Shape Completion by Leveraging Edge Generation (ICCV 2021, oral)

A style-based Quantum Generative Adversarial Network

学习 python3 以来写的一些垃圾玩具……

FedScale: Benchmarking Model and System Performance of Federated Learning

ISTR: End-to-End Instance Segmentation with Transformers (https://arxiv.org/abs/2105.00637)

Code to reproduce the results in "Visually Grounded Reasoning across Languages and Cultures", EMNLP 2021.

git git《Transformer Meets Tracker: Exploiting Temporal Context for Robust Visual Tracking》(CVPR 2021) GitHub:git2] 《Masksembles for Uncertainty Estimation》(CVPR 2021) GitHub:git3]

Rank1 Conversation Emotion Detection Task

A PaddlePaddle version of Neural Renderer, refer to its PyTorch version

Coursera - Quiz & Assignment of Coursera

Repo 4 basic seminar §How to make human machine readable"

Retinal vessel segmentation based on GT-UNet

Skyformer: Remodel Self-Attention with Gaussian Kernel and Nystr\"om Method (NeurIPS 2021)