PyTorch implementation of Munchausen Reinforcement Learning based on DQN and SAC. Handles discrete and continuous action spaces

Last update: Mar 10, 2022

Overview

Exploring Munchausen Reinforcement Learning

This is the project repository of my team in the "Advanced Deep Learning for Robotics" course at TUM. Our project's topic is "Exploring Munchausen Reinforcement Learning" based on this paper.

For a detailed discussion, see the report and the final presentation.

Setup

Create a virtual environment.
Run pip3 install -r requirements.txt

Code Structure

This repository is structured as follows:

The directories M-DQN and M-SAC contain the implementations of the RL agents DQN and SAC extended with the Munchausen term, respectively.
The directories rl-baselines3-zoo contains a copy of this repository, where we included the implementations of M-DQN so that we can easily train and test the M-DQN agent on benchmark environments and also compare it to other classical agents. To do so, just follow the steps described in the original repository and insert M-DQN as the agent argument.
The directory particles-envcontains a modified version of this repository. The modified version contains code for a particles environment, where an agent wants to reach a goal, while avoiding obstacles. Besides, M-SAC agent is implemented and included in the code, so that it can be trained and compared to the classical SAC agent.
The directory action-gap contains implementation of callbacks for experiment manager of rl-baselines3-zoo which logs action-gap for tensorboard.

PyTorch implementation of Munchausen Reinforcement Learning based on DQN and SAC. Handles discrete and continuous action spaces

Related tags

Overview

Exploring Munchausen Reinforcement Learning

Setup

Code Structure

Owner

Mohamed Amine Ketata

CN24 is a complete semantic segmentation framework using fully convolutional networks

Official PyTorch implementation of "Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble" (NeurIPS'21)

Implementation of Restricted Boltzmann Machine (RBM) and its variants in Tensorflow

GarmentNets: Category-Level Pose Estimation for Garments via Canonical Space Shape Completion

Structured Data Gradient Pruning (SDGP)

Multi-query Video Retreival

A Closer Look at Reference Learning for Fourier Phase Retrieval

Cosine Annealing With Warmup

Structure Information is the Key: Self-Attention RoI Feature Extractor in 3D Object Detection

Temporal-Relational CrossTransformers

RLMeta is a light-weight flexible framework for Distributed Reinforcement Learning Research.

This is code of book "Learn Deep Learning with PyTorch"

CLDF dataset derived from Robbeets et al.'s "Triangulation Supports Agricultural Spread" from 2021

X-modaler is a versatile and high-performance codebase for cross-modal analytics.

DPT: Deformable Patch-based Transformer for Visual Recognition (ACM MM2021)

🕺Full body detection and tracking

Official pytorch implementation of DeformSyncNet: Deformation Transfer via Synchronized Shape Deformation Spaces

Code to reproduce the experiments in the paper "Transformer Based Multi-Source Domain Adaptation" (EMNLP 2020)

Moon-patrol - A faithful recreation of the 1983 hit classic Moon Patrol for the Atari 2600 created using the Pygame library for Python

RealFormer-Pytorch Implementation of RealFormer using pytorch