Reinforcement Learning Tricks, Index

This repository contains the code for the paper "Distilling Reinforcement Learning Tricks for Video Games".

Short story shorter: RL algorithms are neat and all, but to get it to work in video games (RL competitions and whatnot), there are some nifty little tricks involved that need bit of expertise in the domain. This includes reward shaping, curriculum learning, splitting task into subtasks by hand and guiding agent's actions. We took some of these tricks and tried them on three environments with DQN. With right setup you get more out of DQN.

Code authors: Anssi Kanervisto, Christian Scheller and Yanick Schraner.

The experiments in the three environments are split into three git branches:

vizdoom for ViZDoom Deathmatch experiments
minerl for MineRL ObtainDiamond experiments
gfootball for Football environment experiments

To run the experiments, checkout the repository you want to run experiments for with git checkout [branch name], and follow the instructions in the README file there.

After running all the experiments, collect the results as described the respective branches. You should have three directories

vizdoom-runs
minerl-runs
football-runs

After this, running python plot_paper.py should create a figures/learning_curves.pdf file which summarizes the results.

Evaluating different engineering tricks that make RL work

Related tags

Overview

Reinforcement Learning Tricks, Index

Owner

Anssi

Efficient Training of Audio Transformers with Patchout

CARLA: A Python Library to Benchmark Algorithmic Recourse and Counterfactual Explanation Algorithms

The official implementation of Variable-Length Piano Infilling (VLI).

[MedIA2021]MIDeepSeg: Minimally Interactive Segmentation of Unseen Objects from Medical Images Using Deep Learning

A machine learning malware analysis framework for Android apps.

Python package for multiple object tracking research with focus on laboratory animals tracking.

Image-Adaptive YOLO for Object Detection in Adverse Weather Conditions

HNN: Human (Hollywood) Neural Network

git《Self-Attention Attribution: Interpreting Information Interactions Inside Transformer》(AAAI 2021) GitHub:

3D Pose Estimation for Vehicles

Data augmentation for NLP, accepted at EMNLP 2021 Findings

ColBERT: Contextualized Late Interaction over BERT (SIGIR'20)

A set of tools for converting a darknet dataset to COCO format working with YOLOX

Self-Supervised Document-to-Document Similarity Ranking via Contextualized Language Models and Hierarchical Inference

[TIP 2020] Multi-Temporal Scene Classification and Scene Change Detection with Correlation based Fusion

Deep deconfounded recommender (Deep-Deconf) for paper "Deep causal reasoning for recommendations"

PyTorch implementation of our ICCV paper DeFRCN: Decoupled Faster R-CNN for Few-Shot Object Detection.

The FIRST GANs-based omics-to-omics translation framework

Age Progression/Regression by Conditional Adversarial Autoencoder

You Only Look One-level Feature (YOLOF), CVPR2021, Detectron2