Official code repository for Continual Learning In Environments With Polynomial Mixing Times

Last update: Dec 19, 2021

Related tags

Overview

Official code for Continual Learning In Environments With Polynomial Mixing Times

Continual Learning in Environments with Polynomial Mixing Times

This repository provides official code base for the paper "Continual Learning in Environments with Polynomial Mixing Times"

Basic Setup

Clone this repository and then follow this command

cd polynomial-mixing-times

Create either use a python virtualenv or a conda environment and activate it.

pip install virtualenv
virtualenv -p /usr/bin/python3.7 mixing-times
source mixing-times/bin/activate

To install all the relevant packages use the following command:

pip install -e .

Running the experiments

We provide a running script with all relevant hyperparameters used for both baselines and our proposed model. One can run run_bottleneck.sh to run all the models.

To run the experiments of the proposed models on the Example 2 Bottleneck MDP class with 4 rooms, "random" task evolution and a random seed of 1, use the following command

bash run_bottleneck.sh 1 4 "random"

Available Models

Online Q learning
Q learning with Replay
Q learning w/ Dyna
Model based n-step TD
Vanilla Policy Gradient
Onpolicy rho learning
Off-policy rho learning
rho Policy Gradient

List of Environments

ScaleClass-v0
NBottleneckClass-v0
NCycleClass-v0

System requirements

We used python 3.7 version to run all our experiments.

Official code repository for Continual Learning In Environments With Polynomial Mixing Times

Related tags

Overview

Continual Learning in Environments with Polynomial Mixing Times

Basic Setup

Running the experiments

Available Models

List of Environments

System requirements

Owner

Sharath Raparthy

An implementation of the research paper "Retina Blood Vessel Segmentation Using A U-Net Based Convolutional Neural Network"

Compact Bidirectional Transformer for Image Captioning

Code for Referring Image Segmentation via Cross-Modal Progressive Comprehension, CVPR2020.

Symmetry and Uncertainty-Aware Object SLAM for 6DoF Object Pose Estimation

Object detection evaluation metrics using Python.

Problem-943.-ACMP - Problem 943. ACMP

Implementation of trRosetta and trDesign for Pytorch, made into a convenient package

A modular active learning framework for Python

Code for our ACL 2021 paper "One2Set: Generating Diverse Keyphrases as a Set"

Road Crack Detection Using Deep Learning Methods

Deep Learning (with PyTorch)

An end-to-end framework for mixed-integer optimization with data-driven learned constraints.

Model-based reinforcement learning in TensorFlow

A simple baseline for 3d human pose estimation in tensorflow. Presented at ICCV 17.

This repository introduces a short project about Transfer Learning for Classification of MRI Images.

Code for Motion Representations for Articulated Animation paper

Main repository for the HackBio'2021 Virtual Internship Experience for #Team-Greider ❤️

Intro-to-dl - Resources for "Introduction to Deep Learning" course.

Autoregressive Models in PyTorch.

Pre-trained BERT Models for Ancient and Medieval Greek, and associated code for LaTeCH 2021 paper titled - "A Pilot Study for BERT Language Modelling and Morphological Analysis for Ancient and Medieval Greek"