Flexible Option Learning - NeurIPS 2021

Last update: Nov 09, 2022

Flexible Option Learning

This repository contains the code for the paper Flexible Option Learning, presented as a Spotlight at NeurIPS 2021. The implementation builds on gym-miniworld, OpenAI's baselines, and the tabular implementation of Option-Critic.

Contents:

FourRooms Experiments
Continuous Control Experiments
Visual Navigation Experiments
Citation

Tabular Experiments (Four-Rooms)

Installation and Launch code

pip install gym==0.12.1
cd diagnostic_experiments/
python main_fixpol.py --multi_option # for experiments with fixed options
python main.py --multi_option # for experiments with learned options

Continuous Control (MuJoCo)

Installation

virtualenv moc_cc --python=python3
source moc_cc/bin/activate
pip install tensorflow==1.12.0 
cd continuous_control
pip install -e . 
pip install gym==0.9.3
pip install mujoco-py==0.5.1

Launch

cd baselines/ppoc_int
python run_mujoco.py --switch --nointfc --env AntWalls --eta 0.9 --mainlr 8e-5 --intlr 8e-5 --piolr 8e-5

Maze Navigation (MiniWorld)

Installation

virtualenv moc_vision --python=python3
source moc_vision/bin/activate
pip install tensorflow==1.13.1
cd vision_miniworld
pip install -e .
pip install gym==0.15.4
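
The three experiment suites in this README pin different versions of gym (0.12.1, 0.9.3, 0.15.4) and two versions of tensorflow, so they cannot share one environment and it is easy to launch a run from the wrong one. The following is a minimal sketch of a pre-flight check, not part of the repository: the suite labels in `PINS` and the function names `installed_version` and `check_pins` are assumptions made for illustration, while the version numbers are copied from the installation steps.

```python
# Pins copied from the installation steps in this README. The suite labels
# ("tabular", "continuous_control", "vision_miniworld") are chosen for this
# sketch and are not identifiers used by the repository.
PINS = {
    "tabular": {"gym": "0.12.1"},
    "continuous_control": {
        "tensorflow": "1.12.0",
        "gym": "0.9.3",
        "mujoco-py": "0.5.1",
    },
    "vision_miniworld": {"tensorflow": "1.13.1", "gym": "0.15.4"},
}


def installed_version(package):
    """Return the installed version of `package`, or None if it is missing."""
    try:
        from importlib import metadata  # available on Python 3.8+
    except ImportError:
        import pkg_resources  # fallback for older interpreters (setuptools)

        try:
            return pkg_resources.get_distribution(package).version
        except pkg_resources.DistributionNotFound:
            return None
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None


def check_pins(suite, lookup=installed_version):
    """Return (package, expected, found) tuples for every pin that is not met."""
    mismatches = []
    for package, expected in PINS[suite].items():
        found = lookup(package)
        if found != expected:
            mismatches.append((package, expected, found))
    return mismatches
```

Calling `check_pins("vision_miniworld")` inside the activated virtualenv returns an empty list when the pins are met. The `lookup` argument exists only so the comparison can be exercised without installing anything.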

Launch

cd baselines/
# Run agent in first task
python run.py --alg=ppo2_options --env=MiniWorld-WallGap-v0 --num_timesteps 2500000 --save_interval 1000 --num_env 8 --noptions 4 --eta 0.7

# Load and run agent in transfer task
python run.py --alg=ppo2_options --env=MiniWorld-WallGapTransfer-v0 --load_path path/to/model --num_timesteps 2500000 --save_interval 1000 --num_env 8 --noptions 4 --eta 0.7
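
The two commands above differ only in `--env` and `--load_path`; every other flag is identical. As a rough sketch (not part of the repository), the shared flags can be kept in one place so the transfer run cannot drift from the settings used in the first task. The helper names `build_command` and `to_shell` are hypothetical, `path/to/model` is the same placeholder as in the command above, and all flag values are copied from this README.

```python
import shlex

# Flags shared by both runs, copied verbatim from the launch commands.
SHARED_ARGS = [
    "--num_timesteps", "2500000",
    "--save_interval", "1000",
    "--num_env", "8",
    "--noptions", "4",
    "--eta", "0.7",
]


def build_command(env_id, load_path=None):
    """Return the argv list for one run.py invocation."""
    argv = ["python", "run.py", "--alg=ppo2_options", "--env={}".format(env_id)]
    if load_path is not None:
        # Only the transfer run loads a previously saved model.
        argv += ["--load_path", load_path]
    return argv + SHARED_ARGS


def to_shell(argv):
    """Join an argv list into a single shell-safe command line."""
    return " ".join(shlex.quote(arg) for arg in argv)


first_task = build_command("MiniWorld-WallGap-v0")
transfer = build_command("MiniWorld-WallGapTransfer-v0", load_path="path/to/model")

print(to_shell(first_task))
print(to_shell(transfer))
```

The printed lines can be pasted into a shell from the `baselines/` directory, or the argv lists can be passed directly to `subprocess.run`.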

Cite

If you find this work useful, please consider adding it to your references.

@inproceedings{
klissarov2021flexible,
title={Flexible Option Learning},
author={Martin Klissarov and Doina Precup},
booktitle={Thirty-Fifth Conference on Neural Information Processing Systems},
year={2021},
url={https://openreview.net/forum?id=L5vbEVIePyb}
}
