Multi-Objective Reinforced Active Learning

Last update: Nov 19, 2022

Related tags

Deep Learning moral_rl

Overview

Multi-Objective Reinforced Active Learning

Dependencies

wandb
tqdm
pytorch >= 1.7.0
numpy >= 1.20.0
scipy >= 1.1.0
pycolab == 1.2

Weights and Biases

Our code depends on for visualizing and logging results during training. As a result, we call wandb.init(), which will prompt to add an API key for linking the training runs with your personal wandb account. This can be done by pasting the WANDB_API_KEY into the respective box when running the code for the first time.

Environments

Our gridworlds (Emergency: randomized_v2.py, Delivery: randomized_v3.py) build on the game engine with a custom wrapper to provide similar functionality as the gym . This engine comes with a user interface and any environment can be played in the console using python environment.py with arrow keys and w, a, s, d as controls.

Training

There are four training scripts for

manually training a PPO agent on custom rewards (ppo_train.py),
training AIRL on a single expert dataset (airl_train.py),
active MORL with custom/automatic preferences (moral_train.py) and
training DRLHP with custom/automatic preferences (drlhp_train.py).

When using automatic preferences, a desired ratio can be passed as an argument. For example,

python moral_train.py --ratio a b c

will run MORAL using a (real-valued) ratio of a:b:c among the three explicit objectives in Delivery.

Hyperparameters

Hyperparameters are passed as arguments to wandb.init() and can be changed by modifying the respective training files.

Multi-Objective Reinforced Active Learning

Related tags

Overview

Multi-Objective Reinforced Active Learning

Dependencies

Weights and Biases

Environments

Training

Hyperparameters

Owner

Markus Peschl

Pytorch implementation of MaskGIT: Masked Generative Image Transformer

Official repository for the paper, MidiBERT-Piano: Large-scale Pre-training for Symbolic Music Understanding.

Projects for AI/ML and IoT integration for games and other presented at re:Invent 2021.

Running Google MoveNet Multipose Tracking models on OpenVINO.

SlideGraph+: Whole Slide Image Level Graphs to Predict HER2 Status in Breast Cancer

SemiNAS: Semi-Supervised Neural Architecture Search

A simple, high level, easy-to-use open source Computer Vision library for Python.

Repositório da disciplina de APC, no segundo semestre de 2021

Tensorflow Implementation of SMU: SMOOTH ACTIVATION FUNCTION FOR DEEP NETWORKS USING SMOOTHING MAXIMUM TECHNIQUE

Unofficial implementation of the paper: PonderNet: Learning to Ponder in TensorFlow

FaceQgen: Semi-Supervised Deep Learning for Face Image Quality Assessment

unofficial pytorch implementation of RefineGAN

The code is for the paper "A Self-Distillation Embedded Supervised Affinity Attention Model for Few-Shot Segmentation"

A repo that contains all the mesh keys needed for mesh backend, along with a code example of how to use them in python

Object-aware Contrastive Learning for Debiased Scene Representation

Bridging the Gap between Label- and Reference based Synthesis(ICCV 2021)

Official code for MPG2: Multi-attribute Pizza Generator: Cross-domain Attribute Control with Conditional StyleGAN

This repository contains the source code for the paper Tutorial on amortized optimization for learning to optimize over continuous domains by Brandon Amos

a morph transfer UGATIT for image translation.

Locally Enhanced Self-Attention: Rethinking Self-Attention as Local and Context Terms