Reinforcement learning library in JAX.

Last update: Oct 30, 2022

Overview

Magi RL library in JAX

Installation | Agents | Examples | Contributing | Documentation

Magi is a RL library developed on top of Acme.

Note: Magi is in alpha development so expect breaking changes!

Installation

Create a new Python virtual environment

python3 -m venv venv
source venv/bin/activate

Install dependencies and the package in editable mode by running

pip install -U pip setuptools wheel
pip install -r requirements.txt # This uses pinned dependencies, you may adjust this for your needs.
pip install -e .

If for some reason installation fails, first check out GitHub Actions badge to see if this fails on the latest CI run. If the CI is successful, then it's likely that there are some issues to setting up your own environment. Refer to .github/workflows/ci.yaml as the official source for how to set up the environment.

Agents

magi includes popular RL algorithm implementation such as SAC, DrQ, SAC-AE and PETS. Refer to magi/agents for a full list of agents.

Examples

Check out magi/examples where we include examples of using our RL agents on popular benchmark tasks.

Testing

On Linux, you can run tests with

JAX_PLATFORM_NAME=cpu pytest -n `grep -c ^processor /proc/cpuinfo` magi

Contributing

Refer to CONTRIBUTING.md.

Acknowledgements

Magi is inspired by many of the open-source RL projects out there. Here is a (non-exhaustive) list of related libraries and packages that Magi references:

License

Apache License 2.0

Citation

If you use Magi in your work, please cite us according to the CITATION file. You may learn more about the CITATION file from here.

Reinforcement learning library in JAX.

Related tags

Overview

Magi RL library in JAX

Installation

Agents

Examples

Testing

Contributing

Acknowledgements

License

Citation

Owner

Yicheng Luo

ConE: Cone Embeddings for Multi-Hop Reasoning over Knowledge Graphs

PyTorch implementation DRO: Deep Recurrent Optimizer for Structure-from-Motion

Official Implementation for the paper DeepFace-EMD: Re-ranking Using Patch-wise Earth Mover’s Distance Improves Out-Of-Distribution Face Identification

HAR-stacked-residual-bidir-LSTMs - Deep stacked residual bidirectional LSTMs for HAR

A strongly-typed genetic programming framework for Python

Node-level Graph Regression with Deep Gaussian Process Models

An official repository for Paper "Uformer: A General U-Shaped Transformer for Image Restoration".

Consensus Learning from Heterogeneous Objectives for One-Class Collaborative Filtering

OpenDILab RL Kubernetes Custom Resource and Operator Lib

TorchX: A PyTorch Extension Library for More Efficient Deep Learning

Vision-Language Pre-training for Image Captioning and Question Answering

YOLOv5 in PyTorch > ONNX > CoreML > TFLite

To SMOTE, or not to SMOTE?

High accurate tool for automatic faces detection with landmarks

Pytorch implementation for the Temporal and Object Quantification Networks (TOQ-Nets).

Unofficial Pytorch Implementation of WaveGrad2

Code for the paper "M2m: Imbalanced Classification via Major-to-minor Translation" (CVPR 2020)

Relaxed-machines - explorations in neuro-symbolic differentiable interpreters

A GOOD REPRESENTATION DETECTS NOISY LABELS

Flower classification model that classifies flowers in 10 classes made using transfer learning (~85% accuracy).