ATAC: Adversarially Trained Actor Critic

Last update: Dec 08, 2022

Adversarially Trained Actor Critic for Offline Reinforcement Learning by Ching-An Cheng*, Tengyang Xie*, Nan Jiang, and Alekh Agarwal.
https://arxiv.org/abs/2202.02446

Setup

Clone the repository and create a conda environment.

git clone https://github.com/microsoft/ATAC.git
conda create -n atac python=3.8
cd ATAC

Prerequisite: Install Mujoco

(Optional) Install free mujoco210 for mujoco_py and mujoco211 for dm_control.

bash install_mujoco.sh
echo "export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:~/.mujoco/mujoco210/bin:/usr/lib/nvidia" >> ~/.bashrc
source ~/.bashrc
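
After sourcing ~/.bashrc, it may be worth confirming that the MuJoCo directory exists and is on LD_LIBRARY_PATH. The sketch below is not part of the repository. The helper name path_contains is hypothetical, and the install location is assumed from the export line above, with ~ expanded to $HOME.

```shell
# Hypothetical helper (not from the ATAC repo): succeeds if the
# colon-separated list in $1 contains the exact entry $2.
path_contains() {
    case ":$1:" in
        *":$2:"*) return 0 ;;
        *) return 1 ;;
    esac
}

# Assumed install location, taken from the export line above.
mujoco_bin="$HOME/.mujoco/mujoco210/bin"

if [ ! -d "$mujoco_bin" ]; then
    echo "missing directory: $mujoco_bin"
elif path_contains "${LD_LIBRARY_PATH:-}" "$mujoco_bin"; then
    echo "LD_LIBRARY_PATH includes $mujoco_bin"
else
    echo "LD_LIBRARY_PATH does not include $mujoco_bin"
fi
```

If the last message appears, opening a new shell or re-running source ~/.bashrc is usually enough.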

Install ATAC

conda activate atac
pip install -e .[mujoco210]
# or below, if the original paid mujoco is used.
pip install -e .[mujoco200]

Run ATAC

python scripts/main.py

Contributing

This project welcomes contributions and suggestions. Most contributions require you to agree to a Contributor License Agreement (CLA) declaring that you have the right to, and actually do, grant us the rights to use your contribution. For details, visit https://cla.opensource.microsoft.com.

When you submit a pull request, a CLA bot will automatically determine whether you need to provide a CLA and decorate the PR appropriately (e.g., status check, comment). Simply follow the instructions provided by the bot. You will only need to do this once across all repos using our CLA.

This project has adopted the Microsoft Open Source Code of Conduct. For more information see the Code of Conduct FAQ or contact opencode@microsoft.com with any additional questions or comments.

Trademarks

This project may contain trademarks or logos for projects, products, or services. Authorized use of Microsoft trademarks or logos is subject to and must follow Microsoft's Trademark & Brand Guidelines. Use of Microsoft trademarks or logos in modified versions of this project must not cause confusion or imply Microsoft sponsorship. Any use of third-party trademarks or logos is subject to those third parties' policies.
