Official Implementation of 'UPDeT: Universal Multi-agent Reinforcement Learning via Policy Decoupling with Transformers' ICLR 2021(spotlight)

Last update: Dec 22, 2022

Related tags

Deep Learning UPDeT

Overview

UPDeT

Official Implementation of UPDeT: Universal Multi-agent Reinforcement Learning via Policy Decoupling with Transformers (ICLR 2021 spotlight)

The framework is inherited from PyMARL. UPDeT is written in pytorch and uses SMAC as its environment.

Installation instructions

Installing dependencies:

pip install -r requirements.txt

Download SC2 into the `3rdparty/` folder and copy the maps necessary to run over.

bash install_sc2.sh

Run an experiment

Before training your own transformer-based multi-agent model, there are a list of things to note.

Currently, this repository supports marine-based battle scenarios. e.g. 3m, 8m, 5m_vs_6m.
If you are interested in training a different unit type, carefully modify the Transformer Parameters block at src/config/default.yaml and revise the _build_input_transformer function in basic_controller.python.
Before running the experiment, check the agent type in Agent Parameters block at src/config/default.yaml.
This repository contains two new transformer-based agents from the UPDeT paper including
- Standard UPDeT
- Aggregation Transformer

Training script

python3 src/main.py --config=vdn --env-config=sc2 with env_args.map_name=5m_vs_6m

All results will be stored in the Results/ folder.

Performance

Single battle scenario

Surpass the GRU baseline on hard 5m_vs_6m with:

Multiple battle scenarios

Zero-shot generalize to different tasks:

Result on 7m-5m-3m transfer learning.

Note: Only UPDeT can be deployed to other scenarios without changing the model's architecture.

More details please refer to UPDeT paper.

Bibtex

@article{hu2021updet,
  title={UPDeT: Universal Multi-agent Reinforcement Learning via Policy Decoupling with Transformers},
  author={Hu, Siyi and Zhu, Fengda and Chang, Xiaojun and Liang, Xiaodan},
  journal={arXiv preprint arXiv:2101.08001},
  year={2021}
}

License

The MIT License

Official Implementation of 'UPDeT: Universal Multi-agent Reinforcement Learning via Policy Decoupling with Transformers' ICLR 2021(spotlight)

Related tags

Overview

UPDeT

Installation instructions

Installing dependencies:

Download SC2 into the `3rdparty/` folder and copy the maps necessary to run over.

Run an experiment

Training script

Performance

Single battle scenario

Multiple battle scenarios

Bibtex

License

Owner

hhhusiyi

NeuralDiff: Segmenting 3D objects that move in egocentric videos

Code for "LASR: Learning Articulated Shape Reconstruction from a Monocular Video". CVPR 2021.

Automatic labeling, conversion of different data set formats, sample size statistics, model cascade

a basic code repository for basic task in CV(classification,detection,segmentation)

SEJE Pytorch implementation

We will see a basic program that is basically a hint to brute force attack to crack passwords. In other words, we will make a program to Crack Any Password Using Python. Show some ❤️ by starring this repository!

CVAT is free, online, interactive video and image annotation tool for computer vision

Implementation of the paper NAST: Non-Autoregressive Spatial-Temporal Transformer for Time Series Forecasting.

A keras-based real-time model for medical image segmentation (CFPNet-M)

Self-Supervised Image Denoising via Iterative Data Refinement

Multi-Agent Reinforcement Learning (MARL) method to learn scalable control polices for multi-agent target tracking.

Implementing a simplified copy of Shazam application from scratch using MinHashing and LSH.

Implementation of Artificial Neural Network Algorithm

Using machine learning to predict and analyze high and low reader engagement for New York Times articles posted to Facebook.

A learning-based data collection tool for human segmentation

This is the code related to "Sparse-to-dense Feature Matching: Intra and Inter domain Cross-modal Learning in Domain Adaptation for 3D Semantic Segmentation" (ICCV 2021).

Unsupervised Representation Learning by Invariance Propagation

Container : Context Aggregation Network

DISTIL: Deep dIverSified inTeractIve Learning.

A fast implementation of bss_eval metrics for blind source separation

Official Implementation of 'UPDeT: Universal Multi-agent Reinforcement Learning via Policy Decoupling with Transformers' ICLR 2021(spotlight)

Related tags

Overview

UPDeT

Installation instructions

Installing dependencies:

Download SC2 into the 3rdparty/ folder and copy the maps necessary to run over.

Run an experiment

Training script

Performance

Single battle scenario

Multiple battle scenarios

Bibtex

License

Owner

hhhusiyi

NeuralDiff: Segmenting 3D objects that move in egocentric videos

Code for "LASR: Learning Articulated Shape Reconstruction from a Monocular Video". CVPR 2021.

Automatic labeling, conversion of different data set formats, sample size statistics, model cascade

a basic code repository for basic task in CV(classification,detection,segmentation)

SEJE Pytorch implementation

We will see a basic program that is basically a hint to brute force attack to crack passwords. In other words, we will make a program to Crack Any Password Using Python. Show some ❤️ by starring this repository!

CVAT is free, online, interactive video and image annotation tool for computer vision

Implementation of the paper NAST: Non-Autoregressive Spatial-Temporal Transformer for Time Series Forecasting.

A keras-based real-time model for medical image segmentation (CFPNet-M)

Self-Supervised Image Denoising via Iterative Data Refinement

Multi-Agent Reinforcement Learning (MARL) method to learn scalable control polices for multi-agent target tracking.

Implementing a simplified copy of Shazam application from scratch using MinHashing and LSH.

Implementation of Artificial Neural Network Algorithm

Using machine learning to predict and analyze high and low reader engagement for New York Times articles posted to Facebook.

A learning-based data collection tool for human segmentation

This is the code related to "Sparse-to-dense Feature Matching: Intra and Inter domain Cross-modal Learning in Domain Adaptation for 3D Semantic Segmentation" (ICCV 2021).

Unsupervised Representation Learning by Invariance Propagation

Container : Context Aggregation Network

DISTIL: Deep dIverSified inTeractIve Learning.

A fast implementation of bss_eval metrics for blind source separation

Download SC2 into the `3rdparty/` folder and copy the maps necessary to run over.