Cooperative multi-agent reinforcement learning for high-dimensional nonequilibrium control

Last update: Nov 16, 2021

Official implementation of:

Cooperative multi-agent reinforcement learning for high-dimensional nonequilibrium control

Shriram Chennakesavalu and Grant M. Rotskoff

https://arxiv.org/abs/2111.06875

Abstract: Experimental advances enabling high-resolution external control create new opportunities to produce materials with exotic properties. In this work, we investigate how a multi-agent reinforcement learning approach can be used to design external control protocols for self-assembly. We find that a fully decentralized approach performs remarkably well even with a "coarse" level of external control. More importantly, we see that a partially decentralized approach, where we include information about the local environment, allows us to better control our system towards some target distribution. We explain this by analyzing our approach as a partially observed Markov decision process. With a partially decentralized approach, the agent is able to act more presciently than with a fully decentralized approach, both by preventing the formation of undesirable structures and by better stabilizing target structures.

Installing prerequisites (using conda)

conda env create -f environment.yml -n marldesign
conda activate marldesign

Possible values for --centralize_approach are "plaquette", "all", and "grid_n", where n satisfies 1 < n < region_num/2

Sample training commands

python train.py --active --centralize_states --centralize_approach plaquette
python train.py --active --centralize_rewards --centralize_approach all
python train.py --centralize_rewards --centralize_states --centralize_approach grid_1

Sample testing commands

python test.py --active --num_samples 10 --centralize_states --centralize_approach plaquette
python test.py --active --num_samples 10 --centralize_rewards --centralize_approach grid_1
python test.py --centralize_rewards --num_samples 10 --centralize_states --centralize_approach grid_2
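
To run several of these configurations in a row, the command lines can be assembled programmatically. The following is a minimal sketch, not part of this repository: build_command is a hypothetical helper, it uses only the flags that appear in the sample commands above, and it assumes the scripts accept those flags in any order.

```python
import shlex
from typing import List, Optional


def build_command(script: str, approach: str, *, active: bool = False,
                  centralize_states: bool = False,
                  centralize_rewards: bool = False,
                  num_samples: Optional[int] = None) -> List[str]:
    """Assemble an argument list from the flags shown in the sample commands.

    Hypothetical helper: it does not validate `approach` and does not
    know about any flags beyond those listed in this README.
    """
    cmd = ["python", script]
    if active:
        cmd.append("--active")
    if num_samples is not None:
        cmd += ["--num_samples", str(num_samples)]
    if centralize_states:
        cmd.append("--centralize_states")
    if centralize_rewards:
        cmd.append("--centralize_rewards")
    cmd += ["--centralize_approach", approach]
    return cmd


if __name__ == "__main__":
    # Print one training command per approach instead of launching it.
    for approach in ("plaquette", "all", "grid_2"):
        cmd = build_command("train.py", approach, active=True,
                            centralize_states=True)
        print(shlex.join(cmd))
```

Each returned list can be passed to subprocess.run(cmd, check=True) from the repository root to launch the run; passing num_samples together with "test.py" produces the testing commands in the same way.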

For a more theoretical description of the systems described here, please visit https://github.com/rotskoff-group/dissipative-design
