[NeurIPS 2021] Official implementation of paper "Learning to Simulate Self-driven Particles System with Coordinated Policy Optimization".

Last update: Dec 19, 2022

Overview

Code for Coordinated Policy Optimization

Webpage | Code | Paper | Talk (English) | Talk (Chinese)

Hi there! This is the source code of the paper “Learning to Simulate Self-driven Particles System with Coordinated Policy Optimization”.

Please follow the tutorial below to start reproducing our results.

Installation

# Create virtual environment
conda create -n copo python=3.7
conda activate copo

# Install dependencies
pip install metadrive-simulator==0.2.3
pip install torch  # Make sure your torch is successfully installed! Especially when using GPU!

# Install environment and algorithm.
cd code
pip install -e .
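
The install comments above ask you to confirm that torch installed correctly, especially for GPU use. One way to check is sketched below. The helper name `check_torch` is hypothetical and not part of this repository; it relies only on `torch.__version__` and `torch.cuda.is_available()`.

```python
import importlib.util


def check_torch():
    """Report whether torch is importable and whether it can see a CUDA GPU.

    Hypothetical helper for illustration; not part of the CoPO code base.
    """
    # find_spec returns None when the package is not installed.
    if importlib.util.find_spec("torch") is None:
        return {"installed": False, "version": None, "cuda": False}

    import torch

    return {
        "installed": True,
        "version": torch.__version__,
        # False on CPU-only builds or when no usable GPU driver is found.
        "cuda": torch.cuda.is_available(),
    }


if __name__ == "__main__":
    print(check_torch())
```

If `cuda` is reported as false on a GPU machine, the installed torch build probably does not match your CUDA setup.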

Training

As a quick start, you can train CoPO in the Intersection environment immediately after installation by running:

cd code/copo/
python inter/train_copo_dist.py --exp-name inter_copo_dist

The general form of the training command is:

cd code/copo/
python ENV/train_ALGO.py --exp-name EXPNAME

Here ENV is the shorthand for the environment:

round  # Roundabout
inter  # Intersection
bottle  # Bottleneck
parking  # Parking Lot
tollgate  # Tollgate

and ALGO is the shorthand for algorithms:

ippo  # Individual Policy Optimization
ccppo  # Mean Field Policy Optimization
cl  # Curriculum Learning
copo_dist  # Coordinated Policy Optimization (Ours)
copo_dist_cc  # Coordinated Policy Optimization with Centralized Critics

Finally, EXPNAME is an arbitrary name for the experiment (which may run multiple concurrent trials), such as roundabout_copo.
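
The ENV/ALGO/EXPNAME pattern above can be composed programmatically, for example when launching several experiments from a script. The sketch below is an illustration only: `build_train_command` is a hypothetical helper, not part of this repository, and it assumes that every ENV and ALGO combination listed above has a matching `ENV/train_ALGO.py` script, which this README does not state explicitly.

```python
# Shorthands copied from the lists above.
ENVS = {
    "round": "Roundabout",
    "inter": "Intersection",
    "bottle": "Bottleneck",
    "parking": "Parking Lot",
    "tollgate": "Tollgate",
}
ALGOS = {
    "ippo": "Individual Policy Optimization",
    "ccppo": "Mean Field Policy Optimization",
    "cl": "Curriculum Learning",
    "copo_dist": "Coordinated Policy Optimization (Ours)",
    "copo_dist_cc": "Coordinated Policy Optimization with Centralized Critics",
}


def build_train_command(env, algo, exp_name):
    """Return the training command as an argument list.

    Hypothetical helper; the command is meant to be run from code/copo/.
    """
    if env not in ENVS:
        raise ValueError(f"Unknown ENV {env!r}; expected one of {sorted(ENVS)}")
    if algo not in ALGOS:
        raise ValueError(f"Unknown ALGO {algo!r}; expected one of {sorted(ALGOS)}")
    if not exp_name:
        raise ValueError("EXPNAME must be a non-empty string")
    return ["python", f"{env}/train_{algo}.py", "--exp-name", exp_name]


if __name__ == "__main__":
    # Prints the quick-start command shown earlier in this section.
    print(" ".join(build_train_command("inter", "copo_dist", "inter_copo_dist")))
```

The returned list can be passed to `subprocess.run(..., cwd="code/copo", check=True)` to launch the run. Validating the shorthands first catches a mistyped environment or algorithm name before Python reports a missing script.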

Visualization

We provide trained models for all algorithms in all environments. A single command visualizes the behavior of the trained populations:

cd copo
python vis.py 

# By default, this shows the CoPO population in the Intersection environment.
# If you want to see others, try:
python vis.py --env round --algo ippo

# Or you can use the native renderer for 3D rendering:
# (Press H to show helper message)
python vis.py --env tollgate --algo cl --use_native_render

We hope you enjoy the interesting behaviors learned in this work! Please feel free to contact us if you have any questions, thanks!

Citation

@misc{peng2021learning,
      title={Learning to Simulate Self-Driven Particles System with Coordinated Policy Optimization}, 
      author={Zhenghao Peng and Quanyi Li and Ka Ming Hui and Chunxiao Liu and Bolei Zhou},
      year={2021},
      eprint={2110.13827},
      archivePrefix={arXiv},
      primaryClass={cs.LG}
}
