An experiment on the performance of homemade Q-learning AIs in Agar.io depending on their state representation and available actions

Last update: Jun 09, 2022

Overview

Agar.io_Q-Learning_AI

An experiment on the performance of homemade Q-learning AIs in Agar.io depending on their state representation and available actions.

An image of the circle categorisation function in action. Food blobs are outlined in blue, edible cells in green and dangerous cells in red, according to where our program detects them. Circles at the screen edges mess that up a bit. The agent's action at this moment is shown by the green arrow.

States are calculated using the shortest Euclidean distance to each of the three circle types: food, edible cells and dangerous cells. Each distance is measured and then discretized according to the interval it falls within. The rulers in this image are to scale.
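The state calculation described above could be sketched roughly as follows. This is a minimal illustration and not the project's actual code: the interval edges in `BIN_EDGES`, the function names, and the choice to measure to circle centres are all assumptions made for the example.

```python
import bisect
import math

# Hypothetical interval edges in pixels; the real cut-offs are not stated in this README.
BIN_EDGES = [50, 100, 200, 400]


def nearest_distance(agent_pos, circles):
    """Shortest Euclidean distance from the agent to any circle centre.

    Returns infinity when no circle of that type is on screen.
    """
    return min((math.dist(agent_pos, c) for c in circles), default=math.inf)


def discretize(distance, edges=BIN_EDGES):
    """Index of the interval the distance falls in (0 = closest, len(edges) = farthest)."""
    return bisect.bisect_right(edges, distance)


def make_state(agent_pos, food, edible, dangerous):
    """Build the state as one interval index per circle type."""
    groups = (food, edible, dangerous)
    return tuple(discretize(nearest_distance(agent_pos, g)) for g in groups)


# Example: one food blob, one edible cell, no dangerous cells in view.
state = make_state((0, 0), food=[(30, 40)], edible=[(0, 150)], dangerous=[])
```

With five intervals per circle type this gives 5 x 5 x 5 = 125 possible states, which is small enough for a plain lookup table.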

Currently the agent can't press any keyboard buttons; it can only move around using the mouse. Keyboard actions could be added without too much hassle, but that would require reworking some aspects of the code and a ton of training, which already takes ages. The Q-learning part could also do with a proper implementation of stochastic Q-learning instead of our generic iterative Q-learning, if I knew how to do it. I look forward to learning that at a later point.
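For readers new to the method, the textbook tabular Q-learning update with epsilon-greedy action selection looks roughly like this. It is a generic sketch and not the code in this repository: the action names, the hyperparameters `alpha`, `gamma` and `epsilon`, and all function names are assumptions chosen for the example.

```python
import random
from collections import defaultdict

# Hypothetical mouse-only action set; the project's real actions may differ.
ACTIONS = ["up", "down", "left", "right"]


def make_q_table():
    """Q-table mapping state -> {action: value}, with every value starting at 0."""
    return defaultdict(lambda: {a: 0.0 for a in ACTIONS})


def choose_action(q, state, epsilon=0.1, rng=random):
    """Epsilon-greedy: explore with probability epsilon, otherwise take the best known action."""
    if rng.random() < epsilon:
        return rng.choice(ACTIONS)
    return max(q[state], key=q[state].get)


def update(q, state, action, reward, next_state, alpha=0.1, gamma=0.9):
    """Standard update: Q(s,a) += alpha * (r + gamma * max_a' Q(s',a') - Q(s,a))."""
    best_next = max(q[next_state].values())
    q[state][action] += alpha * (reward + gamma * best_next - q[state][action])


# Example: a single rewarded step between two discretized states.
q = make_q_table()
update(q, state=(1, 2, 4), action="up", reward=1.0, next_state=(0, 2, 4))
greedy = choose_action(q, (1, 2, 4), epsilon=0.0)
```

Adding keyboard actions would mean extending `ACTIONS`, which enlarges the table and is one reason training time would grow.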

Feel free to ask any questions about the code or the project. I hope you enjoy!

The humans in the experiment were restricted to the same move set as the bots and agents: mouse movement only.
