AAAI-22 paper: SimSR: Simple Distance-based State Representationfor Deep Reinforcement Learning

Last update: Dec 19, 2022

Related tags

Overview

SimSR

Code and dataset for the paper SimSR: Simple Distance-based State Representationfor Deep Reinforcement Learning (AAAI-22).

Requirements

We assume you have access to a gpu that can run CUDA 11. All of the dependencies are in the conda_env.yml file.

conda env create -f conda_env.yml

After the instalation ends you can activate your environment with

conda activate simsr

Instructions

To train a SimSR agent on the cartpole swingup task from image-based observations run bash run.sh from the root of this directory. The run.sh file contains the following command, which you can modify to try different environments / hyperparamters.

DOMAIN=cartpole
TASK=swingup
SEED=1

MUJOCO_GL="egl" CUDA_VISIBLE_DEVICES=0 nohup python -u train.py \
	--domain_name ${DOMAIN} \
	--task_name ${TASK} \
	--encoder_type pixel \
	--action_repeat 4 \
	--pre_transform_image_size 84 \
	--image_size 84 \
	--work_dir ./tmp \
	--agent simsr_sac \
	--frame_stack 3\
	--seed ${SEED} --critic_lr 1e-3 \
	--actor_lr 1e-3 \
	--eval_freq 10000 \
	--batch_size 128 \
	--num_train_steps 260000 > ${DOMAIN}_${TASK}_${SEED}.log &

Note that the MuJoCo Python bindings support three different OpenGL rendering backends: "glfw", "egl", or "osmesa". You can also specify a particular backend to use by setting the MUJOCO_GL= environment variable to one of them.

To visualize progress with tensorboard run:

tensorboard --logdir ./path/to/your/log --port 6006

References

Please cite the paper SimSR: Simple Distance-based State Representationfor Deep Reinforcement Learning if you found the resources in the repository useful.

AAAI-22 paper: SimSR: Simple Distance-based State Representationfor Deep Reinforcement Learning

Related tags

Overview

SimSR

Requirements

Instructions

References

Owner

FCOS: Fully Convolutional One-Stage Object Detection (ICCV'19)

:hot_pepper: R²SQL: "Dynamic Hybrid Relation Network for Cross-Domain Context-Dependent Semantic Parsing." (AAAI 2021)

A simple Python configuration file operator.

Codes and models of NeurIPS2021 paper - DominoSearch: Find layer-wise fine-grained N:M sparse schemes from dense neural networks

traiNNer is an open source image and video restoration (super-resolution, denoising, deblurring and others) and image to image translation toolbox based on PyTorch.

A TensorFlow implementation of FCN-8s

Code in conjunction with the publication 'Contrastive Representation Learning for Hand Shape Estimation'

transfer attack; adversarial examples; black-box attack; unrestricted Adversarial Attacks on ImageNet; CVPR2021 天池黑盒竞赛

moving object detection for satellite videos.

A High-Level Fusion Scheme for Circular Quantities published at the 20th International Conference on Advanced Robotics

TraSw for FairMOT - A Single-Target Attack example (Attack ID: 19; Screener ID: 24):

Fast Soft Color Segmentation

SurvITE: Learning Heterogeneous Treatment Effects from Time-to-Event Data

Code for Graph-to-Tree Learning for Solving Math Word Problems (ACL 2020)

The full training script for Enformer (Tensorflow Sonnet) on TPU clusters

This is a Tensorflow implementation of Learning to See in the Dark in CVPR 2018

The 1st place solution of track2 (Vehicle Re-Identification) in the NVIDIA AI City Challenge at CVPR 2021 Workshop.

A no-BS, dead-simple training visualizer for tf-keras

Highly comparative time-series analysis

Learning 3D Part Assembly from a Single Image