PyTorch implementation for paper StARformer: Transformer with State-Action-Reward Representations.

Last update: Dec 09, 2022

Related tags

Overview

StARformer

This repository contains the PyTorch implementation for our paper titled StARformer: Transformer with State-Action-Reward Representations. We learn local State-Action-Reward representations (StAR-representations) to improve (long) sequence modeling for reinforcement learning (and imitation learning).

Results

Installation

Dependencies can be installed by Conda:

conda env create -f my_env.yml

And install Atari ROMs.

Datasets

Please follow this instruction for datasets.

Example usage

See run.sh or below:

python run_star_atari.py --seed 123 --data_dir_prefix [data_directory] --epochs 10 --num_steps 500000 --num_buffers 50 --batch_size 64 --seq_len 30 --model_type 'star' --game 'Breakout'

[data_directory] is where you place the Atari dataset.

Variants (`model_type`):

'star' (imitation)
'star_rwd' (offline RL)
'star_fusion' (see Figure 4a in our paper)
'star_stack' (see Figure 4b in our paper)

Acknowledgement

This code is based on Decision-Transformer.

PyTorch implementation for paper StARformer: Transformer with State-Action-Reward Representations.

Related tags

Overview

StARformer

Results

Installation

Datasets

Example usage

Variants (`model_type`):

Acknowledgement

Owner

Jinghuan Shang

Self-Supervised Pre-Training for Transformer-Based Person Re-Identification

The official implementation of our CVPR 2021 paper - Hybrid Rotation Averaging: A Fast and Robust Rotation Averaging Approach

Official Code for VideoLT: Large-scale Long-tailed Video Recognition (ICCV 2021)

Code for NeurIPS2021 submission "A Surrogate Objective Framework for Prediction+Programming with Soft Constraints"

A tool for calculating distortion parameters in coordination complexes.

Framework to build and train RL algorithms

Learning Domain Invariant Representations in Goal-conditioned Block MDPs

PyTorch implementation of Advantage async actor-critic Algorithms (A3C) in PyTorch

Dynamic Visual Reasoning by Learning Differentiable Physics Models from Video and Language (NeurIPS 2021)

Computer vision - fun segmentation experience using classic and deep tools :)

ML for NLP and Computer Vision.

a simple, efficient, and intuitive text editor

Codebase for Image Classification Research, written in PyTorch.

A benchmark for the task of translation suggestion

Tiny Object Detection in Aerial Images.

The ICS Chat System project for NYU Shanghai Fall 2021

EquiBind: Geometric Deep Learning for Drug Binding Structure Prediction

Apache Flink

Official implementation of the paper "Light Field Networks: Neural Scene Representations with Single-Evaluation Rendering"

PyTorch implementation for paper StARformer: Transformer with State-Action-Reward Representations.

Related tags

Overview

StARformer

Results

Installation

Datasets

Example usage

Variants (model_type):

Acknowledgement

Owner

Jinghuan Shang

Self-Supervised Pre-Training for Transformer-Based Person Re-Identification

The official implementation of our CVPR 2021 paper - Hybrid Rotation Averaging: A Fast and Robust Rotation Averaging Approach

Official Code for VideoLT: Large-scale Long-tailed Video Recognition (ICCV 2021)

Code for NeurIPS2021 submission "A Surrogate Objective Framework for Prediction+Programming with Soft Constraints"

A tool for calculating distortion parameters in coordination complexes.

Framework to build and train RL algorithms

Learning Domain Invariant Representations in Goal-conditioned Block MDPs

PyTorch implementation of Advantage async actor-critic Algorithms (A3C) in PyTorch

Dynamic Visual Reasoning by Learning Differentiable Physics Models from Video and Language (NeurIPS 2021)

Computer vision - fun segmentation experience using classic and deep tools :)

ML for NLP and Computer Vision.

a simple, efficient, and intuitive text editor

Codebase for Image Classification Research, written in PyTorch.

A benchmark for the task of translation suggestion

Tiny Object Detection in Aerial Images.

The ICS Chat System project for NYU Shanghai Fall 2021

EquiBind: Geometric Deep Learning for Drug Binding Structure Prediction

Apache Flink

Official implementation of the paper "Light Field Networks: Neural Scene Representations with Single-Evaluation Rendering"

Variants (`model_type`):