Running SB3-developed agents on TFLite or Coral

Introduction

I've been using Stable-Baselines3 to train agents against some custom Gyms, some of which require fairly large NNs in order to be effective.

I want those agents to eventually run on a Raspberry Pi or similar, so I need to export all the way to TFLite, and ideally to a Coral.

How to use

Setup

You will need to have the system-wide Coral components (such as the Edge TPU runtime) installed and configured first.
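
As a quick sanity check before going further, something like the following can tell you whether a Coral runtime library is visible to the system. This is only a sketch: it assumes the runtime is installed under the usual `libedgetpu` name and in a standard library location, and `find_edgetpu_runtime` is a hypothetical helper, not part of this repo.

```python
import ctypes.util


def find_edgetpu_runtime():
    """Return the shared-library name of the Edge TPU runtime, or None.

    Assumption: the runtime is installed as libedgetpu (its usual name on
    Linux). find_library only searches the standard system locations.
    """
    return ctypes.util.find_library("edgetpu")


if __name__ == "__main__":
    lib = find_edgetpu_runtime()
    if lib is None:
        print("Edge TPU runtime not found; Coral models will probably not load")
    else:
        print(f"Found Edge TPU runtime: {lib}")
```

A `None` result does not prove the runtime is missing (it may live in a non-standard path), but it is a cheap first thing to check.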

Build a venv:

python3 -m venv venv
source venv/bin/activate
python3 -m pip install -r requirements.txt

Running

This comes with enough defaults to run an end-to-end demonstration, but every script takes command-line arguments so I can adjust to taste for my actual use case.
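
The "defaults plus overrides" pattern can be sketched with `argparse` as below. The argument names and default values here are assumptions for illustration (only the environment name `MountainCarContinuous-v0` appears in this README, and whether it is the actual default is a guess); check each script's `--help` for the real options.

```python
import argparse


def build_parser():
    # Hypothetical argument names and defaults; the real scripts may differ.
    parser = argparse.ArgumentParser(description="Run a converted agent")
    # nargs="?" makes a positional argument optional, so running the script
    # with no arguments falls back to the defaults.
    parser.add_argument("env", nargs="?", default="MountainCarContinuous-v0",
                        help="Gym environment id")
    parser.add_argument("model", nargs="?", default="model",
                        help="model file name prefix")
    return parser


def parse(argv=None):
    """Parse argv (or sys.argv[1:] when argv is None)."""
    return build_parser().parse_args(argv)
```

With this shape, `parse([])` yields the defaults, while `parse(["MountainCarContinuous-v0", "model_quant_edgetpu"])` overrides both.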

# Train an agent with SB3
python3 ./train.py

# Convert model
python3 ./model_conv.py
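
The Coral only runs 8-bit integer-quantized TFLite models, which is presumably why a `model_quant_edgetpu` model shows up in the commands here. As a rough illustration of the arithmetic involved (not the converter's actual code, which this README does not show), affine int8 quantization can be sketched in plain Python. All function names below are hypothetical.

```python
def quant_params(x_min, x_max, q_min=-128, q_max=127):
    """Compute (scale, zero_point) mapping [x_min, x_max] onto int8."""
    # The representable range must include 0.0 so that zero maps exactly.
    x_min = min(x_min, 0.0)
    x_max = max(x_max, 0.0)
    if x_max == x_min:
        return 1.0, 0
    scale = (x_max - x_min) / (q_max - q_min)
    zero_point = int(round(q_min - x_min / scale))
    return scale, zero_point


def quantize(x, scale, zero_point, q_min=-128, q_max=127):
    """Map a float to int8, clamping values outside the calibrated range."""
    q = int(round(x / scale)) + zero_point
    return max(q_min, min(q_max, q))


def dequantize(q, scale, zero_point):
    """Map an int8 value back to its approximate float value."""
    return (q - zero_point) * scale
```

The round trip loses at most about one quantization step per value, which is one reason a quantized agent can behave slightly differently from the original and is worth running side by side with it.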

# Run original SB3 model
python3 ./run_sb3.py
# Run the onnx model
python3 ./run_onnx.py
# Run the TFLite model
python3 ./run_tflite.py
# Run the Coral model (a model name containing "edgetpu" makes the script try to load the Coral)
python3 ./run_tflite.py MountainCarContinuous-v0 model_quant_edgetpu
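
The name-based switch in the last command could look roughly like the sketch below. It is not the actual `run_tflite.py`: it assumes `tflite_runtime` is installed, that the Edge TPU runtime is available as `libedgetpu.so.1` (its usual name on Linux), and that `model_path` is a full path to the model file. `wants_edgetpu` and `make_interpreter` are hypothetical names.

```python
def wants_edgetpu(model_name):
    """True if the model name asks for the Coral, per the naming convention."""
    return "edgetpu" in model_name


def make_interpreter(model_path):
    """Build a TFLite interpreter, attaching the Edge TPU delegate if asked.

    Assumptions: tflite_runtime is installed, and the Edge TPU runtime is
    available as libedgetpu.so.1.
    """
    # Imported lazily so the name check above works without TFLite installed.
    from tflite_runtime.interpreter import Interpreter, load_delegate

    delegates = []
    if wants_edgetpu(model_path):
        delegates.append(load_delegate("libedgetpu.so.1"))
    interpreter = Interpreter(model_path=model_path,
                              experimental_delegates=delegates or None)
    interpreter.allocate_tensors()
    return interpreter
```

Keeping the decision in a small pure function makes it easy to check the convention without having a Coral attached.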

Cheers,
Gary

Simple converter for deploying Stable-Baselines3 models to TFLite and/or Coral

Owner

Gary Briggs
