Dynamic Bottleneck for Robust Self-Supervised Exploration

Last update: Nov 14, 2022

Related tags

Deep Learning DB

Overview

Dynamic Bottleneck

Introduction

This is a TensorFlow based implementation for our paper on

"Dynamic Bottleneck for Robust Self-Supervised Exploration". NeurIPS 2021

Prerequisites

python3.6 or 3.7, tensorflow-gpu 1.x, tensorflow-probability, openAI baselines, openAI Gym

Installation and Usage

Atari games

The following command should train a pure exploration agent on "Breakout" with default experiment parameters.

python run.py --env BreakoutNoFrameskip-v4

Atari games with Random-Box noise

The following command should train a pure exploration agent on "Breakout" with randomBox noise.

python run.py --env BreakoutNoFrameskip-v4 --randomBoxNoise

Atari games with Gaussian noise

The following command should train a pure exploration agent on "Breakout" with Gaussian noise.

python run.py --env BreakoutNoFrameskip-v4 --pixelNoise

Atari games with sticky actions

The following command should train a pure exploration agent on "sticky Breakout" with a probability of 0.25

python run.py --env BreakoutNoFrameskip-v4 --stickyAtari

Baselines

ICM: We use the official code of "Curiosity-driven Exploration by Self-supervised Prediction, ICML 2017" and "Large-Scale Study of Curiosity-Driven Learning, ICLR 2019".
Disagreement: We use the official code of "Self-Supervised Exploration via Disagreement, ICML 2019".
CB: We use the official code of "Curiosity-Bottleneck: Exploration by Distilling Task-Specific Novelty, ICML 2019".

Dynamic Bottleneck for Robust Self-Supervised Exploration

Related tags

Overview

Dynamic Bottleneck

Introduction

Prerequisites

Installation and Usage

Atari games

Atari games with Random-Box noise

Atari games with Gaussian noise

Atari games with sticky actions

Baselines

Owner

Bai Chenjia

An Industrial Grade Federated Learning Framework

Collection of in-progress libraries for entity neural networks.

EZ graph is an easy to use AI solution that allows you to make and train your neural networks without a single line of code.

GemNet model in PyTorch, as proposed in "GemNet: Universal Directional Graph Neural Networks for Molecules" (NeurIPS 2021)

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

Real-ESRGAN aims at developing Practical Algorithms for General Image Restoration.

New AidForBlind - Various Libraries used like OpenCV and other mentioned in Requirements.txt

Facial recognition project

Code for Pose-Controllable Talking Face Generation by Implicitly Modularized Audio-Visual Representation (CVPR 2021)

TransFGU: A Top-down Approach to Fine-Grained Unsupervised Semantic Segmentation

InferPy: Deep Probabilistic Modeling with Tensorflow Made Easy

TransMVSNet: Global Context-aware Multi-view Stereo Network with Transformers.

Sum-Product Probabilistic Language

COD-Rank-Localize-and-Segment (CVPR2021)

PULSE: Self-Supervised Photo Upsampling via Latent Space Exploration of Generative Models

Gym for multi-agent reinforcement learning

Plaything for Autistic Children (demo for PaddlePaddle/Wechaty/Mixlab project)

Elastic weight consolidation technique for incremental learning.

Real-time ground filtering algorithm of cloud points acquired using Terrestrial Laser Scanner (TLS)

An Object Oriented Programming (OOP) interface for Ontology Web language (OWL) ontologies.