NovelD: A Simple yet Effective Exploration Criterion

Intro

This is an implementation of the method proposed in

NovelD: A Simple yet Effective Exploration Criterion and BeBold: Exploration Beyond the Boundary of Explored Regions

Citation

If you use this code in your own work, please cite our paper:

@article{zhang2021noveld,
  title={NovelD: A Simple yet Effective Exploration Criterion},
  author={Zhang, Tianjun and Xu, Huazhe and Wang, Xiaolong and Wu, Yi and Keutzer, Kurt and Gonzalez, Joseph E and Tian, Yuandong},
  journal={Advances in Neural Information Processing Systems},
  volume={34},
  year={2021}
}

@article{zhang2020bebold,
  title={BeBold: Exploration Beyond the Boundary of Explored Regions},
  author={Zhang, Tianjun and Xu, Huazhe and Wang, Xiaolong and Wu, Yi and Keutzer, Kurt and Gonzalez, Joseph E and Tian, Yuandong},
  journal={arXiv preprint arXiv:2012.08621},
  year={2020}
}

Installation

# Install Instructions
conda create -n ride python=3.7
conda activate noveld 
git clone [email protected]:tianjunz/NovelD.git
cd NovelD
pip install -r requirements.txt

Train NovelD on MiniGrid

OMP_NUM_THREADS=1 python main.py --model bebold --env MiniGrid-ObstructedMaze-2Dlhb-v0 --total_frames 500000000 --intrinsic_reward_coef 0.05 --entropy_cost 0.0005

Acknowledgements

Our vanilla RL algorithm is based on RIDE.

License

This code is under the CC-BY-NC 4.0 (Attribution-NonCommercial 4.0 International) license.

NovelD: A Simple yet Effective Exploration Criterion

Related tags

Overview

NovelD: A Simple yet Effective Exploration Criterion

Intro

Citation

Installation

Train NovelD on MiniGrid

Acknowledgements

License

Owner

Automatic library of congress classification, using word embeddings from book titles and synopses.

PyTorch for Semantic Segmentation

CCPD: a diverse and well-annotated dataset for license plate detection and recognition

The repository contains source code and models to use PixelNet architecture used for various pixel-level tasks. More details can be accessed at .

Repository for the paper "Exploring the Sensory Spaces of English Perceptual Verbs in Natural Language Data"

Supplemental learning materials for "Fourier Feature Networks and Neural Volume Rendering"

Pytorch tutorials for Neural Style transfert

Towards uncontrained hand-object reconstruction from RGB videos

Official implementation of the method ContIG, for self-supervised learning from medical imaging with genomics

[arXiv'22] Panoptic NeRF: 3D-to-2D Label Transfer for Panoptic Urban Scene Segmentation

A PyTorch implementation of Sharpness-Aware Minimization for Efficiently Improving Generalization

TensorFlow-based neural network library

Evaluating deep transfer learning for whole-brain cognitive decoding

Prototypical python implementation of the trust-region algorithm presented in Sequential Linearization Method for Bound-Constrained Mathematical Programs with Complementarity Constraints by Larson, Leyffer, Kirches, and Manns.

This is the code for Deformable Neural Radiance Fields, a.k.a. Nerfies.

A PyTorch port of the Neural 3D Mesh Renderer

Example repository for custom C++/CUDA operators for TorchScript

Complementary Patch for Weakly Supervised Semantic Segmentation, ICCV21 (poster)

Run Effective Large Batch Contrastive Learning on Limited Memory GPU

PyTorch implemention of ICCV'21 paper SGPA: Structure-Guided Prior Adaptation for Category-Level 6D Object Pose Estimation