DrQ-v2: Improved Data-Augmented Reinforcement Learning

Last update: Jan 01, 2023

Related tags

Overview

DrQ-v2: Improved Data-Augmented RL Agent

Method

DrQ-v2 is a model-free off-policy algorithm for image-based continuous control. DrQ-v2 builds on DrQ, an actor-critic approach that uses data augmentation to learn directly from pixels. We introduce several improvements including:

Switch the base RL learner from SAC to DDPG.
Incorporate n-step returns to estimate TD error.
Introduce a decaying schedule for exploration noise.
Make implementation 3.5 times faster.
Find better hyper-parameters.

These changes allow us to significantly improve sample efficiency and wall-clock training time on a set of challening tasks from the DeepMind Control Suite compared to prior methods. Furthermore, DrQ-v2 is able to solve complex humanoid locomotion tasks directly from pixel observations, previously unattained by model-free RL.

Citation

If you use this repo in your research, please consider citing the paper as follows:

@article{yarats2021drqv2,
  title={Mastering Visual Continuous Control: Improved Data-Augmented Reinforcement Learning},
  author={Denis Yarats and Rob Fergus and Alessandro Lazaric and Lerrel Pinto},
  journal={arXiv preprint arXiv:},
  year={2021}
}

Instructions

Install dependencies:

conda env create -f conda_env.yml
conda activate drqv2

Train the agent:

python train.py task=quadruped_walk

Monitor results:

tensorboard --logdir exp_local

License

The majority of DrQ-v2 is licensed under the MIT license, however portions of the project are available under separate license terms: DeepMind is licensed under the Apache 2.0 license.

DrQ-v2: Improved Data-Augmented Reinforcement Learning

Related tags

Overview

DrQ-v2: Improved Data-Augmented RL Agent

Method

Citation

Instructions

License

Owner

Facebook Research

An official TensorFlow implementation of “CLCC: Contrastive Learning for Color Constancy” accepted at CVPR 2021.

Pseudo lidar - (CVPR 2019) Pseudo-LiDAR from Visual Depth Estimation: Bridging the Gap in 3D Object Detection for Autonomous Driving

Pytoydl: A toy deep learning framework built upon numpy.

Convex optimization for fun and profit.

Some simple programs built in Python: webcam with cv2 that detects eyes and face, with grayscale filter

[SDM 2022] Towards Similarity-Aware Time-Series Classification

This repository contains Prior-RObust Bayesian Optimization (PROBO) as introduced in our paper "Accounting for Gaussian Process Imprecision in Bayesian Optimization"

Quantized models with python

Code for the paper "Reinforcement Learning as One Big Sequence Modeling Problem"

This repository for project that can Automate Number Plate Recognition (ANPR) in Morocco Licensed Vehicles. 💻 + 🚙 + 🇲🇦 = 🤖 🕵🏻‍♂️

Face Mask Detection is a project to determine whether someone is wearing mask or not, using deep neural network.

Official repository of ICCV21 paper "Viewpoint Invariant Dense Matching for Visual Geolocalization"

FastFace: Lightweight Face Detection Framework

A large-image collection explorer and fast classification tool

StarGAN - Official PyTorch Implementation (CVPR 2018)

Boundary IoU API (Beta version)

Unofficial implementation of "TTNet: Real-time temporal and spatial video analysis of table tennis" (CVPR 2020)

The official re-implementation of the Neurips 2021 paper, "Targeted Neural Dynamical Modeling".

Deep Learning Specialization by Andrew Ng, deeplearning.ai.

[CVPR'22] Official PyTorch Implementation of Collaborative Transformers for Grounded Situation Recognition