Admin Panels
Algorithms
Asset Management
Audio
Authentication
More Categories
Boilerplate Build Tools Caching CMS Code Analysis Code Refactoring Code review tool Command-line Interface Development Command-line Tools Communication Computer Vision Concurrency and Parallelism Configuration Cryptography Data Analysis Data Containers Data Serialization Data Structures Data Validation Data Visualization Database Database Drivers Date & Time Utilities Debugging Tools Deep Learning Deep Learning Model Explanation DevOps Tools Distributed Computing Distribution Django Documentation Downloader E-commerce Editor Plugins Email Environment Management FastAPI Projects FastAPI Utilities Feature Engineering File & Path Utilities Finance Flask Forms Functional Programming Game Development General Utilities Geolocation GPU Utilities GraphQL GUI Development Hardware HTML Manipulation HTTP Clients IDE Image Processing Implementations of Python Internationalization Interpreter Job Scheduler JSON Linters & Style Checkers Logging Machine Learning Markdown/YAML Microsoft Windows Miscellaneous Monitoring Network Virtualization Networking Office Files Processing Organization ORM Package Management Payment Processing PDF Files Processing Performance optimization Pipelines Process Utilities Productivity PyTorch Learning Resources Pytorch Utilities Recommender Systems Reinforcement Learning RESTful API RPC Servers Science SCM Search Security related resources Serialization Serverless Frameworks Sklearn Utilities Specific Formats Processing Static Site Generator Storage Task Queues Template Engine Testing Text Data & NLP Text Processing Third-party APIs Wrappers URL Manipulation Video Web Asset Management Web Content Extracting Web Crawling Web Frameworks WebSocket WSGI Servers
Popular Repo
Latest Repo
Resources
All Article News Book Tutorial

Overview
Comments 1
Releases

Reinforcement Learning Theory Book (rus)

Last update: Nov 27, 2022

Related tags

Deep Learning RL-Theory-book

Overview

Reinforcement Learning Theory Book (rus)

Full book on Arxiv: https://arxiv.org/abs/2201.09746

Ch. 1: Introduction
Ch. 2: Meta-heuristics
- NEAT, WANN
- CEM, OpenAI-ES, CMA-ES
Ch. 3: Classic theory
- Bellman equations
- RPI, policy improv. theorem
- Value Iteration, Generalized Policy Iteration
- Temporal Difference, Q-learning, SARSA
- Eligibility Traces, TD-lambda, Retrace
Ch. 4: Value-based
- DQN
- Double DQN, Dueling DQN, PER, Noisy DQN, Multi-step DQN
- c51, QR-DQN, IQN, Rainbow DQN
Ch. 5: Policy Gradient
- REINFORCE, A2C, GAE
- TRPO, PPO
Ch. 6: Continuous Control
- DDPG, TD3
- SAC
Ch. 7: Model-based
- Bandits
- MCTS, AlphaZero, MuZero
- LQR
Ch. 8: Next Stage
- Imitation Learning / Inverse Reinforcement Learning
- Intrinsic Motivation
- Multi-Task and Hindsight
- Hierarchical RL
- Partial observability
- Multi-Agent RL

Owner

qbrick

qbrick

GitHub Repository

RNG-KBQA: Generation Augmented Iterative Ranking for Knowledge Base Question Answering

RNG-KBQA: Generation Augmented Iterative Ranking for Knowledge Base Question Answering Authors: Xi Ye, Semih Yavuz, Kazuma Hashimoto, Yingbo Zhou and

72 Dec 05, 2022

The software associated with a paper accepted at EMNLP 2021 titled "Open Knowledge Graphs Canonicalization using Variational Autoencoders".

Open-KG-canonicalization The software associated with a paper accepted at EMNLP 2021 titled "Open Knowledge Graphs Canonicalization using Variational

13 Nov 11, 2022

Kinetics-Data-Preprocessing

Kinetics-Data-Preprocessing Kinetics-400 and Kinetics-600 are common video recognition datasets used by popular video understanding projects like Slow

7 Oct 27, 2022

Everything's Talkin': Pareidolia Face Reenactment (CVPR2021)

Everything's Talkin': Pareidolia Face Reenactment (CVPR2021) Linsen Song, Wayne Wu, Chaoyou Fu, Chen Qian, Chen Change Loy, and Ran He [Paper], [Video

71 Dec 21, 2022

minimizer-space de Bruijn graphs (mdBG) for whole genome assembly

rust-mdbg: Minimizer-space de Bruijn graphs (mdBG) for whole-genome assembly rust-mdbg is an ultra-fast minimizer-space de Bruijn graph (mdBG) impleme

148 Dec 01, 2022

Image Segmentation using U-Net, U-Net with skip connections and M-Net architectures

Brain-Image-Segmentation Segmentation of brain tissues in MRI image has a number of applications in diagnosis, surgical planning, and treatment of bra

8 Oct 27, 2022

Open source annotation tool for machine learning practitioners.

doccano doccano is an open source text annotation tool for humans. It provides annotation features for text classification, sequence labeling and sequ

7.1k Jan 01, 2023

Collect super-resolution related papers, data, repositories

Collect super-resolution related papers, data, repositories

1.7k Jan 03, 2023

Official Repo for ICCV2021 Paper: Learning to Regress Bodies from Images using Differentiable Semantic Rendering

[ICCV2021] Learning to Regress Bodies from Images using Differentiable Semantic Rendering Getting Started DSR has been implemented and tested on Ubunt

83 Nov 27, 2022

Camera calibration & 3D pose estimation tools for AcinoSet

AcinoSet: A 3D Pose Estimation Dataset and Baseline Models for Cheetahs in the Wild Daniel Joska, Liam Clark, Naoya Muramatsu, Ricardo Jericevich, Fre

42 Nov 16, 2022

Clustering with variational Bayes and population Monte Carlo

pypmc pypmc is a python package focusing on adaptive importance sampling. It can be used for integration and sampling from a user-defined target densi

45 Feb 06, 2022

Code for CVPR 2018 paper --- Texture Mapping for 3D Reconstruction with RGB-D Sensor

G2LTex This repository contains the implementation of "Texture Mapping for 3D Reconstruction with RGB-D Sensor (CVPR2018)" based on mvs-texturing. Due

129 Dec 30, 2022

PyTorch code for the paper: FeatMatch: Feature-Based Augmentation for Semi-Supervised Learning

FeatMatch: Feature-Based Augmentation for Semi-Supervised Learning This is the PyTorch implementation of our paper: FeatMatch: Feature-Based Augmentat

43 Nov 19, 2022

Auto White-Balance Correction for Mixed-Illuminant Scenes

Auto White-Balance Correction for Mixed-Illuminant Scenes Mahmoud Afifi, Marcus A. Brubaker, and Michael S. Brown York University Video Reference code

47 Nov 26, 2022

A simple Rock-Paper-Scissors game using CV in python

ML18_Rock-Paper-Scissors-using-CV A simple Rock-Paper-Scissors game using CV in python For IITISOC-21 Rules and procedure to play the interactive game

3 Aug 08, 2021

Pytorch implementation of Feature Pyramid Network (FPN) for Object Detection

fpn.pytorch Pytorch implementation of Feature Pyramid Network (FPN) for Object Detection Introduction This project inherits the property of our pytorc

912 Dec 21, 2022

Time-stretch audio clips quickly with PyTorch (CUDA supported)! Additional utilities for searching efficient transformations are included.

Time-stretch audio clips quickly with PyTorch (CUDA supported)! Additional utilities for searching efficient transformations are included.

22 Jul 07, 2022

Text Summarization - WCN — Weighted Contextual N-gram method for evaluation of Text Summarization

Text Summarization WCN — Weighted Contextual N-gram method for evaluation of Text Summarization In this project, I fine tune T5 model on Extreme Summa

1 Jan 03, 2022

PyTorch implementation of the paper: Label Noise Transition Matrix Estimation for Tasks with Lower-Quality Features

Label Noise Transition Matrix Estimation for Tasks with Lower-Quality Features Estimate the noise transition matrix with f-mutual information. This co

[email protected]"> 1 Jun 05, 2022

Source code of AAAI 2022 paper "Towards End-to-End Image Compression and Analysis with Transformers".

Towards End-to-End Image Compression and Analysis with Transformers Source code of our AAAI 2022 paper "Towards End-to-End Image Compression and Analy

37 Dec 21, 2022

2022.PythonRepo

About
Contact Us
DMCA
Disclaimer
Privacy Policy