Reinforcement Learning with Q-Learning Algorithm on gym's frozen lake environment implemented in python

Overview

Reinforcement Learning with Q Learning Algorithm

Q learning algorithm is trained on the gym's frozen lake environment.

Libraries Used

  • gym
  • Numpy
  • tqdm
  • Pytorch Deep Learning Framework

  • Install Requirement Files

    clone the repository or download the 'requirement.txt' files, then open terminal in the working directory and type
    'pip install -r requirements.txt'
    to install all the requirements for this project.

    Demo Video

    Q-learning.mp4
    Just-Now - This Is Just Now Login Friendlist Cloner Tools

    JUST NOW LOGIN FRIENDLIST CLONER TOOLS Install $ apt update $ apt upgrade $ apt

    MAHADI HASAN AFRIDI 21 Mar 09, 2022
    noisy labels; missing labels; semi-supervised learning; entropy; uncertainty; robustness and generalisation.

    ProSelfLC: CVPR 2021 ProSelfLC: Progressive Self Label Correction for Training Robust Deep Neural Networks For any specific discussion or potential fu

    amos_xwang 57 Dec 04, 2022
    Dark Finix: All in one hacking framework with almost 100 tools

    Dark Finix - Hacking Framework. Dark Finix is a all in one hacking framework wit

    Md. Nur habib 2 Feb 18, 2022
    Code for PackNet: Adding Multiple Tasks to a Single Network by Iterative Pruning

    PackNet: https://arxiv.org/abs/1711.05769 Pretrained models are available here: https://uofi.box.com/s/zap2p03tnst9dfisad4u0sfupc0y1fxt Datasets in Py

    Arun Mallya 216 Jan 05, 2023
    The Wearables Development Toolkit - a development environment for activity recognition applications with sensor signals

    Wearables Development Toolkit (WDK) The Wearables Development Toolkit (WDK) is a framework and set of tools to facilitate the iterative development of

    Juan Haladjian 114 Nov 27, 2022
    Torch implementation of various types of GAN (e.g. DCGAN, ALI, Context-encoder, DiscoGAN, CycleGAN, EBGAN, LSGAN)

    gans-collection.torch Torch implementation of various types of GANs (e.g. DCGAN, ALI, Context-encoder, DiscoGAN, CycleGAN, EBGAN). Note that EBGAN and

    Minchul Shin 53 Jan 22, 2022
    Mitsuba 2: A Retargetable Forward and Inverse Renderer

    Mitsuba Renderer 2 Documentation Mitsuba 2 is a research-oriented rendering system written in portable C++17. It consists of a small set of core libra

    Mitsuba Physically Based Renderer 2k Jan 07, 2023
    Load What You Need: Smaller Multilingual Transformers for Pytorch and TensorFlow 2.0.

    Smaller Multilingual Transformers This repository shares smaller versions of multilingual transformers that keep the same representations offered by t

    Geotrend 79 Dec 28, 2022
    TalkingHead-1KH is a talking-head dataset consisting of YouTube videos

    TalkingHead-1KH Dataset TalkingHead-1KH is a talking-head dataset consisting of YouTube videos, originally created as a benchmark for face-vid2vid: On

    173 Dec 29, 2022
    YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone

    YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone In our recent paper we propose the YourTTS model. YourTTS bri

    Edresson Casanova 390 Dec 29, 2022
    "Segmenter: Transformer for Semantic Segmentation" reproduced via mmsegmentation

    Segmenter-based-on-OpenMMLab "Segmenter: Transformer for Semantic Segmentation, arxiv 2105.05633." reproduced via mmsegmentation. We reproduce Segment

    EricKani 22 Feb 24, 2022
    Applying curriculum to meta-learning for few shot classification

    Curriculum Meta-Learning for Few-shot Classification We propose an adaptation of the curriculum training framework, applicable to state-of-the-art met

    Stergiadis Manos 3 Oct 25, 2022
    基于Flask开发后端、VUE开发前端框架,在WEB端部署YOLOv5目标检测模型

    基于Flask开发后端、VUE开发前端框架,在WEB端部署YOLOv5目标检测模型

    37 Jan 01, 2023
    [CVPR 2016] Unsupervised Feature Learning by Image Inpainting using GANs

    Context Encoders: Feature Learning by Inpainting CVPR 2016 [Project Website] [Imagenet Results] Sample results on held-out images: This is the trainin

    Deepak Pathak 829 Dec 31, 2022
    SEJE Pytorch implementation

    SEJE is a prototype for the paper Learning Text-Image Joint Embedding for Efficient Cross-Modal Retrieval with Deep Feature Engineering. Contents Inst

    0 Oct 21, 2021
    Code for "Long Range Probabilistic Forecasting in Time-Series using High Order Statistics"

    Long Range Probabilistic Forecasting in Time-Series using High Order Statistics This is the code produced as part of the paper Long Range Probabilisti

    16 Dec 06, 2022
    A complete speech segmentation system using Kaldi and x-vectors for voice activity detection (VAD) and speaker diarisation.

    bbc-speech-segmenter: Voice Activity Detection & Speaker Diarization A complete speech segmentation system using Kaldi and x-vectors for voice activit

    BBC 16 Oct 27, 2022
    Tensor-based approaches for fMRI classification

    tensor-fmri Using tensor-based approaches to classify fMRI data from StarPLUS. Citation If you use any code in this repository, please cite the follow

    4 Sep 07, 2022
    Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.

    Decision Transformer Lili Chen*, Kevin Lu*, Aravind Rajeswaran, Kimin Lee, Aditya Grover, Michael Laskin, Pieter Abbeel, Aravind Srinivas†, and Igor M

    Kevin Lu 1.4k Jan 07, 2023
    Kinetics-Data-Preprocessing

    Kinetics-Data-Preprocessing Kinetics-400 and Kinetics-600 are common video recognition datasets used by popular video understanding projects like Slow

    Kaihua Tang 7 Oct 27, 2022