PyTorch implementation of the ideas presented in the paper Interaction Grounded Learning (IGL)

Last update: Aug 31, 2022

Overview

Interaction Grounded Learning

This repository contains a simple PyTorch implementation of the ideas presented in the paper Interaction Grounded Learning (IGL) from Xie et al., 2021. This repository is also accompanied by a short blog post I wrote on the topic, which is available here.

In IGL, rather than being provided with a reward signal from the environment, a feedback signal is provided instead which corresponds in some way to the true latent reward. The task is to learn both a policy for optimizing against the true reward, as well as a decoder for learning a proxy reward from the feedback signal.

My implementation differs slightly from that of the original paper, but converges consistently on the MNIST digit identification task, and is robust to hyperparameters and initialization seeds. Performance of IGL method is comparable to that of contextual bandit with access to ground truth reward.

The code can be found in the Jupyter notebook here.

Requirements

Python 3
PyTorch
TorchVision
PyPlot
Jupyter-Lab

PyTorch implementation of the ideas presented in the paper Interaction Grounded Learning (IGL)

Related tags

Overview

Interaction Grounded Learning

Requirements

Owner

Arthur Juliani

Pmapper is a super-resolution and deconvolution toolkit for python 3.6+

ECLARE: Extreme Classification with Label Graph Correlations

Source code for "UniRE: A Unified Label Space for Entity Relation Extraction.", ACL2021.

Pytorch implementation of

Music Source Separation; Train & Eval & Inference piplines and pretrained models we used for 2021 ISMIR MDX Challenge.

transfer attack; adversarial examples; black-box attack; unrestricted Adversarial Attacks on ImageNet; CVPR2021 天池黑盒竞赛

Source code for Acorn, the precision farming rover by Twisted Fields

This repo contains code to reproduce all experiments in Equivariant Neural Rendering

Audio-Visual Generalized Few-Shot Learning with Prototype-Based Co-Adaptation

GeneDisco is a benchmark suite for evaluating active learning algorithms for experimental design in drug discovery.

MQBench: Towards Reproducible and Deployable Model Quantization Benchmark

Makes patches from huge resolution .svs slide files using openslide

Implementation of self-attention mechanisms for general purpose. Focused on computer vision modules. Ongoing repository.

Seach Losses of our paper 'Loss Function Discovery for Object Detection via Convergence-Simulation Driven Search', accepted by ICLR 2021.

SPTAG: A library for fast approximate nearest neighbor search

TreeSubstitutionCipher - Encryption system based on trees and substitution

Code for the paper "Can Active Learning Preemptively Mitigate Fairness Issues?" presented at RAI 2021.

Tensorflow implementation of Swin Transformer model.

ArcaneGAN by Alex Spirin

Development of IP code based on VIPs and AADM