A TensorFlow implementation of SOFA, the Simulator for OFfline leArning and evaluation.

Last update: Nov 23, 2022

Overview

SOFA

This repository implements SOFA, the Simulator for OFfline leArning and evaluation, introduced in the paper:

Keeping Dataset Biases out of the Simulation: A Debiased Simulator for Reinforcement Learning based Recommender Systems. Jin Huang, Harrie Oosterhuis, Maarten de Rijke, Herke van Hoof. RecSys 2020.

In RL4Rec (reinforcement learning for recommendation), the agent typically interacts with a simulation-based environment. A state is the user's historical interactions, an action is an item recommended by the recommender system (RS), and a reward is based on user feedback.
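The interaction described above can be sketched as a small loop. This is only an illustration, not SOFA's actual API: `ToySimulator`, `run_episode`, the hand-written `ratings` table, and the random policy are all hypothetical names and data introduced here.

```python
import random


class ToySimulator:
    """Hypothetical stand-in for a simulation-based environment (not SOFA's API)."""

    def __init__(self, ratings, episode_length=5, seed=0):
        self.ratings = ratings                # ratings[user][item] -> simulated feedback
        self.episode_length = episode_length
        self.rng = random.Random(seed)

    def reset(self):
        # Pick a simulated user; the state starts as an empty interaction history.
        self.user = self.rng.randrange(len(self.ratings))
        self.history = []
        return tuple(self.history)

    def step(self, item):
        # The reward is taken from the simulated user's feedback on the item.
        reward = self.ratings[self.user][item]
        self.history.append((item, reward))
        done = len(self.history) >= self.episode_length
        return tuple(self.history), reward, done


def run_episode(env, n_items, rng):
    """Run one episode with a random policy that never repeats an item."""
    state = env.reset()
    total, done = 0.0, False
    while not done:
        seen = {item for item, _ in state}
        action = rng.choice([i for i in range(n_items) if i not in seen])
        state, reward, done = env.step(action)
        total += reward
    return state, total


# Assumed toy data: 2 users, 4 items, feedback on a 1-5 scale.
ratings = [[5, 1, 3, 4], [2, 2, 5, 1]]
env = ToySimulator(ratings, episode_length=3)
final_state, total_reward = run_episode(env, n_items=4, rng=random.Random(1))
```

In an actual experiment, the random policy would be replaced by a learned one (for example the DQN-based policy mentioned under "More details"), and the feedback table by the simulator's user model.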

To counter the bias present in logged data, we introduce a debiasing step in the simulation pipeline. This step corrects for the biases in the logged data before the data is used to simulate user behavior.
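One common way to correct for selection bias in logged feedback is inverse propensity scoring (IPS), where each observed interaction is weighted by the inverse of its probability of being observed. The sketch below assumes IPS purely for illustration; this README does not state which debiasing method the code uses, and `ips_mse`, `errors`, and `propensities` are hypothetical names and values. It checks, by enumerating every possible observation pattern, that the IPS estimate matches the true mean error in expectation.

```python
from itertools import product


def ips_mse(observed, errors, propensities):
    """IPS estimate of the mean squared error over ALL user-item pairs,
    computed only from the pairs that were actually observed."""
    weighted = sum(errors[pair] / propensities[pair] for pair in observed)
    return weighted / len(errors)


# Assumed toy values: squared prediction error and observation
# probability (propensity) for every user-item pair.
errors = {("u1", "i1"): 4.0, ("u1", "i2"): 1.0}
propensities = {("u1", "i1"): 0.5, ("u1", "i2"): 0.25}

# The quantity we want: mean error over all pairs, observed or not.
true_mse = sum(errors.values()) / len(errors)

# Expectation of the IPS estimate over every observation pattern.
pairs = list(errors)
expected_ips = 0.0
for mask in product([0, 1], repeat=len(pairs)):
    prob = 1.0
    for bit, pair in zip(mask, pairs):
        prob *= propensities[pair] if bit else 1.0 - propensities[pair]
    observed = [pair for bit, pair in zip(mask, pairs) if bit]
    expected_ips += prob * ips_mse(observed, errors, propensities)
```

A user model trained with such a weighted loss is fitted to the full user-item matrix rather than only to the items users chose to rate, which is the kind of correction a debiasing step aims for.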

Running the code

$ cd examples
$ python run_dqn.py

More details

We provide the details of the DQN-based policy used in the experiments and the related hyperparameters (see Appendix). We also provide the slides used for the presentation at RecSys 2020.

Cite

If you use our code, please cite our paper:

@inproceedings{huang2020keeping,
  title={Keeping Dataset Biases out of the Simulation: A Debiased Simulator for Reinforcement Learning based Recommender Systems},
  author={Huang, Jin and Oosterhuis, Harrie and de Rijke, Maarten and van Hoof, Herke},
  booktitle={Fourteenth ACM Conference on Recommender Systems},
  pages={190--199},
  year={2020}
}
