(SIGIR2020) "Asymmetric Tri-training for Debiasing Missing-Not-At-Random Explicit Feedback"

Last update: Dec 01, 2022

Overview

Asymmetric Tri-training for Debiasing Missing-Not-At-Random Explicit Feedback

About

This repository accompanies the real-world experiments conducted in the paper "Asymmetric Tri-training for Debiasing Missing-Not-At-Random Explicit Feedback" by Yuta Saito, which has been accepted at SIGIR2020 as a full paper.

If you find this code useful in your research then please cite:

@inproceedings{saito2020asymmetric,
  title={Asymmetric tri-training for debiasing missing-not-at-random explicit feedback},
  author={Saito, Yuta},
  booktitle={Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval},
  year={2020}
}

Dependencies

numpy==1.17.2
pandas==0.25.1
scikit-learn==0.22.1
tensorflow==1.15.2
optuna==0.17.0
pyyaml==5.1.2

Running the code

To run the simulation with real-world datasets, first prepare the data:

Download the Coat dataset from https://www.cs.cornell.edu/~schnabts/mnar/ and put the train.ascii and test.ascii files into the ./data/coat/ directory.
Download the Yahoo! R3 dataset from https://webscope.sandbox.yahoo.com/catalog.php?datatype=r and put the train.txt and test.txt files into the ./data/yahoo/ directory.
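
Before launching the experiments, it can help to confirm that the four files are where the instructions above say they should be. The following is a minimal sketch, not part of the repository: the helper name `missing_data_files` and the default `./data` root are assumptions for illustration, and the root should be adjusted to wherever your `data` directory actually lives.

```python
from pathlib import Path

# Expected layout, taken from the download instructions above.
EXPECTED_FILES = {
    "coat": ["train.ascii", "test.ascii"],
    "yahoo": ["train.txt", "test.txt"],
}


def missing_data_files(data_root="./data"):
    """Return the expected dataset files that are not present under data_root.

    Hypothetical helper for illustration; it is not shipped with the repository.
    """
    root = Path(data_root)
    return [
        root / dataset / name
        for dataset, names in EXPECTED_FILES.items()
        for name in names
        if not (root / dataset / name).is_file()
    ]


if __name__ == "__main__":
    missing = missing_data_files()
    if missing:
        for path in missing:
            print(f"missing: {path}")
    else:
        print("all dataset files found")
```

An empty list means both datasets are in place; otherwise the returned paths show which files still need to be downloaded.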

Then, run the following commands in the ./src/ directory:

For the MF-IPS models without asymmetric tri-training:

for data in yahoo coat
do
  for model in uniform user item both nb nb_true
  do
    python main.py -d $data -m $model
  done
done

For the MF-IPS models with asymmetric tri-training (our proposal):

for data in coat yahoo
do
  for model in uniform-at user-at item-at both-at nb-at nb_true-at
  do
    python main.py -d $data -m $model
  done
done

Here, the model names (uniform, user, item, both, nb, nb_true) correspond to (uniform propensity, user propensity, item propensity, user-item propensity, NB (uniform), NB (true)), respectively. Appending -at to a model name selects the variant with asymmetric tri-training.

These commands run the real-world experiments reported in Section 5 of the paper. The tuned hyperparameters for all models can be found in ./hyper_params.yaml.
(Adding the -t option to the commands above re-runs the hyperparameter tuning procedure with Optuna.)
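
The model names, the -at suffix, and the -t option described above can also be enumerated programmatically, which is convenient when you want to inspect or schedule the runs before launching them. This is a sketch under stated assumptions: `build_commands` and `PROPENSITY_BY_FLAG` are hypothetical names, not part of the repository, and placing -t at the end of the command is an assumption about how the option is passed.

```python
# Base model flags and the propensity estimators they select (from the README).
PROPENSITY_BY_FLAG = {
    "uniform": "uniform propensity",
    "user": "user propensity",
    "item": "item propensity",
    "both": "user-item propensity",
    "nb": "NB (uniform)",
    "nb_true": "NB (true)",
}


def build_commands(datasets=("coat", "yahoo"), tri_training=False, tune=False):
    """Build one `python main.py` argument list per (dataset, model) pair.

    Hypothetical helper for illustration; it only constructs the commands
    and does not execute anything.
    """
    suffix = "-at" if tri_training else ""
    commands = []
    for data in datasets:
        for flag in PROPENSITY_BY_FLAG:
            cmd = ["python", "main.py", "-d", data, "-m", flag + suffix]
            if tune:
                # Assumption: -t is appended as a plain flag with no value.
                cmd.append("-t")
            commands.append(cmd)
    return commands


if __name__ == "__main__":
    # Print the commands for the asymmetric tri-training variants.
    for cmd in build_commands(tri_training=True):
        print(" ".join(cmd))
```

Each returned list can be passed to `subprocess.run(cmd, check=True)` from the ./src/ directory if you prefer to drive the experiments from Python rather than from the shell loops.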

Once the simulations have finished running, the summarized results can be obtained by running the following command in the ./src/ directory:

python summarize_results.py -d coat yahoo

This creates the ./paper_results/ directory.
