PyTorch implementation of Off-policy Learning in Two-stage Recommender Systems

Jiaqi Ma

Last update: Dec 12, 2022

Overview

Off-Policy-2-Stage

This repo provides a PyTorch implementation of the MovieLens experiments for the following paper:

Off-policy Learning in Two-stage Recommender Systems

Jiaqi Ma, Zhe Zhao, Xinyang Yi, Ji Yang, Minmin Chen, Jiaxi Tang, Lichan Hong, Ed H. Chi. TheWebConf (WWW) 2020.

Requirements

See environment.yml. Run conda env create -f environment.yml to install the required packages, then activate the resulting environment (likely named op2s_env).

Run the code

Example: python run.py --loss_type loss_2s.

The "Cross-Entropy", "1-IPS", and "2-IPS" objectives respectively correspond to "loss_ce", "loss_ips", and "loss_2s" in the code.
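As a minimal sketch, the mapping above can be written down in Python and used to build one command per objective. It assumes only what this README states: that run.py accepts --loss_type with the three values listed. The names LOSS_TYPES and build_command are hypothetical helpers, not part of the repository.

```python
import shlex
import sys

# Objective names from the paper mapped to the --loss_type values named in this README.
LOSS_TYPES = {
    "Cross-Entropy": "loss_ce",
    "1-IPS": "loss_ips",
    "2-IPS": "loss_2s",
}


def build_command(objective, python=sys.executable, script="run.py"):
    """Return the argument list that selects the given objective (hypothetical helper)."""
    return [python, script, "--loss_type", LOSS_TYPES[objective]]


if __name__ == "__main__":
    for name in LOSS_TYPES:
        # Print the command only; nothing is launched here.
        print(name, "->", shlex.join(build_command(name)))
```

To launch a run instead of printing it, the returned list can be passed to subprocess.run(..., check=True) from the repository root.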

The MovieLens-1M dataset can be found on the GroupLens website.
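For orientation, the ratings file in the MovieLens-1M archive (ratings.dat) stores one rating per line as UserID::MovieID::Rating::Timestamp. The sketch below parses that format with the standard library only. It is independent of however run.py loads its data, which this README does not describe; the function names and any file path passed to load_ratings are assumptions.

```python
from collections import namedtuple

# One record per line of ratings.dat: UserID::MovieID::Rating::Timestamp
Rating = namedtuple("Rating", ["user_id", "movie_id", "rating", "timestamp"])


def parse_rating_line(line):
    """Parse a single '::'-separated ratings.dat line into a Rating tuple."""
    user_id, movie_id, rating, timestamp = line.strip().split("::")
    return Rating(int(user_id), int(movie_id), int(rating), int(timestamp))


def load_ratings(path):
    """Read every non-empty line of a ratings.dat file into a list of Rating tuples."""
    with open(path, encoding="latin-1") as handle:
        return [parse_rating_line(line) for line in handle if line.strip()]
```

For example, parse_rating_line("1::1193::5::978300760") yields a Rating with user_id 1, movie_id 1193, rating 5, and timestamp 978300760.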

Cite

@inproceedings{ma2020off,
  title={Off-policy Learning in Two-stage Recommender Systems},
  author={Ma, Jiaqi and Zhao, Zhe and Yi, Xinyang and Yang, Ji and Chen, Minmin and Tang, Jiaxi and Hong, Lichan and Chi, Ed H},
  booktitle={Proceedings of The Web Conference 2020},
  pages={463--473},
  year={2020}
}

Comments

Replication of the experiments

I'm trying to replicate the results reported in the paper using the code in this repository, and I have some questions:

How can I set the sample size and the c1 and c2 parameters?

Is the default dataset split the same as the one used for the experiments in the paper? If not, how can I reproduce it?

How can I include the Wiki10 dataset?

Is the seed used in your experiments the same as the one set in the code (i.e., 0)?

Opened by CavenaghiEmanuele
