Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1812.02353
Cited By
Top-K Off-Policy Correction for a REINFORCE Recommender System
6 December 2018
Minmin Chen
Alex Beutel
Paul Covington
Sagar Jain
Francois Belletti
Ed H. Chi
CML
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Top-K Off-Policy Correction for a REINFORCE Recommender System"
37 / 187 papers shown
Title
Exploration in two-stage recommender systems
Jiri Hron
K. Krauth
Michael I. Jordan
Niki Kilbertus
11
10
0
01 Sep 2020
How to "Improve" Prediction Using Behavior Modification
Galit Shmueli
A. Tafti
OffRL
AI4TS
6
8
0
26 Aug 2020
Zero-Shot Heterogeneous Transfer Learning from Recommender Systems to Cold-Start Search Retrieval
Tao Wu
E. Chio
Heng-Tze Cheng
Yu Du
Steffen Rendle
...
Ankit Kumar
Xiang Ma
A. Soares
Nitin Jindal
Pei Cao
DML
VLM
16
23
0
07 Aug 2020
Deep Bayesian Bandits: Exploring in Online Personalized Recommendations
Dalin Guo
S. Ktena
Ferenc Huszár
Pranay K. Myana
Wenzhe Shi
Alykhan Tejani
OffRL
33
39
0
03 Aug 2020
Optimizing Long-term Social Welfare in Recommender Systems: A Constrained Matching Approach
Martin Mladenov
Elliot Creager
Omer Ben-Porat
Kevin Swersky
R. Zemel
Craig Boutilier
16
63
0
31 Jul 2020
Counterfactual Evaluation of Slate Recommendations with Sequential Reward Interactions
James McInerney
B. Brost
Praveen Chandar
Rishabh Mehrotra
Ben Carterette
BDL
CML
OffRL
118
55
0
25 Jul 2020
Optimizing Interactive Systems via Data-Driven Objectives
Ziming Li
Julia Kiseleva
A. Grotov
Maarten de Rijke
Harrie Oosterhuis
OffRL
19
3
0
19 Jun 2020
Non-Stationary Off-Policy Optimization
Joey Hong
B. Kveton
Manzil Zaheer
Yinlam Chow
Amr Ahmed
OffRL
22
0
0
15 Jun 2020
Self-Supervised Reinforcement Learning for Recommender Systems
Xin Xin
Alexandros Karatzoglou
Ioannis Arapakis
J. Jose
SSL
OffRL
29
198
0
10 Jun 2020
Generator and Critic: A Deep Reinforcement Learning Approach for Slate Re-ranking in E-commerce
Jianxiong Wei
Anxiang Zeng
Yueqiu Wu
P. Guo
Q. Hua
Qingpeng Cai
OffRL
19
9
0
25 May 2020
Contrastive Learning for Debiased Candidate Generation in Large-Scale Recommender Systems
Chang Zhou
Jianxin Ma
Jianwei Zhang
Jingren Zhou
Hongxia Yang
31
142
0
20 May 2020
MOReL : Model-Based Offline Reinforcement Learning
Rahul Kidambi
Aravind Rajeswaran
Praneeth Netrapalli
Thorsten Joachims
OffRL
23
654
0
12 May 2020
Adaptive Reward-Poisoning Attacks against Reinforcement Learning
Xuezhou Zhang
Yuzhe Ma
Adish Singla
Xiaojin Zhu
AAML
29
124
0
27 Mar 2020
AliExpress Learning-To-Rank: Maximizing Online Model Performance without Going Online
Guangda Huzhang
Zhen-Jia Pang
Yongqing Gao
Yawen Liu
Weijie Shen
...
Qing Da
Anxiang Zeng
Han Yu
Yang Yu
Zhi-Hua Zhou
14
4
0
25 Mar 2020
Heterogeneous Relational Reasoning in Knowledge Graphs with Reinforcement Learning
Mandana Saebi
Steven J. Krieg
Chuxu Zhang
Meng Jiang
Nitesh V. Chawla
27
13
0
12 Mar 2020
Off-Policy Evaluation and Learning for External Validity under a Covariate Shift
Masahiro Kato
Masatoshi Uehara
Shota Yasui
OffRL
27
52
0
26 Feb 2020
Statistically Efficient Off-Policy Policy Gradients
Nathan Kallus
Masatoshi Uehara
OffRL
14
37
0
10 Feb 2020
Developing Multi-Task Recommendations with Long-Term Rewards via Policy Distilled Reinforcement Learning
Xi Liu
Li Li
Ping-Chun Hsieh
Muhe Xie
Yong Ge
Rui Chen
OffRL
23
3
0
27 Jan 2020
Crowdfunding Dynamics Tracking: A Reinforcement Learning Approach
Jun Wang
Haifeng Zhang
Qi Liu
Zhen Pan
Hanqing Tao
11
6
0
27 Dec 2019
ORL: Reinforcement Learning Benchmarks for Online Stochastic Optimization Problems
Bharathan Balaji
Jordan Bell-Masterson
Enes Bilgin
Andreas C. Damianou
Pablo Moreno Garcia
Arpit Jain
Runfei Luo
Alvaro Maggiar
Balakrishnan Narayanaswamy
Chun Jimmie Ye
OffRL
19
32
0
24 Nov 2019
Model-Based Reinforcement Learning with Adversarial Training for Online Recommendation
Xueying Bai
Jian Guan
Hongning Wang
OffRL
6
74
0
10 Nov 2019
MBCAL: Sample Efficient and Variance Reduced Reinforcement Learning for Recommender Systems
Fan Wang
Xiaomin Fang
Lihang Liu
Hao Tian
Zhiming Peng
OffRL
28
0
0
06 Nov 2019
Learning to Recommend from Sparse Data via Generative User Feedback
Wenlin Wang
Hongteng Xu
Ruiyi Zhang
Wenqi Wang
Piyush Rai
Lawrence Carin
13
0
0
21 Oct 2019
A Re-classification of Information Seeking Tasks and Their Computational Solutions
Zhiwen Tang
Grace Hui Yang
20
6
0
26 Sep 2019
RecSim: A Configurable Simulation Platform for Recommender Systems
Eugene Ie
Chih-Wei Hsu
Martin Mladenov
Vihan Jain
Sanmit Narvekar
Jing Wang
Rui Wu
Craig Boutilier
30
177
0
11 Sep 2019
DEAR: Deep Reinforcement Learning for Online Advertising Impression in Recommender Systems
Xiangyu Zhao
Changsheng Gu
Haoshenglun Zhang
Xiwang Yang
Xiaobing Liu
Jiliang Tang
Hui Liu
OffRL
11
98
0
09 Sep 2019
Unbiased Recommender Learning from Missing-Not-At-Random Implicit Feedback
Yuta Saito
Suguru Yaginuma
Yuta Nishino
Hayato Sakata
Kazuhide Nakata
CML
11
262
0
09 Sep 2019
Estimating Attention Flow in Online Video Networks
Siqi Wu
Marian-Andrei Rizoiu
Lexing Xie
GNN
22
26
0
20 Aug 2019
On the Value of Bandit Feedback for Offline Recommender System Evaluation
Olivier Jeunen
D. Rohde
Flavian Vasile
OffRL
6
10
0
26 Jul 2019
Addressing Delayed Feedback for Continuous Training with Neural Networks in CTR prediction
S. Ktena
Alykhan Tejani
Lucas Theis
Pranay K. Myana
D. Dilipkumar
Ferenc Huszár
Steven Yoo
Wenzhe Shi
NoLa
16
52
0
15 Jul 2019
Toward Simulating Environments in Reinforcement Learning Based Recommendations
Xiangyu Zhao
Long Xia
Zhuoye Ding
Dawei Yin
Jiliang Tang
30
25
0
27 Jun 2019
Reinforcement Learning for Slate-based Recommender Systems: A Tractable Decomposition and Practical Methodology
Eugene Ie
Vihan Jain
Jing Wang
Sanmit Narvekar
Ritesh Agarwal
...
Vince Gatto
Paul Covington
Jim McFadden
Tushar Chandra
Craig Boutilier
OffRL
18
69
0
29 May 2019
Lessons from Contextual Bandit Learning in a Customer Support Bot
Nikos Karampatziakis
Sebastian Kochman
Jade Huang
Paul Mineiro
Kathy Osborne
Weizhu Chen
10
6
0
06 May 2019
Fairness in Recommendation Ranking through Pairwise Comparisons
Alex Beutel
Jilin Chen
Tulsee Doshi
Hai Qian
Li Wei
...
Lukasz Heldt
Zhe Zhao
Lichan Hong
Ed H. Chi
Cristos Goodrow
FaML
31
373
0
02 Mar 2019
Towards Neural Mixture Recommender for Long Range Dependent User Sequences
Jiaxi Tang
Francois Belletti
Sagar Jain
Minmin Chen
Alex Beutel
Can Xu
Ed H. Chi
24
90
0
22 Feb 2019
Horizon: Facebook's Open Source Applied Reinforcement Learning Platform
J. Gauci
Edoardo Conti
Yitao Liang
Kittipat Virochsiri
Yuchen He
Zachary Kaden
Vivek Narayanan
Xiaohui Ye
Zhengxing Chen
Scott Fujimoto
22
139
0
01 Nov 2018
Seq2Slate: Re-ranking and Slate Optimization with RNNs
Irwan Bello
Sayali Kulkarni
Sagar Jain
Craig Boutilier
Ed H. Chi
Elad Eban
Xiyang Luo
Alan Mackey
Ofer Meshi
30
91
0
04 Oct 2018
Previous
1
2
3
4