Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1812.02353
Cited By
Top-K Off-Policy Correction for a REINFORCE Recommender System
6 December 2018
Minmin Chen
Alex Beutel
Paul Covington
Sagar Jain
Francois Belletti
Ed H. Chi
CML
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Top-K Off-Policy Correction for a REINFORCE Recommender System"
50 / 187 papers shown
Title
Efficient Distributed Framework for Collaborative Multi-Agent Reinforcement Learning
Shuhan Qi
Shuhao Zhang
Xiaohan Hou
Jia-jia Zhang
Xueliang Wang
Jing Xiao
24
0
0
11 May 2022
PinnerFormer: Sequence Modeling for User Representation at Pinterest
Nikil Pancha
Andrew Zhai
J. Leskovec
Charles R. Rosenberg
AI4TS
24
28
0
09 May 2022
When Should We Prefer Offline Reinforcement Learning Over Behavioral Cloning?
Aviral Kumar
Joey Hong
Anika Singh
Sergey Levine
OffRL
45
76
0
12 Apr 2022
Platform Behavior under Market Shocks: A Simulation Framework and Reinforcement-Learning Based Study
Xintong Wang
Gary Qiurui Ma
Alon Eden
Clara Li
Alexander R. Trott
Stephan Zheng
David C. Parkes
40
8
0
25 Mar 2022
Reinforcement Learning in Practice: Opportunities and Challenges
Yuxi Li
OffRL
36
9
0
23 Feb 2022
Who Are the Best Adopters? User Selection Model for Free Trial Item Promotion
Shiqi Wang
Chongming Gao
Min Gao
Junliang Yu
Zongwei Wang
Hongzhi Yin
26
10
0
19 Feb 2022
Should I send this notification? Optimizing push notifications decision making by modeling the future
Conor O'Brien
Huasen Wu
Shaodan Zhai
Dalin Guo
Wenzhe Shi
Jonathan J. Hunt
19
4
0
17 Feb 2022
Offline Reinforcement Learning for Mobile Notifications
Yiping Yuan
A. Muralidharan
Preetam Nandy
Miao Cheng
Prakruthi Prabhakar
OffRL
33
9
0
04 Feb 2022
Context Uncertainty in Contextual Bandits with Applications to Recommender Systems
Hao Wang
Yifei Ma
Hao Ding
Yuyang Wang
40
6
0
01 Feb 2022
Sequential Search with Off-Policy Reinforcement Learning
Dadong Miao
Yanan Wang
Guoyu Tang
Lin Liu
Sulong Xu
Bo Long
Yun Xiao
Lingfei Wu
Yunjiang Jiang
OffRL
16
3
0
01 Feb 2022
Recency Dropout for Recurrent Recommender Systems
Bo-Yu Chang
Can Xu
Matt Le
Jingchen Feng
Ya Le
Sriraj Badam
Ed H. Chi
Minmin Chen
25
3
0
26 Jan 2022
Edge-Compatible Reinforcement Learning for Recommendations
James E. Kostas
Philip S. Thomas
Georgios Theocharous
OffRL
21
0
0
10 Dec 2021
A Validation Tool for Designing Reinforcement Learning Environments
Ruiyang Xu
Zhengxing Chen
OffRL
16
0
0
10 Dec 2021
Contextual Bandit Applications in Customer Support Bot
Sandra Sajeev
Jade Huang
Nikos Karampatziakis
Matthew Hall
Sebastian Kochman
Weizhu Chen
22
10
0
06 Dec 2021
SelectAugment: Hierarchical Deterministic Sample Selection for Data Augmentation
Shiqi Lin
Zhizheng Zhang
Xin Li
Wenjun Zeng
Zhibo Chen
41
9
0
06 Dec 2021
Supervised Advantage Actor-Critic for Recommender Systems
Xin Xin
Alexandros Karatzoglou
Ioannis Arapakis
J. Jose
OffRL
18
30
0
05 Nov 2021
FINN.no Slates Dataset: A new Sequential Dataset Logging Interactions, allViewed Items and Click Responses/No-Click for Recommender Systems Research
Simen Eide
A. Frigessi
Helge Jenssen
David S. Leslie
Joakim Rishaug
Sofie Verrewaere
11
12
0
05 Nov 2021
Choosing the Best of Both Worlds: Diverse and Novel Recommendations through Multi-Objective Reinforcement Learning
Dusan Stamenkovic
Alexandros Karatzoglou
Ioannis Arapakis
Xin Xin
Kleomenis Katevas
26
46
0
28 Oct 2021
RL4RS: A Real-World Dataset for Reinforcement Learning based Recommender System
Kai Wang
Zhene Zou
Minghao Zhao
Qilin Deng
Yue Shang
Yile Liang
Runze Wu
Xudong Shen
Tangjie Lyu
Changjie Fan
OffRL
23
9
0
18 Oct 2021
Value Penalized Q-Learning for Recommender Systems
Chengqian Gao
Ke Xu
Kuangqi Zhou
Lanqing Li
Xueqian Wang
Bo Yuan
P. Zhao
OffRL
54
20
0
15 Oct 2021
Showing Your Offline Reinforcement Learning Work: Online Evaluation Budget Matters
Vladislav Kurenkov
Sergey Kolesnikov
OffRL
32
24
0
08 Oct 2021
Deep Exploration for Recommendation Systems
Zheqing Zhu
Benjamin Van Roy
32
11
0
26 Sep 2021
Accelerating Offline Reinforcement Learning Application in Real-Time Bidding and Recommendation: Potential Use of Simulation
Haruka Kiyohara
K. Kawakami
Yuta Saito
OffRL
26
12
0
17 Sep 2021
A Survey of Deep Reinforcement Learning in Recommender Systems: A Systematic Review and Future Directions
Xiaocong Chen
L. Yao
Julian McAuley
Guanglin Zhou
Xianzhi Wang
AI4TS
22
61
0
08 Sep 2021
Recommendation Fairness: From Static to Dynamic
De-Fu Zhang
Jun Wang
OffRL
24
15
0
05 Sep 2021
Reinforcement Learning to Optimize Lifetime Value in Cold-Start Recommendation
Luo Ji
Qin Qi
Bingqing Han
Hongxia Yang
OffRL
11
28
0
20 Aug 2021
Conditional Sequential Slate Optimization
Yipeng Zhang
Mingjian Lu
Saratchandra Indrakanti
M. Kannadasan
A. Bagherjeiran
21
0
0
12 Aug 2021
Online Bootstrap Inference For Policy Evaluation in Reinforcement Learning
Pratik Ramprasad
Yuantong Li
Zhuoran Yang
Zhaoran Wang
W. Sun
Guang Cheng
OffRL
50
27
0
08 Aug 2021
Pessimistic Model-based Offline Reinforcement Learning under Partial Coverage
Masatoshi Uehara
Wen Sun
OffRL
98
144
0
13 Jul 2021
Quantifying Availability and Discovery in Recommender Systems via Stochastic Reachability
Mihaela Curmei
Sarah Dean
Benjamin Recht
11
7
0
30 Jun 2021
On component interactions in two-stage recommender systems
Jiri Hron
K. Krauth
Michael I. Jordan
Niki Kilbertus
CML
LRM
40
31
0
28 Jun 2021
Multi-Task Learning for User Engagement and Adoption in Live Video Streaming Events
Stefanos Antaris
Dimitrios Rafailidis
Romina Arriaza
OffRL
12
0
0
18 Jun 2021
Control Variates for Slate Off-Policy Evaluation
N. Vlassis
Ashok Chandrashekar
Fernando Amat Gil
Nathan Kallus
OffRL
20
9
0
15 Jun 2021
Zero-Shot Recommender Systems
Hao Ding
Yifei Ma
Anoop Deoras
Bernie Wang
Hao Wang
VLM
15
89
0
18 May 2021
Generative Actor-Critic: An Off-policy Algorithm Using the Push-forward Model
Lingwei Peng
Hui Qian
Zhebang Shen
Chao Zhang
Fei Li
22
2
0
08 May 2021
Contextual Bandits with Sparse Data in Web setting
Björn Eriksson
9
0
0
06 May 2021
Nearly Horizon-Free Offline Reinforcement Learning
Tongzheng Ren
Jialian Li
Bo Dai
S. Du
Sujay Sanghavi
OffRL
29
49
0
25 Mar 2021
RecSim NG: Toward Principled Uncertainty Modeling for Recommender Ecosystems
Martin Mladenov
Chih-Wei Hsu
Vihan Jain
Eugene Ie
Christopher Colby
Nicolas Mayoraz
H. Pham
Dustin Tran
Ivan Vendrov
Craig Boutilier
BDL
15
31
0
14 Mar 2021
Reducing Conservativeness Oriented Offline Reinforcement Learning
Hongchang Zhang
Jianzhun Shao
Yuhang Jiang
Shuncheng He
Xiangyang Ji
OffRL
32
6
0
27 Feb 2021
Reward Poisoning in Reinforcement Learning: Attacks Against Unknown Learners in Unknown Environments
Amin Rakhsha
Xuezhou Zhang
Xiaojin Zhu
Adish Singla
AAML
OffRL
44
37
0
16 Feb 2021
Deep Reinforcement Learning-Based Product Recommender for Online Advertising
Milad Vaali Esfahaani
Yanbo Xue
P. Setoodeh
OffRL
14
3
0
30 Jan 2021
Advances and Challenges in Conversational Recommender Systems: A Survey
Chongming Gao
Wenqiang Lei
Xiangnan He
Maarten de Rijke
Tat-Seng Chua
138
273
0
23 Jan 2021
Measuring Recommender System Effects with Simulated Users
Sirui Yao
Yoni Halpern
Nithum Thain
Xuezhi Wang
Kang Lee
Flavien Prost
Ed H. Chi
Jilin Chen
Alex Beutel
48
49
0
12 Jan 2021
Offline Meta-level Model-based Reinforcement Learning Approach for Cold-Start Recommendation
Yanan Wang
Yong Ge
Li Li
Rui Chen
Tong Xu
OffRL
14
7
0
04 Dec 2020
Do Offline Metrics Predict Online Performance in Recommender Systems?
K. Krauth
Sarah Dean
Alex Zhao
Wenshuo Guo
Mihaela Curmei
Benjamin Recht
Michael I. Jordan
OffRL
16
40
0
07 Nov 2020
Generative Inverse Deep Reinforcement Learning for Online Recommendation
Xiaocong Chen
Lina Yao
Aixin Sun
Xianzhi Wang
Xiwei Xu
Liming Zhu
OffRL
4
27
0
04 Nov 2020
CoinDICE: Off-Policy Confidence Interval Estimation
Bo Dai
Ofir Nachum
Yinlam Chow
Lihong Li
Csaba Szepesvári
Dale Schuurmans
OffRL
27
84
0
22 Oct 2020
Lambda Learner: Fast Incremental Learning on Data Streams
R. Ramanath
Konstantin Salomatin
Jeffrey Gee
Kirill Talanine
Onkar Dalal
Gungor Polatkan
S. Smoot
Deepak Kumar
11
9
0
11 Oct 2020
Learning from eXtreme Bandit Feedback
Romain Lopez
Inderjit S. Dhillon
Michael I. Jordan
OffRL
28
25
0
27 Sep 2020
div2vec: Diversity-Emphasized Node Embedding
Jisu Jeong
Jeong-Min Yun
Hongi Keam
Young-Jin Park
Zimin Park
Junki Cho
19
5
0
21 Sep 2020
Previous
1
2
3
4
Next