ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1812.02353
  4. Cited By
Top-K Off-Policy Correction for a REINFORCE Recommender System

Top-K Off-Policy Correction for a REINFORCE Recommender System

6 December 2018
Minmin Chen
Alex Beutel
Paul Covington
Sagar Jain
Francois Belletti
Ed H. Chi
    CML
    OffRL
ArXivPDFHTML

Papers citing "Top-K Off-Policy Correction for a REINFORCE Recommender System"

50 / 187 papers shown
Title
Efficient Distributed Framework for Collaborative Multi-Agent
  Reinforcement Learning
Efficient Distributed Framework for Collaborative Multi-Agent Reinforcement Learning
Shuhan Qi
Shuhao Zhang
Xiaohan Hou
Jia-jia Zhang
Xueliang Wang
Jing Xiao
24
0
0
11 May 2022
PinnerFormer: Sequence Modeling for User Representation at Pinterest
PinnerFormer: Sequence Modeling for User Representation at Pinterest
Nikil Pancha
Andrew Zhai
J. Leskovec
Charles R. Rosenberg
AI4TS
24
28
0
09 May 2022
When Should We Prefer Offline Reinforcement Learning Over Behavioral
  Cloning?
When Should We Prefer Offline Reinforcement Learning Over Behavioral Cloning?
Aviral Kumar
Joey Hong
Anika Singh
Sergey Levine
OffRL
45
76
0
12 Apr 2022
Platform Behavior under Market Shocks: A Simulation Framework and
  Reinforcement-Learning Based Study
Platform Behavior under Market Shocks: A Simulation Framework and Reinforcement-Learning Based Study
Xintong Wang
Gary Qiurui Ma
Alon Eden
Clara Li
Alexander R. Trott
Stephan Zheng
David C. Parkes
40
8
0
25 Mar 2022
Reinforcement Learning in Practice: Opportunities and Challenges
Reinforcement Learning in Practice: Opportunities and Challenges
Yuxi Li
OffRL
36
9
0
23 Feb 2022
Who Are the Best Adopters? User Selection Model for Free Trial Item
  Promotion
Who Are the Best Adopters? User Selection Model for Free Trial Item Promotion
Shiqi Wang
Chongming Gao
Min Gao
Junliang Yu
Zongwei Wang
Hongzhi Yin
26
10
0
19 Feb 2022
Should I send this notification? Optimizing push notifications decision
  making by modeling the future
Should I send this notification? Optimizing push notifications decision making by modeling the future
Conor O'Brien
Huasen Wu
Shaodan Zhai
Dalin Guo
Wenzhe Shi
Jonathan J. Hunt
19
4
0
17 Feb 2022
Offline Reinforcement Learning for Mobile Notifications
Offline Reinforcement Learning for Mobile Notifications
Yiping Yuan
A. Muralidharan
Preetam Nandy
Miao Cheng
Prakruthi Prabhakar
OffRL
33
9
0
04 Feb 2022
Context Uncertainty in Contextual Bandits with Applications to
  Recommender Systems
Context Uncertainty in Contextual Bandits with Applications to Recommender Systems
Hao Wang
Yifei Ma
Hao Ding
Yuyang Wang
40
6
0
01 Feb 2022
Sequential Search with Off-Policy Reinforcement Learning
Sequential Search with Off-Policy Reinforcement Learning
Dadong Miao
Yanan Wang
Guoyu Tang
Lin Liu
Sulong Xu
Bo Long
Yun Xiao
Lingfei Wu
Yunjiang Jiang
OffRL
16
3
0
01 Feb 2022
Recency Dropout for Recurrent Recommender Systems
Recency Dropout for Recurrent Recommender Systems
Bo-Yu Chang
Can Xu
Matt Le
Jingchen Feng
Ya Le
Sriraj Badam
Ed H. Chi
Minmin Chen
25
3
0
26 Jan 2022
Edge-Compatible Reinforcement Learning for Recommendations
Edge-Compatible Reinforcement Learning for Recommendations
James E. Kostas
Philip S. Thomas
Georgios Theocharous
OffRL
21
0
0
10 Dec 2021
A Validation Tool for Designing Reinforcement Learning Environments
A Validation Tool for Designing Reinforcement Learning Environments
Ruiyang Xu
Zhengxing Chen
OffRL
16
0
0
10 Dec 2021
Contextual Bandit Applications in Customer Support Bot
Contextual Bandit Applications in Customer Support Bot
Sandra Sajeev
Jade Huang
Nikos Karampatziakis
Matthew Hall
Sebastian Kochman
Weizhu Chen
22
10
0
06 Dec 2021
SelectAugment: Hierarchical Deterministic Sample Selection for Data
  Augmentation
SelectAugment: Hierarchical Deterministic Sample Selection for Data Augmentation
Shiqi Lin
Zhizheng Zhang
Xin Li
Wenjun Zeng
Zhibo Chen
41
9
0
06 Dec 2021
Supervised Advantage Actor-Critic for Recommender Systems
Supervised Advantage Actor-Critic for Recommender Systems
Xin Xin
Alexandros Karatzoglou
Ioannis Arapakis
J. Jose
OffRL
18
30
0
05 Nov 2021
FINN.no Slates Dataset: A new Sequential Dataset Logging Interactions,
  allViewed Items and Click Responses/No-Click for Recommender Systems Research
FINN.no Slates Dataset: A new Sequential Dataset Logging Interactions, allViewed Items and Click Responses/No-Click for Recommender Systems Research
Simen Eide
A. Frigessi
Helge Jenssen
David S. Leslie
Joakim Rishaug
Sofie Verrewaere
11
12
0
05 Nov 2021
Choosing the Best of Both Worlds: Diverse and Novel Recommendations
  through Multi-Objective Reinforcement Learning
Choosing the Best of Both Worlds: Diverse and Novel Recommendations through Multi-Objective Reinforcement Learning
Dusan Stamenkovic
Alexandros Karatzoglou
Ioannis Arapakis
Xin Xin
Kleomenis Katevas
26
46
0
28 Oct 2021
RL4RS: A Real-World Dataset for Reinforcement Learning based Recommender
  System
RL4RS: A Real-World Dataset for Reinforcement Learning based Recommender System
Kai Wang
Zhene Zou
Minghao Zhao
Qilin Deng
Yue Shang
Yile Liang
Runze Wu
Xudong Shen
Tangjie Lyu
Changjie Fan
OffRL
23
9
0
18 Oct 2021
Value Penalized Q-Learning for Recommender Systems
Value Penalized Q-Learning for Recommender Systems
Chengqian Gao
Ke Xu
Kuangqi Zhou
Lanqing Li
Xueqian Wang
Bo Yuan
P. Zhao
OffRL
54
20
0
15 Oct 2021
Showing Your Offline Reinforcement Learning Work: Online Evaluation
  Budget Matters
Showing Your Offline Reinforcement Learning Work: Online Evaluation Budget Matters
Vladislav Kurenkov
Sergey Kolesnikov
OffRL
32
24
0
08 Oct 2021
Deep Exploration for Recommendation Systems
Deep Exploration for Recommendation Systems
Zheqing Zhu
Benjamin Van Roy
32
11
0
26 Sep 2021
Accelerating Offline Reinforcement Learning Application in Real-Time
  Bidding and Recommendation: Potential Use of Simulation
Accelerating Offline Reinforcement Learning Application in Real-Time Bidding and Recommendation: Potential Use of Simulation
Haruka Kiyohara
K. Kawakami
Yuta Saito
OffRL
26
12
0
17 Sep 2021
A Survey of Deep Reinforcement Learning in Recommender Systems: A
  Systematic Review and Future Directions
A Survey of Deep Reinforcement Learning in Recommender Systems: A Systematic Review and Future Directions
Xiaocong Chen
L. Yao
Julian McAuley
Guanglin Zhou
Xianzhi Wang
AI4TS
22
61
0
08 Sep 2021
Recommendation Fairness: From Static to Dynamic
Recommendation Fairness: From Static to Dynamic
De-Fu Zhang
Jun Wang
OffRL
24
15
0
05 Sep 2021
Reinforcement Learning to Optimize Lifetime Value in Cold-Start
  Recommendation
Reinforcement Learning to Optimize Lifetime Value in Cold-Start Recommendation
Luo Ji
Qin Qi
Bingqing Han
Hongxia Yang
OffRL
11
28
0
20 Aug 2021
Conditional Sequential Slate Optimization
Conditional Sequential Slate Optimization
Yipeng Zhang
Mingjian Lu
Saratchandra Indrakanti
M. Kannadasan
A. Bagherjeiran
21
0
0
12 Aug 2021
Online Bootstrap Inference For Policy Evaluation in Reinforcement
  Learning
Online Bootstrap Inference For Policy Evaluation in Reinforcement Learning
Pratik Ramprasad
Yuantong Li
Zhuoran Yang
Zhaoran Wang
W. Sun
Guang Cheng
OffRL
50
27
0
08 Aug 2021
Pessimistic Model-based Offline Reinforcement Learning under Partial
  Coverage
Pessimistic Model-based Offline Reinforcement Learning under Partial Coverage
Masatoshi Uehara
Wen Sun
OffRL
98
144
0
13 Jul 2021
Quantifying Availability and Discovery in Recommender Systems via
  Stochastic Reachability
Quantifying Availability and Discovery in Recommender Systems via Stochastic Reachability
Mihaela Curmei
Sarah Dean
Benjamin Recht
11
7
0
30 Jun 2021
On component interactions in two-stage recommender systems
On component interactions in two-stage recommender systems
Jiri Hron
K. Krauth
Michael I. Jordan
Niki Kilbertus
CML
LRM
40
31
0
28 Jun 2021
Multi-Task Learning for User Engagement and Adoption in Live Video
  Streaming Events
Multi-Task Learning for User Engagement and Adoption in Live Video Streaming Events
Stefanos Antaris
Dimitrios Rafailidis
Romina Arriaza
OffRL
12
0
0
18 Jun 2021
Control Variates for Slate Off-Policy Evaluation
Control Variates for Slate Off-Policy Evaluation
N. Vlassis
Ashok Chandrashekar
Fernando Amat Gil
Nathan Kallus
OffRL
20
9
0
15 Jun 2021
Zero-Shot Recommender Systems
Zero-Shot Recommender Systems
Hao Ding
Yifei Ma
Anoop Deoras
Bernie Wang
Hao Wang
VLM
15
89
0
18 May 2021
Generative Actor-Critic: An Off-policy Algorithm Using the Push-forward Model
Lingwei Peng
Hui Qian
Zhebang Shen
Chao Zhang
Fei Li
22
2
0
08 May 2021
Contextual Bandits with Sparse Data in Web setting
Contextual Bandits with Sparse Data in Web setting
Björn Eriksson
9
0
0
06 May 2021
Nearly Horizon-Free Offline Reinforcement Learning
Nearly Horizon-Free Offline Reinforcement Learning
Tongzheng Ren
Jialian Li
Bo Dai
S. Du
Sujay Sanghavi
OffRL
29
49
0
25 Mar 2021
RecSim NG: Toward Principled Uncertainty Modeling for Recommender
  Ecosystems
RecSim NG: Toward Principled Uncertainty Modeling for Recommender Ecosystems
Martin Mladenov
Chih-Wei Hsu
Vihan Jain
Eugene Ie
Christopher Colby
Nicolas Mayoraz
H. Pham
Dustin Tran
Ivan Vendrov
Craig Boutilier
BDL
15
31
0
14 Mar 2021
Reducing Conservativeness Oriented Offline Reinforcement Learning
Reducing Conservativeness Oriented Offline Reinforcement Learning
Hongchang Zhang
Jianzhun Shao
Yuhang Jiang
Shuncheng He
Xiangyang Ji
OffRL
32
6
0
27 Feb 2021
Reward Poisoning in Reinforcement Learning: Attacks Against Unknown
  Learners in Unknown Environments
Reward Poisoning in Reinforcement Learning: Attacks Against Unknown Learners in Unknown Environments
Amin Rakhsha
Xuezhou Zhang
Xiaojin Zhu
Adish Singla
AAML
OffRL
44
37
0
16 Feb 2021
Deep Reinforcement Learning-Based Product Recommender for Online
  Advertising
Deep Reinforcement Learning-Based Product Recommender for Online Advertising
Milad Vaali Esfahaani
Yanbo Xue
P. Setoodeh
OffRL
14
3
0
30 Jan 2021
Advances and Challenges in Conversational Recommender Systems: A Survey
Advances and Challenges in Conversational Recommender Systems: A Survey
Chongming Gao
Wenqiang Lei
Xiangnan He
Maarten de Rijke
Tat-Seng Chua
138
273
0
23 Jan 2021
Measuring Recommender System Effects with Simulated Users
Measuring Recommender System Effects with Simulated Users
Sirui Yao
Yoni Halpern
Nithum Thain
Xuezhi Wang
Kang Lee
Flavien Prost
Ed H. Chi
Jilin Chen
Alex Beutel
48
49
0
12 Jan 2021
Offline Meta-level Model-based Reinforcement Learning Approach for
  Cold-Start Recommendation
Offline Meta-level Model-based Reinforcement Learning Approach for Cold-Start Recommendation
Yanan Wang
Yong Ge
Li Li
Rui Chen
Tong Xu
OffRL
14
7
0
04 Dec 2020
Do Offline Metrics Predict Online Performance in Recommender Systems?
Do Offline Metrics Predict Online Performance in Recommender Systems?
K. Krauth
Sarah Dean
Alex Zhao
Wenshuo Guo
Mihaela Curmei
Benjamin Recht
Michael I. Jordan
OffRL
16
40
0
07 Nov 2020
Generative Inverse Deep Reinforcement Learning for Online Recommendation
Generative Inverse Deep Reinforcement Learning for Online Recommendation
Xiaocong Chen
Lina Yao
Aixin Sun
Xianzhi Wang
Xiwei Xu
Liming Zhu
OffRL
4
27
0
04 Nov 2020
CoinDICE: Off-Policy Confidence Interval Estimation
CoinDICE: Off-Policy Confidence Interval Estimation
Bo Dai
Ofir Nachum
Yinlam Chow
Lihong Li
Csaba Szepesvári
Dale Schuurmans
OffRL
27
84
0
22 Oct 2020
Lambda Learner: Fast Incremental Learning on Data Streams
Lambda Learner: Fast Incremental Learning on Data Streams
R. Ramanath
Konstantin Salomatin
Jeffrey Gee
Kirill Talanine
Onkar Dalal
Gungor Polatkan
S. Smoot
Deepak Kumar
11
9
0
11 Oct 2020
Learning from eXtreme Bandit Feedback
Learning from eXtreme Bandit Feedback
Romain Lopez
Inderjit S. Dhillon
Michael I. Jordan
OffRL
28
25
0
27 Sep 2020
div2vec: Diversity-Emphasized Node Embedding
div2vec: Diversity-Emphasized Node Embedding
Jisu Jeong
Jeong-Min Yun
Hongi Keam
Young-Jin Park
Zimin Park
Junki Cho
19
5
0
21 Sep 2020
Previous
1234
Next