arXiv:2305.13747
Optimizing Long-term Value for Auction-Based Recommender Systems via On-Policy Reinforcement Learning
23 May 2023
Ruiyang Xu, Jalaj Bhandari, D. Korenkevych, F. Liu, Yuchen He, Alex Nikulkov, Zheqing Zhu
OffRL
Papers citing "Optimizing Long-term Value for Auction-Based Recommender Systems via On-Policy Reinforcement Learning"
6 / 6 papers shown
Pearl: A Production-ready Reinforcement Learning Agent
Zheqing Zhu, Rodrigo de Salvo Braz, Jalaj Bhandari, Daniel Jiang, Yi Wan, ..., D. Korenkevych, Ürün Dogan, Frank Cheng, Zheng Wu, Wanqiao Xu
VLM
OffRL
OnRL
39
6
0
06 Dec 2023
Non-Stationary Contextual Bandit Learning via Neural Predictive Ensemble Sampling
Zheqing Zhu, Yueyang Liu, Xu Kuang, Benjamin Van Roy
AI4TS
29
0
0
11 Oct 2023
Filter Bubbles in Recommender Systems: Fact or Fallacy -- A Systematic Review
Q. Areeb, Mohammad Nadeem, S. Sohail, Raza Imam, F. Doctor, Yassine Himeur, Amir Hussain, Abbes Amira
24
30
0
02 Jul 2023
Scalable Neural Contextual Bandit for Recommender Systems
Zheqing Zhu, Benjamin Van Roy
OffRL
24
9
0
26 Jun 2023
Evaluating Online Bandit Exploration In Large-Scale Recommender System
Hongbo Guo, Ruben Naeff, Alex Nikulkov, Zheqing Zhu
OffRL
24
6
0
05 Apr 2023
Deep Exploration for Recommendation Systems
Zheqing Zhu, Benjamin Van Roy
32
11
0
26 Sep 2021