Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1802.04162
Cited By
Policy Gradients for Contextual Recommendations
12 February 2018
Feiyang Pan
Qingpeng Cai
Pingzhong Tang
Fuzhen Zhuang
Qing He
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Policy Gradients for Contextual Recommendations"
8 / 8 papers shown
Title
Improving Reward-Conditioned Policies for Multi-Armed Bandits using Normalized Weight Functions
Kai Xu
Farid Tajaddodianfar
Ben Allison
23
0
0
16 Jun 2024
Examining Policy Entropy of Reinforcement Learning Agents for Personalization Tasks
Anton Dereventsov
Andrew Starnes
Clayton Webster
26
4
0
21 Nov 2022
Contextual Decision Trees
Tommaso Aldinucci
Enrico Civitelli
Leonardo Di Gangi
Alessandro Sestini
18
3
0
13 Jul 2022
Productivity, Portability, Performance: Data-Centric Python
Yiheng Wang
Yao Zhang
Yanzhang Wang
Yan Wan
Jiao Wang
Zhongyuan Wu
Yuhao Yang
Bowen She
59
95
0
01 Jul 2021
Generative Adversarial Reward Learning for Generalized Behavior Tendency Inference
Xiaocong Chen
Lina Yao
Xianzhi Wang
Aixin Sun
Wenjie Zhang
Quan Z. Sheng
22
8
0
03 May 2021
GoChat: Goal-oriented Chatbots with Hierarchical Reinforcement Learning
Jianfeng Liu
Feiyang Pan
Ling Luo
OffRL
20
23
0
24 May 2020
Dropout as a Bayesian Approximation: Representing Model Uncertainty in Deep Learning
Y. Gal
Zoubin Ghahramani
UQCV
BDL
287
9,167
0
06 Jun 2015
Off-Policy Actor-Critic
T. Degris
Martha White
R. Sutton
OffRL
CML
163
220
0
22 May 2012
1