ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1802.04162
  4. Cited By
Policy Gradients for Contextual Recommendations

Policy Gradients for Contextual Recommendations

12 February 2018
Feiyang Pan
Qingpeng Cai
Pingzhong Tang
Fuzhen Zhuang
Qing He
    OffRL
ArXivPDFHTML

Papers citing "Policy Gradients for Contextual Recommendations"

8 / 8 papers shown
Title
Improving Reward-Conditioned Policies for Multi-Armed Bandits using
  Normalized Weight Functions
Improving Reward-Conditioned Policies for Multi-Armed Bandits using Normalized Weight Functions
Kai Xu
Farid Tajaddodianfar
Ben Allison
23
0
0
16 Jun 2024
Examining Policy Entropy of Reinforcement Learning Agents for
  Personalization Tasks
Examining Policy Entropy of Reinforcement Learning Agents for Personalization Tasks
Anton Dereventsov
Andrew Starnes
Clayton Webster
26
4
0
21 Nov 2022
Contextual Decision Trees
Contextual Decision Trees
Tommaso Aldinucci
Enrico Civitelli
Leonardo Di Gangi
Alessandro Sestini
18
3
0
13 Jul 2022
Productivity, Portability, Performance: Data-Centric Python
Productivity, Portability, Performance: Data-Centric Python
Yiheng Wang
Yao Zhang
Yanzhang Wang
Yan Wan
Jiao Wang
Zhongyuan Wu
Yuhao Yang
Bowen She
59
95
0
01 Jul 2021
Generative Adversarial Reward Learning for Generalized Behavior Tendency
  Inference
Generative Adversarial Reward Learning for Generalized Behavior Tendency Inference
Xiaocong Chen
Lina Yao
Xianzhi Wang
Aixin Sun
Wenjie Zhang
Quan Z. Sheng
22
8
0
03 May 2021
GoChat: Goal-oriented Chatbots with Hierarchical Reinforcement Learning
GoChat: Goal-oriented Chatbots with Hierarchical Reinforcement Learning
Jianfeng Liu
Feiyang Pan
Ling Luo
OffRL
20
23
0
24 May 2020
Dropout as a Bayesian Approximation: Representing Model Uncertainty in
  Deep Learning
Dropout as a Bayesian Approximation: Representing Model Uncertainty in Deep Learning
Y. Gal
Zoubin Ghahramani
UQCV
BDL
287
9,167
0
06 Jun 2015
Off-Policy Actor-Critic
Off-Policy Actor-Critic
T. Degris
Martha White
R. Sutton
OffRL
CML
163
220
0
22 May 2012
1