Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1811.05869
Cited By
Large-scale Interactive Recommendation with Tree-structured Policy Gradient
14 November 2018
Haokun Chen
Xinyi Dai
Han Cai
Weinan Zhang
Xuejian Wang
Ruiming Tang
Yuzhou Zhang
Yong Yu
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Large-scale Interactive Recommendation with Tree-structured Policy Gradient"
11 / 11 papers shown
Title
SAPIENT: Mastering Multi-turn Conversational Recommendation with Strategic Planning and Monte Carlo Tree Search
Hanwen Du
Bo Peng
Xia Ning
54
0
0
12 Oct 2024
Reinforcement Learning to Rank in E-Commerce Search Engine: Formalization, Analysis, and Application
Yujing Hu
Qing Da
Anxiang Zeng
Yang Yu
Yinghui Xu
33
179
0
02 Mar 2018
Real-Time Bidding with Multi-Agent Reinforcement Learning in Display Advertising
Junqi Jin
Cheng-Ning Song
Han Li
Kun Gai
Jun Wang
Weinan Zhang
46
179
0
27 Feb 2018
Recommendations with Negative Feedback via Pairwise Deep Reinforcement Learning
Xiangyu Zhao
Li Zhang
Zhuoye Ding
Long Xia
Jiliang Tang
Dawei Yin
63
331
0
19 Feb 2018
Action Branching Architectures for Deep Reinforcement Learning
Arash Tavakoli
Fabio Pardo
Petar Kormushev
41
260
0
24 Nov 2017
Simple Recurrent Units for Highly Parallelizable Recurrence
Tao Lei
Yu Zhang
Sida I. Wang
Huijing Dai
Yoav Artzi
LRM
73
271
0
08 Sep 2017
Real-Time Bidding by Reinforcement Learning in Display Advertising
Han Cai
Kan Ren
Weinan Zhang
Kleanthis Malialis
Jun Wang
Yong Yu
Defeng Guo
50
246
0
10 Jan 2017
Deep Reinforcement Learning in Large Discrete Action Spaces
Gabriel Dulac-Arnold
Richard Evans
H. V. Hasselt
P. Sunehag
Timothy Lillicrap
Jonathan J. Hunt
Timothy A. Mann
T. Weber
T. Degris
Ben Coppin
OffRL
54
573
0
24 Dec 2015
Continuous control with deep reinforcement learning
Timothy Lillicrap
Jonathan J. Hunt
Alexander Pritzel
N. Heess
Tom Erez
Yuval Tassa
David Silver
Daan Wierstra
244
13,174
0
09 Sep 2015
Auto-Encoding Variational Bayes
Diederik P. Kingma
Max Welling
BDL
393
16,962
0
20 Dec 2013
A Contextual-Bandit Approach to Personalized News Article Recommendation
Lihong Li
Wei Chu
John Langford
Robert Schapire
317
2,935
0
28 Feb 2010
1