Large-scale Interactive Recommendation with Tree-structured Policy Gradient

14 November 2018
Haokun Chen
Xinyi Dai
Han Cai
Weinan Zhang
Xuejian Wang
Ruiming Tang
Yuzhou Zhang
Yong Yu

Papers citing "Large-scale Interactive Recommendation with Tree-structured Policy Gradient"

11 / 11 papers shown
SAPIENT: Mastering Multi-turn Conversational Recommendation with Strategic Planning and Monte Carlo Tree Search
Hanwen Du
Bo Peng
Xia Ning
54
0
0
12 Oct 2024
Reinforcement Learning to Rank in E-Commerce Search Engine: Formalization, Analysis, and Application
Yujing Hu
Qing Da
Anxiang Zeng
Yang Yu
Yinghui Xu
33
179
0
02 Mar 2018
Real-Time Bidding with Multi-Agent Reinforcement Learning in Display Advertising
Junqi Jin
Cheng-Ning Song
Han Li
Kun Gai
Jun Wang
Weinan Zhang
46
179
0
27 Feb 2018
Recommendations with Negative Feedback via Pairwise Deep Reinforcement Learning
Xiangyu Zhao
Li Zhang
Zhuoye Ding
Long Xia
Jiliang Tang
Dawei Yin
63
331
0
19 Feb 2018
Action Branching Architectures for Deep Reinforcement Learning
Arash Tavakoli
Fabio Pardo
Petar Kormushev
41
260
0
24 Nov 2017
Simple Recurrent Units for Highly Parallelizable Recurrence
Tao Lei
Yu Zhang
Sida I. Wang
Huijing Dai
Yoav Artzi
LRM
73
271
0
08 Sep 2017
Real-Time Bidding by Reinforcement Learning in Display Advertising
Han Cai
Kan Ren
Weinan Zhang
Kleanthis Malialis
Jun Wang
Yong Yu
Defeng Guo
50
246
0
10 Jan 2017
Deep Reinforcement Learning in Large Discrete Action Spaces
Gabriel Dulac-Arnold
Richard Evans
H. V. Hasselt
P. Sunehag
Timothy Lillicrap
Jonathan J. Hunt
Timothy A. Mann
T. Weber
T. Degris
Ben Coppin
OffRL
54
573
0
24 Dec 2015
Continuous control with deep reinforcement learning
Timothy Lillicrap
Jonathan J. Hunt
Alexander Pritzel
N. Heess
Tom Erez
Yuval Tassa
David Silver
Daan Wierstra
244
13,174
0
09 Sep 2015
Auto-Encoding Variational Bayes
Diederik P. Kingma
Max Welling
BDL
393
16,962
0
20 Dec 2013
A Contextual-Bandit Approach to Personalized News Article Recommendation
Lihong Li
Wei Chu
John Langford
Robert Schapire
317
2,935
0
28 Feb 2010