Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2108.09141
Cited By
Reinforcement Learning to Optimize Lifetime Value in Cold-Start Recommendation
20 August 2021
Luo Ji
Qin Qi
Bingqing Han
Hongxia Yang
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Reinforcement Learning to Optimize Lifetime Value in Cold-Start Recommendation"
12 / 12 papers shown
Title
Offline Meta-level Model-based Reinforcement Learning Approach for Cold-Start Recommendation
Yanan Wang
Yong Ge
Li Li
Rui Chen
Tong Xu
OffRL
24
7
0
04 Dec 2020
Model-Based Reinforcement Learning with Adversarial Training for Online Recommendation
Xueying Bai
Jian Guan
Hongning Wang
OffRL
26
75
0
10 Nov 2019
Exact-K Recommendation via Maximal Clique Optimization
Yu Gong
Yu Zhu
Lu Duan
Qingwen Liu
Ziyu Guan
Fei Sun
Wenwu Ou
Kenny Q. Zhu
OffRL
CML
53
59
0
17 May 2019
Top-K Off-Policy Correction for a REINFORCE Recommender System
Minmin Chen
Alex Beutel
Paul Covington
Sagar Jain
Francois Belletti
Ed H. Chi
CML
OffRL
112
476
0
06 Dec 2018
Learning to Collaborate: Multi-Scenario Ranking via Multi-Agent Reinforcement Learning
Jun Feng
Heng Li
Minlie Huang
Shichen Liu
Wenwu Ou
Zhirong Wang
Xiaoyan Zhu
39
70
0
17 Sep 2018
Speeding up the Metabolism in E-commerce by Reinforcement Mechanism Design
Hua-Lin He
C. Pan
Qing Da
Anxiang Zeng
16
2
0
02 Jul 2018
Addressing the Item Cold-start Problem by Attribute-driven Active Learning
Y. Zhu
Jinhao Lin
S. He
Beidou Wang
Ziyu Guan
Haifeng Liu
Deng Cai
103
130
0
23 May 2018
Recommendations with Negative Feedback via Pairwise Deep Reinforcement Learning
Xiangyu Zhao
Li Zhang
Zhuoye Ding
Long Xia
Jiliang Tang
Dawei Yin
56
331
0
19 Feb 2018
Deep Reinforcement Learning for List-wise Recommendations
Xiangyu Zhao
Li Zhang
Yue Zhao
Zhuoye Ding
Dawei Yin
Jiliang Tang
30
173
0
30 Dec 2017
Cold-Start Reinforcement Learning with Softmax Policy Gradient
Nan Ding
Radu Soricut
40
46
0
27 Sep 2017
Continuous control with deep reinforcement learning
Timothy Lillicrap
Jonathan J. Hunt
Alexander Pritzel
N. Heess
Tom Erez
Yuval Tassa
David Silver
Daan Wierstra
193
13,174
0
09 Sep 2015
Unbiased Offline Evaluation of Contextual-bandit-based News Article Recommendation Algorithms
Lihong Li
Wei Chu
John Langford
Xuanhui Wang
OffRL
150
574
0
31 Mar 2010
1