Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1512.01124
Cited By
Deep Reinforcement Learning with Attention for Slate Markov Decision Processes with High-Dimensional States and Actions
3 December 2015
P. Sunehag
Richard Evans
Gabriel Dulac-Arnold
Yori Zwols
D. Visentin
Ben Coppin
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Deep Reinforcement Learning with Attention for Slate Markov Decision Processes with High-Dimensional States and Actions"
10 / 10 papers shown
Title
Automatic Music Playlist Generation via Simulation-based Reinforcement Learning
Federico Tomasi
Joseph Cauteruccio
Surya Kanoria
K. Ciosek
Matteo Rinaldi
Zhenwen Dai
OffRL
22
5
0
13 Oct 2023
Distributional Off-Policy Evaluation for Slate Recommendations
Shreyas Chaudhari
David Arbour
Georgios Theocharous
N. Vlassis
OffRL
44
0
0
27 Aug 2023
Estimating and Penalizing Induced Preference Shifts in Recommender Systems
Micah Carroll
Anca Dragan
Stuart J. Russell
Dylan Hadfield-Menell
OffRL
22
41
0
25 Apr 2022
Value Penalized Q-Learning for Recommender Systems
Chengqian Gao
Ke Xu
Kuangqi Zhou
Lanqing Li
Xueqian Wang
Bo Yuan
P. Zhao
OffRL
54
20
0
15 Oct 2021
Advances and Challenges in Conversational Recommender Systems: A Survey
Chongming Gao
Wenqiang Lei
Xiangnan He
Maarten de Rijke
Tat-Seng Chua
138
273
0
23 Jan 2021
RecSim: A Configurable Simulation Platform for Recommender Systems
Eugene Ie
Chih-Wei Hsu
Martin Mladenov
Vihan Jain
Sanmit Narvekar
Jing Wang
Rui Wu
Craig Boutilier
27
177
0
11 Sep 2019
Reinforcement Learning with Function-Valued Action Spaces for Partial Differential Equation Control
Yangchen Pan
Amir-massoud Farahmand
Martha White
S. Nabi
P. Grover
D. Nikovski
40
18
0
13 Jun 2018
Beyond Greedy Ranking: Slate Optimization via List-CVAE
Ray Jiang
Sven Gowal
Timothy A. Mann
Danilo Jimenez Rezende
22
49
0
05 Mar 2018
Recommendations with Negative Feedback via Pairwise Deep Reinforcement Learning
Xiangyu Zhao
Li Zhang
Zhuoye Ding
Long Xia
Jiliang Tang
Dawei Yin
26
327
0
19 Feb 2018
Matroid Bandits: Fast Combinatorial Optimization with Learning
B. Kveton
Zheng Wen
Azin Ashkan
Hoda Eydgahi
Brian Eriksson
46
119
0
20 Mar 2014
1