Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1603.09233
Cited By
Optimal Recommendation to Users that React: Online Learning for a Class of POMDPs
30 March 2016
R. Meshram
Aditya Gopalan
D. Manjunath
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Optimal Recommendation to Users that React: Online Learning for a Class of POMDPs"
4 / 4 papers shown
Title
AOTree: Aspect Order Tree-based Model for Explainable Recommendation
Wenxin Zhao
Peng Zhang
Hansu Gu
Dongsheng Li
Tun Lu
Ning Gu
36
0
0
29 Jul 2024
Finite-Time Analysis of Whittle Index based Q-Learning for Restless Multi-Armed Bandits with Neural Network Function Approximation
Guojun Xiong
Jian Li
38
13
0
03 Oct 2023
Optimistic Whittle Index Policy: Online Learning for Restless Bandits
Kai Wang
Lily Xu
Aparna Taneja
Milind Tambe
44
16
0
30 May 2022
Regret Bounds for Thompson Sampling in Episodic Restless Bandit Problems
Young Hun Jung
Ambuj Tewari
27
44
0
29 May 2019
1