Optimal Recommendation to Users that React: Online Learning for a Class of POMDPs

30 March 2016

Papers citing "Optimal Recommendation to Users that React: Online Learning for a Class of POMDPs"

4 / 4 papers shown

Title
AOTree: Aspect Order Tree-based Model for Explainable Recommendation Wenxin Zhao Peng Zhang Hansu Gu Dongsheng Li Tun Lu Ning Gu 36 0 0 29 Jul 2024
Finite-Time Analysis of Whittle Index based Q-Learning for Restless Multi-Armed Bandits with Neural Network Function Approximation Guojun Xiong Jian Li 38 13 0 03 Oct 2023
Optimistic Whittle Index Policy: Online Learning for Restless Bandits Kai Wang Lily Xu Aparna Taneja Milind Tambe 44 16 0 30 May 2022
Regret Bounds for Thompson Sampling in Episodic Restless Bandit Problems Young Hun Jung Ambuj Tewari 27 44 0 29 May 2019