Finite-Time Analysis of Whittle Index based Q-Learning for Restless Multi-Armed Bandits with Neural Network Function Approximation

3 October 2023

Papers citing "Finite-Time Analysis of Whittle Index based Q-Learning for Restless Multi-Armed Bandits with Neural Network Function Approximation"

11 / 11 papers shown

Title
DOPL: Direct Online Preference Learning for Restless Bandits with Preference Feedback Guojun Xiong Ujwal Dinesha Debajoy Mukherjee Jian Li Srinivas Shakkottai 50 2 0 07 Oct 2024
Improving the Prediction of Individual Engagement in Recommendations Using Cognitive Models Roderick Seow Yunfan Zhao Duncan Wood Milind Tambe Cleotilde Gonzalez 28 4 0 28 Aug 2024
GINO-Q: Learning an Asymptotically Optimal Index Policy for Restless Multi-armed Bandits Gongpu Chen Soung Chang Liew Deniz Gunduz 17 1 0 19 Aug 2024
The Bandit Whisperer: Communication Learning for Restless Bandits Yunfan Zhao Tonghan Wang Dheeraj M. Nagaraj Aparna Taneja Milind Tambe 52 5 0 11 Aug 2024
Tabular and Deep Learning for the Whittle Index Francisco Robledo Relaño Vivek Borkar U. Ayesta Konstantin Avrachenkov 31 2 0 04 Jun 2024
Provably Efficient Reinforcement Learning for Adversarial Restless Multi-Armed Bandits with Unknown Transitions and Bandit Feedback Guojun Xiong Jian Li 33 1 0 02 May 2024
Online Restless Multi-Armed Bandits with Long-Term Fairness Constraints Shu-Fan Wang Guojun Xiong Jian Li 59 6 0 16 Dec 2023
Towards a Pretrained Model for Restless Bandits via Multi-arm Generalization Yunfan Zhao Nikhil Behari Edward Hughes Edwin Zhang Dheeraj M. Nagaraj K. Tuyls Aparna Taneja Milind Tambe 29 8 0 23 Oct 2023
DeepTOP: Deep Threshold-Optimal Policy for MDPs and RMABs Khaled Nakhleh I.-Hong Hou 72 6 0 18 Sep 2022
NeurWIN: Neural Whittle Index Network For Restless Bandits Via Deep RL Khaled Nakhleh Santosh Ganji Ping-Chun Hsieh I.-Hong Hou S. Shakkottai 61 38 0 05 Oct 2021
Model-free Reinforcement Learning in Infinite-horizon Average-reward Markov Decision Processes Chen-Yu Wei Mehdi Jafarnia-Jahromi Haipeng Luo Hiteshi Sharma R. Jain 107 100 0 15 Oct 2019