Regret Bounds for Thompson Sampling in Episodic Restless Bandit Problems

29 May 2019

Papers citing "Regret Bounds for Thompson Sampling in Episodic Restless Bandit Problems"

28 / 28 papers shown

Title
DOPL: Direct Online Preference Learning for Restless Bandits with Preference Feedback Guojun Xiong Ujwal Dinesha Debajoy Mukherjee Jian Li Srinivas Shakkottai 55 2 0 07 Oct 2024
Whittle Index Learning Algorithms for Restless Bandits with Constant Stepsizes Vishesh Mittal R. Meshram Surya Prakash 32 0 0 06 Sep 2024
A Federated Online Restless Bandit Framework for Cooperative Resource Allocation Jingwen Tong Xinran Li Liqun Fu Jun Zhang Khaled B. Letaief 55 1 0 12 Jun 2024
Restless Bandit Problem with Rewards Generated by a Linear Gaussian Dynamical System J. Gornet Bruno Sinopoli 39 0 0 15 May 2024
Structured Reinforcement Learning for Delay-Optimal Data Transmission in Dense mmWave Networks Shu-Fan Wang Guojun Xiong Shichen Zhang Huacheng Zeng Jian Li Shivendra Panwar 29 0 0 25 Apr 2024
A resource-constrained stochastic scheduling algorithm for homeless street outreach and gleaning edible food Conor M. Artman Aditya Mate Ezinne Nwankwo A. Heching Tsuyoshi Idé ... Kush R. Varshney Lauri Goldkind Gidi Kroch Jaclyn Sawyer Ian Watson 37 0 0 15 Mar 2024
Efficient Public Health Intervention Planning Using Decomposition-Based Decision-Focused Learning Sanket Shah A. Suggala Milind Tambe Aparna Taneja 27 0 0 08 Mar 2024
Fairness of Exposure in Online Restless Multi-armed Bandits Archit Sood Shweta Jain Sujit Gujar 40 1 0 09 Feb 2024
Online Restless Multi-Armed Bandits with Long-Term Fairness Constraints Shu-Fan Wang Guojun Xiong Jian Li 59 6 0 16 Dec 2023
Finite-Time Analysis of Whittle Index based Q-Learning for Restless Multi-Armed Bandits with Neural Network Function Approximation Guojun Xiong Jian Li 38 13 0 03 Oct 2023
Policy Optimization for Personalized Interventions in Behavioral Health Jackie Baek J. Boutilier Vivek F. Farias J. Jónasson Erez Yoeli OffRL 22 7 0 21 Mar 2023
A Definition of Non-Stationary Bandits Yueyang Liu Kuang Xu Benjamin Van Roy 24 11 0 23 Feb 2023
An Information-Theoretic Analysis of Nonstationary Bandit Learning Seungki Min Daniel Russo 34 6 0 09 Feb 2023
Decision-Focused Evaluation: Analyzing Performance of Deployed Restless Multi-Arm Bandits Paritosh Verma Shresth Verma Aditya Mate Aparna Taneja Milind Tambe 16 0 0 19 Jan 2023
Networked Restless Bandits with Positive Externalities Christine Herlihy John P. Dickerson 31 3 0 09 Dec 2022
Optimistic Whittle Index Policy: Online Learning for Restless Bandits Kai Wang Lily Xu Aparna Taneja Milind Tambe 41 16 0 30 May 2022
Whittle Index based Q-Learning for Wireless Edge Caching with Linear Function Approximation Guojun Xiong Shu-Fan Wang Jian Li Rahul Singh 33 6 0 26 Feb 2022
On learning Whittle index policy for restless bandits with scalable regret N. Akbarzadeh Aditya Mahajan 15 13 0 07 Feb 2022
Reinforcement Learning for Finite-Horizon Restless Multi-Armed Multi-Action Bandits Guojun Xiong Jian Li Rahul Singh 30 4 0 20 Sep 2021
Field Study in Deploying Restless Multi-Armed Bandits: Assisting Non-Profits in Improving Maternal and Child Health Aditya Mate Lovish Madaan Aparna Taneja N. Madhiwalla Shresth Verma Gargi Singh Aparna Hegde Pradeep Varakantham Milind Tambe 28 52 0 16 Sep 2021
Restless and Uncertain: Robust Policies for Restless Bandits via Deep Multi-Agent Reinforcement Learning J. Killian Lily Xu Arpita Biswas Milind Tambe 27 6 0 04 Jul 2021
Reinforcement Learning for Markovian Bandits: Is Posterior Sampling more Scalable than Optimism? Nicolas Gast B. Gaujal K. Khun 25 2 0 16 Jun 2021
Planning to Fairly Allocate: Probabilistic Fairness in the Restless Bandit Setting Christine Herlihy Aviva Prins A. Srinivasan John P. Dickerson 38 14 0 14 Jun 2021
Learning Augmented Index Policy for Optimal Service Placement at the Network Edge Guojun Xiong Rahul Singh Jian Li 24 9 0 10 Jan 2021
Restless-UCB, an Efficient and Low-complexity Algorithm for Online Restless Bandits Siwei Wang Longbo Huang John C. S. Lui OffRL 32 38 0 05 Nov 2020
Screening for an Infectious Disease as a Problem in Stochastic Control Jakub Mareˇcek 14 3 0 01 Nov 2020
Collapsing Bandits and Their Application to Public Health Interventions Aditya Mate J. Killian Haifeng Xu Andrew Perrault Milind Tambe 17 64 0 05 Jul 2020
Thompson Sampling in Non-Episodic Restless Bandits Young Hun Jung Marc Abeille Ambuj Tewari 9 19 0 12 Oct 2019