ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1905.12673
  4. Cited By
Regret Bounds for Thompson Sampling in Episodic Restless Bandit Problems

Regret Bounds for Thompson Sampling in Episodic Restless Bandit Problems

29 May 2019
Young Hun Jung
Ambuj Tewari
ArXivPDFHTML

Papers citing "Regret Bounds for Thompson Sampling in Episodic Restless Bandit Problems"

28 / 28 papers shown
Title
DOPL: Direct Online Preference Learning for Restless Bandits with
  Preference Feedback
DOPL: Direct Online Preference Learning for Restless Bandits with Preference Feedback
Guojun Xiong
Ujwal Dinesha
Debajoy Mukherjee
Jian Li
Srinivas Shakkottai
55
2
0
07 Oct 2024
Whittle Index Learning Algorithms for Restless Bandits with Constant
  Stepsizes
Whittle Index Learning Algorithms for Restless Bandits with Constant Stepsizes
Vishesh Mittal
R. Meshram
Surya Prakash
32
0
0
06 Sep 2024
A Federated Online Restless Bandit Framework for Cooperative Resource
  Allocation
A Federated Online Restless Bandit Framework for Cooperative Resource Allocation
Jingwen Tong
Xinran Li
Liqun Fu
Jun Zhang
Khaled B. Letaief
55
1
0
12 Jun 2024
Restless Bandit Problem with Rewards Generated by a Linear Gaussian
  Dynamical System
Restless Bandit Problem with Rewards Generated by a Linear Gaussian Dynamical System
J. Gornet
Bruno Sinopoli
39
0
0
15 May 2024
Structured Reinforcement Learning for Delay-Optimal Data Transmission in
  Dense mmWave Networks
Structured Reinforcement Learning for Delay-Optimal Data Transmission in Dense mmWave Networks
Shu-Fan Wang
Guojun Xiong
Shichen Zhang
Huacheng Zeng
Jian Li
Shivendra Panwar
29
0
0
25 Apr 2024
A resource-constrained stochastic scheduling algorithm for homeless
  street outreach and gleaning edible food
A resource-constrained stochastic scheduling algorithm for homeless street outreach and gleaning edible food
Conor M. Artman
Aditya Mate
Ezinne Nwankwo
A. Heching
Tsuyoshi Idé
...
Kush R. Varshney
Lauri Goldkind
Gidi Kroch
Jaclyn Sawyer
Ian Watson
37
0
0
15 Mar 2024
Efficient Public Health Intervention Planning Using Decomposition-Based
  Decision-Focused Learning
Efficient Public Health Intervention Planning Using Decomposition-Based Decision-Focused Learning
Sanket Shah
A. Suggala
Milind Tambe
Aparna Taneja
27
0
0
08 Mar 2024
Fairness of Exposure in Online Restless Multi-armed Bandits
Fairness of Exposure in Online Restless Multi-armed Bandits
Archit Sood
Shweta Jain
Sujit Gujar
40
1
0
09 Feb 2024
Online Restless Multi-Armed Bandits with Long-Term Fairness Constraints
Online Restless Multi-Armed Bandits with Long-Term Fairness Constraints
Shu-Fan Wang
Guojun Xiong
Jian Li
59
6
0
16 Dec 2023
Finite-Time Analysis of Whittle Index based Q-Learning for Restless
  Multi-Armed Bandits with Neural Network Function Approximation
Finite-Time Analysis of Whittle Index based Q-Learning for Restless Multi-Armed Bandits with Neural Network Function Approximation
Guojun Xiong
Jian Li
38
13
0
03 Oct 2023
Policy Optimization for Personalized Interventions in Behavioral Health
Policy Optimization for Personalized Interventions in Behavioral Health
Jackie Baek
J. Boutilier
Vivek F. Farias
J. Jónasson
Erez Yoeli
OffRL
22
7
0
21 Mar 2023
A Definition of Non-Stationary Bandits
A Definition of Non-Stationary Bandits
Yueyang Liu
Kuang Xu
Benjamin Van Roy
24
11
0
23 Feb 2023
An Information-Theoretic Analysis of Nonstationary Bandit Learning
An Information-Theoretic Analysis of Nonstationary Bandit Learning
Seungki Min
Daniel Russo
34
6
0
09 Feb 2023
Decision-Focused Evaluation: Analyzing Performance of Deployed Restless
  Multi-Arm Bandits
Decision-Focused Evaluation: Analyzing Performance of Deployed Restless Multi-Arm Bandits
Paritosh Verma
Shresth Verma
Aditya Mate
Aparna Taneja
Milind Tambe
16
0
0
19 Jan 2023
Networked Restless Bandits with Positive Externalities
Networked Restless Bandits with Positive Externalities
Christine Herlihy
John P. Dickerson
31
3
0
09 Dec 2022
Optimistic Whittle Index Policy: Online Learning for Restless Bandits
Optimistic Whittle Index Policy: Online Learning for Restless Bandits
Kai Wang
Lily Xu
Aparna Taneja
Milind Tambe
41
16
0
30 May 2022
Whittle Index based Q-Learning for Wireless Edge Caching with Linear
  Function Approximation
Whittle Index based Q-Learning for Wireless Edge Caching with Linear Function Approximation
Guojun Xiong
Shu-Fan Wang
Jian Li
Rahul Singh
33
6
0
26 Feb 2022
On learning Whittle index policy for restless bandits with scalable
  regret
On learning Whittle index policy for restless bandits with scalable regret
N. Akbarzadeh
Aditya Mahajan
15
13
0
07 Feb 2022
Reinforcement Learning for Finite-Horizon Restless Multi-Armed
  Multi-Action Bandits
Reinforcement Learning for Finite-Horizon Restless Multi-Armed Multi-Action Bandits
Guojun Xiong
Jian Li
Rahul Singh
30
4
0
20 Sep 2021
Field Study in Deploying Restless Multi-Armed Bandits: Assisting
  Non-Profits in Improving Maternal and Child Health
Field Study in Deploying Restless Multi-Armed Bandits: Assisting Non-Profits in Improving Maternal and Child Health
Aditya Mate
Lovish Madaan
Aparna Taneja
N. Madhiwalla
Shresth Verma
Gargi Singh
Aparna Hegde
Pradeep Varakantham
Milind Tambe
28
52
0
16 Sep 2021
Restless and Uncertain: Robust Policies for Restless Bandits via Deep
  Multi-Agent Reinforcement Learning
Restless and Uncertain: Robust Policies for Restless Bandits via Deep Multi-Agent Reinforcement Learning
J. Killian
Lily Xu
Arpita Biswas
Milind Tambe
27
6
0
04 Jul 2021
Reinforcement Learning for Markovian Bandits: Is Posterior Sampling more
  Scalable than Optimism?
Reinforcement Learning for Markovian Bandits: Is Posterior Sampling more Scalable than Optimism?
Nicolas Gast
B. Gaujal
K. Khun
25
2
0
16 Jun 2021
Planning to Fairly Allocate: Probabilistic Fairness in the Restless
  Bandit Setting
Planning to Fairly Allocate: Probabilistic Fairness in the Restless Bandit Setting
Christine Herlihy
Aviva Prins
A. Srinivasan
John P. Dickerson
38
14
0
14 Jun 2021
Learning Augmented Index Policy for Optimal Service Placement at the
  Network Edge
Learning Augmented Index Policy for Optimal Service Placement at the Network Edge
Guojun Xiong
Rahul Singh
Jian Li
24
9
0
10 Jan 2021
Restless-UCB, an Efficient and Low-complexity Algorithm for Online
  Restless Bandits
Restless-UCB, an Efficient and Low-complexity Algorithm for Online Restless Bandits
Siwei Wang
Longbo Huang
John C. S. Lui
OffRL
32
38
0
05 Nov 2020
Screening for an Infectious Disease as a Problem in Stochastic Control
Screening for an Infectious Disease as a Problem in Stochastic Control
Jakub Mareˇcek
14
3
0
01 Nov 2020
Collapsing Bandits and Their Application to Public Health Interventions
Collapsing Bandits and Their Application to Public Health Interventions
Aditya Mate
J. Killian
Haifeng Xu
Andrew Perrault
Milind Tambe
17
64
0
05 Jul 2020
Thompson Sampling in Non-Episodic Restless Bandits
Thompson Sampling in Non-Episodic Restless Bandits
Young Hun Jung
Marc Abeille
Ambuj Tewari
9
19
0
12 Oct 2019
1