ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2011.02664
  4. Cited By
Restless-UCB, an Efficient and Low-complexity Algorithm for Online
  Restless Bandits

Restless-UCB, an Efficient and Low-complexity Algorithm for Online Restless Bandits

5 November 2020
Siwei Wang
Longbo Huang
John C. S. Lui
    OffRL
ArXivPDFHTML

Papers citing "Restless-UCB, an Efficient and Low-complexity Algorithm for Online Restless Bandits"

19 / 19 papers shown
Title
On the Low-Complexity of Fair Learning for Combinatorial Multi-Armed Bandit
On the Low-Complexity of Fair Learning for Combinatorial Multi-Armed Bandit
Xiaoyi Wu
Bo Ji
Bin Li
FaML
46
0
0
01 Jan 2025
DOPL: Direct Online Preference Learning for Restless Bandits with
  Preference Feedback
DOPL: Direct Online Preference Learning for Restless Bandits with Preference Feedback
Guojun Xiong
Ujwal Dinesha
Debajoy Mukherjee
Jian Li
Srinivas Shakkottai
50
2
0
07 Oct 2024
Whittle Index Learning Algorithms for Restless Bandits with Constant
  Stepsizes
Whittle Index Learning Algorithms for Restless Bandits with Constant Stepsizes
Vishesh Mittal
R. Meshram
Surya Prakash
29
0
0
06 Sep 2024
A Federated Online Restless Bandit Framework for Cooperative Resource
  Allocation
A Federated Online Restless Bandit Framework for Cooperative Resource Allocation
Jingwen Tong
Xinran Li
Liqun Fu
Jun Zhang
Khaled B. Letaief
50
1
0
12 Jun 2024
Tabular and Deep Learning for the Whittle Index
Tabular and Deep Learning for the Whittle Index
Francisco Robledo Relaño
Vivek Borkar
U. Ayesta
Konstantin Avrachenkov
31
2
0
04 Jun 2024
Restless Bandit Problem with Rewards Generated by a Linear Gaussian
  Dynamical System
Restless Bandit Problem with Rewards Generated by a Linear Gaussian Dynamical System
J. Gornet
Bruno Sinopoli
39
0
0
15 May 2024
Provably Efficient Reinforcement Learning for Adversarial Restless
  Multi-Armed Bandits with Unknown Transitions and Bandit Feedback
Provably Efficient Reinforcement Learning for Adversarial Restless Multi-Armed Bandits with Unknown Transitions and Bandit Feedback
Guojun Xiong
Jian Li
33
1
0
02 May 2024
Structured Reinforcement Learning for Delay-Optimal Data Transmission in
  Dense mmWave Networks
Structured Reinforcement Learning for Delay-Optimal Data Transmission in Dense mmWave Networks
Shu-Fan Wang
Guojun Xiong
Shichen Zhang
Huacheng Zeng
Jian Li
Shivendra Panwar
29
0
0
25 Apr 2024
Online Restless Multi-Armed Bandits with Long-Term Fairness Constraints
Online Restless Multi-Armed Bandits with Long-Term Fairness Constraints
Shu-Fan Wang
Guojun Xiong
Jian Li
59
6
0
16 Dec 2023
Finite-Time Analysis of Whittle Index based Q-Learning for Restless
  Multi-Armed Bandits with Neural Network Function Approximation
Finite-Time Analysis of Whittle Index based Q-Learning for Restless Multi-Armed Bandits with Neural Network Function Approximation
Guojun Xiong
Jian Li
38
13
0
03 Oct 2023
Policy Optimization for Personalized Interventions in Behavioral Health
Policy Optimization for Personalized Interventions in Behavioral Health
Jackie Baek
J. Boutilier
Vivek F. Farias
J. Jónasson
Erez Yoeli
OffRL
17
7
0
21 Mar 2023
Approximately Stationary Bandits with Knapsacks
Approximately Stationary Bandits with Knapsacks
Giannis Fikioris
Éva Tardos
AAML
21
7
0
28 Feb 2023
Decision-Focused Evaluation: Analyzing Performance of Deployed Restless
  Multi-Arm Bandits
Decision-Focused Evaluation: Analyzing Performance of Deployed Restless Multi-Arm Bandits
Paritosh Verma
Shresth Verma
Aditya Mate
Aparna Taneja
Milind Tambe
16
0
0
19 Jan 2023
Stochastic Rising Bandits
Stochastic Rising Bandits
Alberto Maria Metelli
F. Trovò
Matteo Pirola
Marcello Restelli
17
16
0
07 Dec 2022
Optimistic Whittle Index Policy: Online Learning for Restless Bandits
Optimistic Whittle Index Policy: Online Learning for Restless Bandits
Kai Wang
Lily Xu
Aparna Taneja
Milind Tambe
39
16
0
30 May 2022
Whittle Index based Q-Learning for Wireless Edge Caching with Linear
  Function Approximation
Whittle Index based Q-Learning for Wireless Edge Caching with Linear Function Approximation
Guojun Xiong
Shu-Fan Wang
Jian Li
Rahul Singh
30
6
0
26 Feb 2022
Reinforcement Learning for Finite-Horizon Restless Multi-Armed
  Multi-Action Bandits
Reinforcement Learning for Finite-Horizon Restless Multi-Armed Multi-Action Bandits
Guojun Xiong
Jian Li
Rahul Singh
30
4
0
20 Sep 2021
Restless and Uncertain: Robust Policies for Restless Bandits via Deep
  Multi-Agent Reinforcement Learning
Restless and Uncertain: Robust Policies for Restless Bandits via Deep Multi-Agent Reinforcement Learning
J. Killian
Lily Xu
Arpita Biswas
Milind Tambe
27
6
0
04 Jul 2021
Reinforcement Learning for Markovian Bandits: Is Posterior Sampling more
  Scalable than Optimism?
Reinforcement Learning for Markovian Bandits: Is Posterior Sampling more Scalable than Optimism?
Nicolas Gast
B. Gaujal
K. Khun
15
2
0
16 Jun 2021
1