ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2310.02147
  4. Cited By
Finite-Time Analysis of Whittle Index based Q-Learning for Restless
  Multi-Armed Bandits with Neural Network Function Approximation

Finite-Time Analysis of Whittle Index based Q-Learning for Restless Multi-Armed Bandits with Neural Network Function Approximation

3 October 2023
Guojun Xiong
Jian Li
ArXivPDFHTML

Papers citing "Finite-Time Analysis of Whittle Index based Q-Learning for Restless Multi-Armed Bandits with Neural Network Function Approximation"

11 / 11 papers shown
Title
DOPL: Direct Online Preference Learning for Restless Bandits with
  Preference Feedback
DOPL: Direct Online Preference Learning for Restless Bandits with Preference Feedback
Guojun Xiong
Ujwal Dinesha
Debajoy Mukherjee
Jian Li
Srinivas Shakkottai
50
2
0
07 Oct 2024
Improving the Prediction of Individual Engagement in Recommendations
  Using Cognitive Models
Improving the Prediction of Individual Engagement in Recommendations Using Cognitive Models
Roderick Seow
Yunfan Zhao
Duncan Wood
Milind Tambe
Cleotilde Gonzalez
28
4
0
28 Aug 2024
GINO-Q: Learning an Asymptotically Optimal Index Policy for Restless
  Multi-armed Bandits
GINO-Q: Learning an Asymptotically Optimal Index Policy for Restless Multi-armed Bandits
Gongpu Chen
Soung Chang Liew
Deniz Gunduz
17
1
0
19 Aug 2024
The Bandit Whisperer: Communication Learning for Restless Bandits
The Bandit Whisperer: Communication Learning for Restless Bandits
Yunfan Zhao
Tonghan Wang
Dheeraj M. Nagaraj
Aparna Taneja
Milind Tambe
52
5
0
11 Aug 2024
Tabular and Deep Learning for the Whittle Index
Tabular and Deep Learning for the Whittle Index
Francisco Robledo Relaño
Vivek Borkar
U. Ayesta
Konstantin Avrachenkov
31
2
0
04 Jun 2024
Provably Efficient Reinforcement Learning for Adversarial Restless
  Multi-Armed Bandits with Unknown Transitions and Bandit Feedback
Provably Efficient Reinforcement Learning for Adversarial Restless Multi-Armed Bandits with Unknown Transitions and Bandit Feedback
Guojun Xiong
Jian Li
33
1
0
02 May 2024
Online Restless Multi-Armed Bandits with Long-Term Fairness Constraints
Online Restless Multi-Armed Bandits with Long-Term Fairness Constraints
Shu-Fan Wang
Guojun Xiong
Jian Li
59
6
0
16 Dec 2023
Towards a Pretrained Model for Restless Bandits via Multi-arm
  Generalization
Towards a Pretrained Model for Restless Bandits via Multi-arm Generalization
Yunfan Zhao
Nikhil Behari
Edward Hughes
Edwin Zhang
Dheeraj M. Nagaraj
K. Tuyls
Aparna Taneja
Milind Tambe
29
8
0
23 Oct 2023
DeepTOP: Deep Threshold-Optimal Policy for MDPs and RMABs
DeepTOP: Deep Threshold-Optimal Policy for MDPs and RMABs
Khaled Nakhleh
I.-Hong Hou
72
6
0
18 Sep 2022
NeurWIN: Neural Whittle Index Network For Restless Bandits Via Deep RL
NeurWIN: Neural Whittle Index Network For Restless Bandits Via Deep RL
Khaled Nakhleh
Santosh Ganji
Ping-Chun Hsieh
I.-Hong Hou
S. Shakkottai
61
38
0
05 Oct 2021
Model-free Reinforcement Learning in Infinite-horizon Average-reward
  Markov Decision Processes
Model-free Reinforcement Learning in Infinite-horizon Average-reward Markov Decision Processes
Chen-Yu Wei
Mehdi Jafarnia-Jahromi
Haipeng Luo
Hiteshi Sharma
R. Jain
107
100
0
15 Oct 2019
1