ResearchTrend.AI
Learn to Intervene: An Adaptive Learning Policy for Restless Bandits in Application to Preventive Healthcare

17 May 2021
Arpita Biswas
Gaurav Aggarwal
Pradeep Varakantham
Milind Tambe

Papers citing "Learn to Intervene: An Adaptive Learning Policy for Restless Bandits in Application to Preventive Healthcare"

29 / 29 papers shown
Title
Projection-based Lyapunov method for fully heterogeneous weakly-coupled MDPs
Xiangcheng Zhang
Yige Hong
Weina Wang
56
0
0
09 Feb 2025
Lagrangian Index Policy for Restless Bandits with Average Reward
Konstantin Avrachenkov
Vivek Borkar
Pratik Shah
90
0
0
17 Dec 2024
DOPL: Direct Online Preference Learning for Restless Bandits with Preference Feedback
Guojun Xiong
Ujwal Dinesha
Debajoy Mukherjee
Jian Li
Srinivas Shakkottai
57
2
0
07 Oct 2024
The Digital Transformation in Health: How AI Can Improve the Performance of Health Systems
África Periánez
Ana Fernández del Río
Ivan Nazarov
Enric Jané
Moiz Hassan
Aditya Rastogi
Dexian Tang
54
11
0
24 Sep 2024
GINO-Q: Learning an Asymptotically Optimal Index Policy for Restless Multi-armed Bandits
Gongpu Chen
Soung Chang Liew
Deniz Gunduz
27
1
0
19 Aug 2024
Adaptive User Journeys in Pharma E-Commerce with Reinforcement Learning: Insights from SwipeRx
Ana Fernández del Río
Michael Brennan Leong
Paulo Saraiva
Ivan Nazarov
Aditya Rastogi
Moiz Hassan
Dexian Tang
África Periánez
OffRL
OnRL
49
2
0
15 Aug 2024
Optimizing HIV Patient Engagement with Reinforcement Learning in Resource-Limited Settings
África Periánez
Kathrin Schmitz
Lazola Makhupula
Moiz Hassan
Moeti Moleko
Ana Fernández del Río
Ivan Nazarov
Aditya Rastogi
Dexian Tang
OffRL
42
0
0
14 Aug 2024
The Bandit Whisperer: Communication Learning for Restless Bandits
Yunfan Zhao
Tonghan Wang
Dheeraj M. Nagaraj
Aparna Taneja
Milind Tambe
57
5
0
11 Aug 2024
EduQate: Generating Adaptive Curricula through RMABs in Education Settings
Sidney Tio
Dexun Li
Pradeep Varakantham
OffRL
16
0
0
20 Jun 2024
An Index Policy Based on Sarsa and Q-learning for Heterogeneous Smart Target Tracking
Yuhang Hao
Zengfu Wang
Jing-Zhi Fu
Quan Pan
49
0
0
19 Feb 2024
Online Restless Multi-Armed Bandits with Long-Term Fairness Constraints
Shu-Fan Wang
Guojun Xiong
Jian Li
62
6
0
16 Dec 2023
Towards a Pretrained Model for Restless Bandits via Multi-arm Generalization
Yunfan Zhao
Nikhil Behari
Edward Hughes
Edwin Zhang
Dheeraj M. Nagaraj
K. Tuyls
Aparna Taneja
Milind Tambe
37
8
0
23 Oct 2023
Finite-Time Analysis of Whittle Index based Q-Learning for Restless Multi-Armed Bandits with Neural Network Function Approximation
Guojun Xiong
Jian Li
47
13
0
03 Oct 2023
Policy Optimization for Personalized Interventions in Behavioral Health
Jackie Baek
J. Boutilier
Vivek F. Farias
J. Jónasson
Erez Yoeli
OffRL
27
7
0
21 Mar 2023
Improved Policy Evaluation for Randomized Trials of Algorithmic Resource Allocation
Aditya Mate
Bryan Wilder
Aparna Taneja
Milind Tambe
OffRL
14
3
0
06 Feb 2023
Data-pooling Reinforcement Learning for Personalized Healthcare Intervention
Xinyun Chen
P. Shi
Shanwen Pu
OffRL
35
4
0
16 Nov 2022
DeepTOP: Deep Threshold-Optimal Policy for MDPs and RMABs
Khaled Nakhleh
I.-Hong Hou
77
6
0
18 Sep 2022
On-the-fly Adaptation of Patrolling Strategies in Changing Environments
Tomáš Brázdil
David Klaška
Antonín Kučera
Vít Musil
Petr Novotný
Vojtěch Řehák
TTA
AAML
14
0
0
16 Jun 2022
Efficient Resource Allocation with Fairness Constraints in Restless Multi-Armed Bandits
Dexun Li
Pradeep Varakantham
13
9
0
08 Jun 2022
Optimistic Whittle Index Policy: Online Learning for Restless Bandits
Kai Wang
Lily Xu
Aparna Taneja
Milind Tambe
50
16
0
30 May 2022
Near-optimality for infinite-horizon restless bandits with many arms
Xinming Zhang
P. Frazier
9
14
0
29 Mar 2022
Whittle Index based Q-Learning for Wireless Edge Caching with Linear Function Approximation
Guojun Xiong
Shu-Fan Wang
Jian Li
Rahul Singh
38
6
0
26 Feb 2022
Minimizing Expected Intrusion Detection Time in Adversarial Patrolling
David Klaška
Antonín Kučera
Vít Musil
Vojtěch Řehák
AAML
16
0
0
02 Feb 2022
Networked Restless Multi-Armed Bandits for Mobile Interventions
H. Ou
Christoph Siebenbrunner
J. Killian
M. Brooks
David Kempe
Yevgeniy Vorobeychik
Milind Tambe
54
7
0
28 Jan 2022
NeurWIN: Neural Whittle Index Network For Restless Bandits Via Deep RL
Khaled Nakhleh
Santosh Ganji
Ping-Chun Hsieh
I.-Hong Hou
S. Shakkottai
61
38
0
05 Oct 2021
Field Study in Deploying Restless Multi-Armed Bandits: Assisting Non-Profits in Improving Maternal and Child Health
Aditya Mate
Lovish Madaan
Aparna Taneja
N. Madhiwalla
Shresth Verma
Gargi Singh
Aparna Hegde
Pradeep Varakantham
Milind Tambe
33
52
0
16 Sep 2021
Restless and Uncertain: Robust Policies for Restless Bandits via Deep Multi-Agent Reinforcement Learning
J. Killian
Lily Xu
Arpita Biswas
Milind Tambe
32
6
0
04 Jul 2021
Q-Learning Lagrange Policies for Multi-Action Restless Bandits
J. Killian
Arpita Biswas
Sanket Shah
Milind Tambe
OffRL
38
33
0
22 Jun 2021
Efficient Algorithms for Finite Horizon and Streaming Restless Multi-Armed Bandit Problems
Aditya Mate
Arpita Biswas
Christoph Siebenbrunner
Susobhan Ghosh
Milind Tambe
29
9
0
08 Mar 2021