Learning in A Changing World: Restless Multi-Armed Bandit with Unknown Dynamics

22 November 2010
Haoyang Liu
Keqin Liu
Qing Zhao

Papers citing "Learning in A Changing World: Restless Multi-Armed Bandit with Unknown Dynamics"

50 / 52 papers shown
Planning and Learning in Risk-Aware Restless Multi-Arm Bandit Problem
Nima Akbarzadeh
Erick Delage
Yossiri Adulyasak
35
0
0
30 Oct 2024
A Federated Online Restless Bandit Framework for Cooperative Resource Allocation
Jingwen Tong
Xinran Li
Liqun Fu
Jun Zhang
Khaled B. Letaief
50
1
0
12 Jun 2024
Human-in-the-loop Learning for Dynamic Congestion Games
Hongbo Li
Lingjie Duan
43
3
0
24 Apr 2024
Partially-Observable Sequential Change-Point Detection for Autocorrelated Data via Upper Confidence Region
Haijie Xu
Xiaochen Xian
Chen Zhang
Kaibo Liu
19
0
0
30 Mar 2024
SINR-Aware Deep Reinforcement Learning for Distributed Dynamic Channel Allocation in Cognitive Interference Networks
Yaniv Cohen
Tomer Gafni
Ronen Greenberg
Kobi Cohen
32
5
0
17 Feb 2024
Online Restless Multi-Armed Bandits with Long-Term Fairness Constraints
Shu-Fan Wang
Guojun Xiong
Jian Li
59
6
0
16 Dec 2023
Finite-Time Analysis of Whittle Index based Q-Learning for Restless Multi-Armed Bandits with Neural Network Function Approximation
Guojun Xiong
Jian Li
35
13
0
03 Oct 2023
Fair Multi-Agent Bandits
Amir Leshem
FedML
FaML
11
1
0
07 Jun 2023
Multi-Flow Transmission in Wireless Interference Networks: A Convergent Graph Learning Approach
Raz Paul
Kobi Cohen
Gil Kedar
34
5
0
27 Mar 2023
Client Selection for Generalization in Accelerated Federated Learning: A Multi-Armed Bandit Approach
Dan Ben Ami
Kobi Cohen
Qing Zhao
FedML
32
11
0
18 Mar 2023
A Definition of Non-Stationary Bandits
Yueyang Liu
Kuang Xu
Benjamin Van Roy
24
11
0
23 Feb 2023
Decision-Focused Evaluation: Analyzing Performance of Deployed Restless Multi-Arm Bandits
Paritosh Verma
Shresth Verma
Aditya Mate
Aparna Taneja
Milind Tambe
16
0
0
19 Jan 2023
Deterministic Sequencing of Exploration and Exploitation for Reinforcement Learning
Piyush B. Gupta
Vaibhav Srivastava
OffRL
29
3
0
12 Sep 2022
Fast Change Identification in Multi-Play Bandits and its Applications in Wireless Networks
Gourab Ghatak
45
1
0
20 May 2022
Restless Multi-Armed Bandits under Exogenous Global Markov Process
Tomer Gafni
M. Yemini
Kobi Cohen
27
3
0
28 Feb 2022
Whittle Index based Q-Learning for Wireless Edge Caching with Linear Function Approximation
Guojun Xiong
Shu-Fan Wang
Jian Li
Rahul Singh
27
6
0
26 Feb 2022
On learning Whittle index policy for restless bandits with scalable regret
N. Akbarzadeh
Aditya Mahajan
13
13
0
07 Feb 2022
Scalable Decision-Focused Learning in Restless Multi-Armed Bandits with Application to Maternal and Child Health
Kai Wang
Shresth Verma
Aditya Mate
Sanket Shah
Aparna Taneja
N. Madhiwalla
Aparna Hegde
Milind Tambe
21
12
0
02 Feb 2022
Using Non-Stationary Bandits for Learning in Repeated Cournot Games with Non-Stationary Demand
Kshitija Taywade
Brent Harrison
J. Goldsmith
33
3
0
03 Jan 2022
Learning in Restless Bandits under Exogenous Global Markov Process
Tomer Gafni
M. Yemini
Kobi Cohen
35
13
0
17 Dec 2021
Medium Access Control protocol for Collaborative Spectrum Learning in Wireless Networks
Tomer Boyarski
Wenbo Wang
Amir Leshem
18
3
0
25 Oct 2021
Deep Reinforcement Learning for Simultaneous Sensing and Channel Access in Cognitive Networks
Yoel Bokobza
R. Dabora
Kobi Cohen
47
13
0
24 Oct 2021
Reinforcement Learning for Finite-Horizon Restless Multi-Armed Multi-Action Bandits
Guojun Xiong
Jian Li
Rahul Singh
30
4
0
20 Sep 2021
Cooperative Stochastic Multi-agent Multi-armed Bandits Robust to Adversarial Corruptions
Junyan Liu
Shuai Li
Dapeng Li
20
6
0
08 Jun 2021
Learning to Detect an Odd Restless Markov Arm with a Trembling Hand
P. Karthik
R. Sundaresan
11
5
0
08 May 2021
Generalized non-stationary bandits
Anne Gael Manegueu
Alexandra Carpentier
Yi Yu
39
10
0
01 Feb 2021
Adaptive KL-UCB based Bandit Algorithms for Markovian and i.i.d. Settings
Arghyadip Roy
Sanjay Shakkottai
R. Srikant
16
2
0
14 Sep 2020
Robust Multi-Agent Multi-Armed Bandits
Daniel Vial
Sanjay Shakkottai
R. Srikant
14
36
0
07 Jul 2020
Detecting an Odd Restless Markov Arm with a Trembling Hand
P. Karthik
R. Sundaresan
8
6
0
13 May 2020
My Fair Bandit: Distributed Learning of Max-Min Fairness with Multi-player Bandits
Ilai Bistritz
Tavor Z. Baharav
Amir Leshem
Nicholas Bambos
FaML
15
37
0
23 Feb 2020
The Gossiping Insert-Eliminate Algorithm for Multi-Agent Bandits
Ronshee Chawla
Abishek Sankararaman
A. Ganesh
Sanjay Shakkottai
9
50
0
15 Jan 2020
Thompson Sampling in Non-Episodic Restless Bandits
Young Hun Jung
Marc Abeille
Ambuj Tewari
9
19
0
12 Oct 2019
Social Learning in Multi Agent Multi Armed Bandits
Abishek Sankararaman
A. Ganesh
Sanjay Shakkottai
26
84
0
04 Oct 2019
A Learning-Based Two-Stage Spectrum Sharing Strategy with Multiple Primary Transmit Power Levels
Rui Zhang
Peng Cheng
Zhuo Chen
Yonghui Li
Branka Vucetic
6
8
0
21 Jul 2019
Learning in Restless Multi-Armed Bandits via Adaptive Arm Sequencing Rules
Tomer Gafni
Kobi Cohen
9
22
0
19 Jun 2019
Regret Bounds for Thompson Sampling in Episodic Restless Bandit Problems
Young Hun Jung
Ambuj Tewari
19
44
0
29 May 2019
Multi-Player Bandits: The Adversarial Case
Pragnya Alatur
Kfir Y. Levy
Andreas Krause
AAML
14
37
0
21 Feb 2019
On Abruptly-Changing and Slowly-Varying Multiarmed Bandit Problems
Lai Wei
Vaibhav Srivastava
24
37
0
23 Feb 2018
Online Influence Maximization in Non-Stationary Social Networks
Yixin Bao
Xiaoke Wang
Zhi Wang
Chuan Wu
F. Lau
16
22
0
26 Apr 2016
Risk-Averse Multi-Armed Bandit Problems under Mean-Variance Measure
Sattar Vakili
Qing Zhao
13
88
0
18 Apr 2016
Time-Varying Gaussian Process Bandit Optimization
Ilija Bogunovic
Jonathan Scarlett
V. Cevher
69
95
0
25 Jan 2016
Multi-armed Bandits with Application to 5G Small Cells
S. Maghsudi
Ekram Hossain
18
120
0
02 Oct 2015
Episodic Multi-armed Bandits
Cem Tekin
M. Schaar
OffRL
28
0
0
04 Aug 2015
Regulating Greed Over Time in Multi-Armed Bandits
Stefano Tracà
Cynthia Rudin
Weiyu Yan
17
5
0
21 May 2015
On Regret-Optimal Learning in Decentralized Multi-player Multi-armed Bandits
Naumaan Nayyar
D. Kalathil
R. Jain
OffRL
20
76
0
04 May 2015
Distributed Online Learning in Social Recommender Systems
Cem Tekin
Simpson Zhang
M. Schaar
OffRL
36
65
0
26 Sep 2013
Distributed Online Learning via Cooperative Contextual Bandits
Cem Tekin
M. Schaar
FedML
29
53
0
21 Aug 2013
Towards Distribution-Free Multi-Armed Bandits with Combinatorial Strategies
Xiangyang Li
Shaojie Tang
Yaqin Zhou
46
0
0
20 Jul 2013
A Sensing Policy Based on Confidence Bounds and a Restless Multi-Armed Bandit Model
J. Oksanen
V. Koivunen
H. Vincent Poor
34
13
0
19 Nov 2012
Decentralized Learning for Multi-player Multi-armed Bandits
D. Kalathil
Naumaan Nayyar
R. Jain
43
44
0
14 Jun 2012