2102.05406
Non-stationary Reinforcement Learning without Prior Knowledge: An Optimal Black-box Approach
10 February 2021
Chen-Yu Wei
Haipeng Luo
OffRL
Papers citing
"Non-stationary Reinforcement Learning without Prior Knowledge: An Optimal Black-box Approach"
32 / 32 papers shown
Tracking Most Significant Shifts in Infinite-Armed Bandits
Joe Suk
Jung-hun Kim
55
0
0
31 Jan 2025
Provably Efficient Reinforcement Learning with Multinomial Logit Function Approximation
Long-Fei Li
Yu-Jie Zhang
Peng Zhao
Zhi-Hua Zhou
101
4
0
17 Jan 2025
Lower Bounds for Time-Varying Kernelized Bandits
Xu Cai
Jonathan Scarlett
36
0
0
22 Oct 2024
Predictive Control and Regret Analysis of Non-Stationary MDP with Look-ahead Information
Ziyi Zhang
Yorie Nakahira
Guannan Qu
36
1
0
13 Sep 2024
MetaCURL: Non-stationary Concave Utility Reinforcement Learning
B. Moreno
Margaux Brégère
Pierre Gaillard
Nadia Oudjane
OffRL
39
0
0
30 May 2024
Fast TRAC: A Parameter-Free Optimizer for Lifelong Reinforcement Learning
Aneesh Muppidi
Zhiyu Zhang
Heng Yang
34
4
0
26 May 2024
Variance-Dependent Regret Bounds for Non-stationary Linear Bandits
Zhiyong Wang
Jize Xie
Yi Chen
J. C. Lui
Dongruo Zhou
28
0
0
15 Mar 2024
Harnessing the Power of Federated Learning in Federated Contextual Bandits
Chengshuai Shi
Ruida Zhou
Kun Yang
Cong Shen
FedML
21
0
0
26 Dec 2023
A Stability Principle for Learning under Non-Stationarity
Chengpiao Huang
Kaizheng Wang
39
2
0
27 Oct 2023
Meta-Learning Adversarial Bandit Algorithms
M. Khodak
Ilya Osadchiy
Keegan Harris
Maria-Florina Balcan
Kfir Y. Levy
Ron Meir
Zhiwei Steven Wu
FedML
28
2
0
05 Jul 2023
An Adaptive Method for Weak Supervision with Drifting Data
Alessio Mazzetto
Reza Esfandiarpoor
E. Upfal
Stephen H. Bach
65
1
0
02 Jun 2023
Online Reinforcement Learning in Periodic MDP
Ayush Aniket
Arpan Chattopadhyay
24
2
0
16 Mar 2023
Revisiting Weighted Strategy for Non-stationary Parametric Bandits
Jing Wang
Peng Zhao
Zhihong Zhou
30
5
0
05 Mar 2023
MNL-Bandit in non-stationary environments
Ayoub Foussoul
Vineet Goyal
Varun Gupta
34
2
0
04 Mar 2023
Online Reinforcement Learning in Non-Stationary Context-Driven Environments
Pouya Hamadanian
Arash Nasr-Esfahany
Malte Schwarzkopf
Siddartha Sen
MohammadIman Alizadeh
CLL
OffRL
50
0
0
04 Feb 2023
Rectified Pessimistic-Optimistic Learning for Stochastic Continuum-armed Bandit with Constraints
Heng Guo
Qi Zhu
Xin Liu
26
11
0
27 Nov 2022
Doubly Inhomogeneous Reinforcement Learning
Liyuan Hu
Mengbing Li
C. Shi
Zhanghua Wu
Piotr Fryzlewicz
OffRL
31
2
0
08 Nov 2022
Dynamic Regret of Online Markov Decision Processes
Peng Zhao
Longfei Li
Zhi-Hua Zhou
OffRL
27
17
0
26 Aug 2022
Performative Reinforcement Learning
Debmalya Mandal
Stelios Triantafyllou
Goran Radanović
33
17
0
30 Jun 2022
Open-environment Machine Learning
Zhi-Hua Zhou
VLM
37
133
0
01 Jun 2022
Testing Stationarity and Change Point Detection in Reinforcement Learning
Mengbing Li
C. Shi
Zhanghua Wu
Piotr Fryzlewicz
OffRL
42
9
0
03 Mar 2022
Non-stationary Bandits and Meta-Learning with a Small Set of Optimal Arms
Javad Azizi
T. Duong
Yasin Abbasi-Yadkori
András Gyorgy
Claire Vernade
Mohammad Ghavamzadeh
34
8
0
25 Feb 2022
Versatile Dueling Bandits: Best-of-both-World Analyses for Online Learning from Preferences
Aadirupa Saha
Pierre Gaillard
36
8
0
14 Feb 2022
Corralling a Larger Band of Bandits: A Case Study on Switching Regret for Linear Bandits
Haipeng Luo
Mengxiao Zhang
Peng Zhao
Zhi-Hua Zhou
31
17
0
12 Feb 2022
Recent Advances in Reinforcement Learning in Finance
B. Hambly
Renyuan Xu
Huining Yang
OffRL
27
166
0
08 Dec 2021
Optimal and Efficient Dynamic Regret Algorithms for Non-Stationary Dueling Bandits
Aadirupa Saha
Shubham Gupta
30
10
0
06 Nov 2021
Bandit Algorithms for Precision Medicine
Yangyi Lu
Ziping Xu
Ambuj Tewari
59
11
0
10 Aug 2021
Bayesian decision-making under misspecified priors with applications to meta-learning
Max Simchowitz
Christopher Tosh
A. Krishnamurthy
Daniel J. Hsu
Thodoris Lykouris
Miroslav Dudík
Robert Schapire
22
49
0
03 Jul 2021
Efficient Learning in Non-Stationary Linear Markov Decision Processes
Ahmed Touati
Pascal Vincent
37
29
0
24 Oct 2020
Bypassing the Monster: A Faster and Simpler Optimal Algorithm for Contextual Bandits under Realizability
D. Simchi-Levi
Yunzong Xu
OffRL
47
107
0
28 Mar 2020
Algorithms for Non-Stationary Generalized Linear Bandits
Yoan Russac
Olivier Cappé
Aurélien Garivier
43
23
0
23 Mar 2020
Weighted Linear Bandits for Non-Stationary Environments
Yoan Russac
Claire Vernade
Olivier Cappé
82
101
0
19 Sep 2019