ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2102.05406
  4. Cited By
Non-stationary Reinforcement Learning without Prior Knowledge: An
  Optimal Black-box Approach

Non-stationary Reinforcement Learning without Prior Knowledge: An Optimal Black-box Approach

10 February 2021
Chen-Yu Wei
Haipeng Luo
    OffRL
ArXivPDFHTML

Papers citing "Non-stationary Reinforcement Learning without Prior Knowledge: An Optimal Black-box Approach"

32 / 32 papers shown
Title
Tracking Most Significant Shifts in Infinite-Armed Bandits
Joe Suk
Jung-hun Kim
55
0
0
31 Jan 2025
Provably Efficient Reinforcement Learning with Multinomial Logit Function Approximation
Provably Efficient Reinforcement Learning with Multinomial Logit Function Approximation
Long-Fei Li
Yu-Jie Zhang
Peng Zhao
Zhi-Hua Zhou
101
4
0
17 Jan 2025
Lower Bounds for Time-Varying Kernelized Bandits
Lower Bounds for Time-Varying Kernelized Bandits
Xu Cai
Jonathan Scarlett
36
0
0
22 Oct 2024
Predictive Control and Regret Analysis of Non-Stationary MDP with
  Look-ahead Information
Predictive Control and Regret Analysis of Non-Stationary MDP with Look-ahead Information
Ziyi Zhang
Yorie Nakahira
Guannan Qu
36
1
0
13 Sep 2024
MetaCURL: Non-stationary Concave Utility Reinforcement Learning
MetaCURL: Non-stationary Concave Utility Reinforcement Learning
B. Moreno
Margaux Brégère
Pierre Gaillard
Nadia Oudjane
OffRL
39
0
0
30 May 2024
Fast TRAC: A Parameter-Free Optimizer for Lifelong Reinforcement
  Learning
Fast TRAC: A Parameter-Free Optimizer for Lifelong Reinforcement Learning
Aneesh Muppidi
Zhiyu Zhang
Heng Yang
34
4
0
26 May 2024
Variance-Dependent Regret Bounds for Non-stationary Linear Bandits
Variance-Dependent Regret Bounds for Non-stationary Linear Bandits
Zhiyong Wang
Jize Xie
Yi Chen
J. C. Lui
Dongruo Zhou
28
0
0
15 Mar 2024
Harnessing the Power of Federated Learning in Federated Contextual
  Bandits
Harnessing the Power of Federated Learning in Federated Contextual Bandits
Chengshuai Shi
Ruida Zhou
Kun Yang
Cong Shen
FedML
21
0
0
26 Dec 2023
A Stability Principle for Learning under Non-Stationarity
A Stability Principle for Learning under Non-Stationarity
Chengpiao Huang
Kaizheng Wang
39
2
0
27 Oct 2023
Meta-Learning Adversarial Bandit Algorithms
Meta-Learning Adversarial Bandit Algorithms
M. Khodak
Ilya Osadchiy
Keegan Harris
Maria-Florina Balcan
Kfir Y. Levy
Ron Meir
Zhiwei Steven Wu
FedML
28
2
0
05 Jul 2023
An Adaptive Method for Weak Supervision with Drifting Data
An Adaptive Method for Weak Supervision with Drifting Data
Alessio Mazzetto
Reza Esfandiarpoor
E. Upfal
Stephen H. Bach
Stephen H. Bach
65
1
0
02 Jun 2023
Online Reinforcement Learning in Periodic MDP
Online Reinforcement Learning in Periodic MDP
Ayush Aniket
Arpan Chattopadhyay
24
2
0
16 Mar 2023
Revisiting Weighted Strategy for Non-stationary Parametric Bandits
Revisiting Weighted Strategy for Non-stationary Parametric Bandits
Jing Wang
Peng Zhao
Zhihong Zhou
30
5
0
05 Mar 2023
MNL-Bandit in non-stationary environments
MNL-Bandit in non-stationary environments
Ayoub Foussoul
Vineet Goyal
Varun Gupta
34
2
0
04 Mar 2023
Online Reinforcement Learning in Non-Stationary Context-Driven Environments
Online Reinforcement Learning in Non-Stationary Context-Driven Environments
Pouya Hamadanian
Arash Nasr-Esfahany
Malte Schwarzkopf
Siddartha Sen
MohammadIman Alizadeh
CLL
OffRL
50
0
0
04 Feb 2023
Rectified Pessimistic-Optimistic Learning for Stochastic Continuum-armed
  Bandit with Constraints
Rectified Pessimistic-Optimistic Learning for Stochastic Continuum-armed Bandit with Constraints
Heng Guo
Qi Zhu
Xin Liu
26
11
0
27 Nov 2022
Doubly Inhomogeneous Reinforcement Learning
Doubly Inhomogeneous Reinforcement Learning
Liyuan Hu
Mengbing Li
C. Shi
Zhanghua Wu
Piotr Fryzlewicz
OffRL
31
2
0
08 Nov 2022
Dynamic Regret of Online Markov Decision Processes
Dynamic Regret of Online Markov Decision Processes
Peng Zhao
Longfei Li
Zhi-Hua Zhou
OffRL
27
17
0
26 Aug 2022
Performative Reinforcement Learning
Performative Reinforcement Learning
Debmalya Mandal
Stelios Triantafyllou
Goran Radanović
33
17
0
30 Jun 2022
Open-environment Machine Learning
Open-environment Machine Learning
Zhi-Hua Zhou
VLM
37
133
0
01 Jun 2022
Testing Stationarity and Change Point Detection in Reinforcement Learning
Testing Stationarity and Change Point Detection in Reinforcement Learning
Mengbing Li
C. Shi
Zhanghua Wu
Piotr Fryzlewicz
OffRL
42
9
0
03 Mar 2022
Non-stationary Bandits and Meta-Learning with a Small Set of Optimal
  Arms
Non-stationary Bandits and Meta-Learning with a Small Set of Optimal Arms
Javad Azizi
T. Duong
Yasin Abbasi-Yadkori
András Gyorgy
Claire Vernade
Mohammad Ghavamzadeh
34
8
0
25 Feb 2022
Versatile Dueling Bandits: Best-of-both-World Analyses for Online
  Learning from Preferences
Versatile Dueling Bandits: Best-of-both-World Analyses for Online Learning from Preferences
Aadirupa Saha
Pierre Gaillard
36
8
0
14 Feb 2022
Corralling a Larger Band of Bandits: A Case Study on Switching Regret
  for Linear Bandits
Corralling a Larger Band of Bandits: A Case Study on Switching Regret for Linear Bandits
Haipeng Luo
Mengxiao Zhang
Peng Zhao
Zhi-Hua Zhou
31
17
0
12 Feb 2022
Recent Advances in Reinforcement Learning in Finance
Recent Advances in Reinforcement Learning in Finance
B. Hambly
Renyuan Xu
Huining Yang
OffRL
27
166
0
08 Dec 2021
Optimal and Efficient Dynamic Regret Algorithms for Non-Stationary
  Dueling Bandits
Optimal and Efficient Dynamic Regret Algorithms for Non-Stationary Dueling Bandits
Aadirupa Saha
Shubham Gupta
30
10
0
06 Nov 2021
Bandit Algorithms for Precision Medicine
Bandit Algorithms for Precision Medicine
Yangyi Lu
Ziping Xu
Ambuj Tewari
59
11
0
10 Aug 2021
Bayesian decision-making under misspecified priors with applications to
  meta-learning
Bayesian decision-making under misspecified priors with applications to meta-learning
Max Simchowitz
Christopher Tosh
A. Krishnamurthy
Daniel J. Hsu
Thodoris Lykouris
Miroslav Dudík
Robert Schapire
22
49
0
03 Jul 2021
Efficient Learning in Non-Stationary Linear Markov Decision Processes
Efficient Learning in Non-Stationary Linear Markov Decision Processes
Ahmed Touati
Pascal Vincent
37
29
0
24 Oct 2020
Bypassing the Monster: A Faster and Simpler Optimal Algorithm for
  Contextual Bandits under Realizability
Bypassing the Monster: A Faster and Simpler Optimal Algorithm for Contextual Bandits under Realizability
D. Simchi-Levi
Yunzong Xu
OffRL
47
107
0
28 Mar 2020
Algorithms for Non-Stationary Generalized Linear Bandits
Algorithms for Non-Stationary Generalized Linear Bandits
Yoan Russac
Olivier Cappé
Aurélien Garivier
43
23
0
23 Mar 2020
Weighted Linear Bandits for Non-Stationary Environments
Weighted Linear Bandits for Non-Stationary Environments
Yoan Russac
Claire Vernade
Olivier Cappé
82
101
0
19 Sep 2019
1