1810.03024
Learning to Optimize under Non-Stationarity
6 October 2018
Wang Chi Cheung
D. Simchi-Levi
Ruihao Zhu
Papers citing "Learning to Optimize under Non-Stationarity"
50 / 84 papers shown
Title
Natural Policy Gradient for Average Reward Non-Stationary RL
Neharika Jali
Eshika Pathak
Pranay Sharma
Guannan Qu
Gauri Joshi
32
0
0
23 Apr 2025
An Exploration-free Method for a Linear Stochastic Bandit Driven by a Linear Gaussian Dynamical System
J. Gornet
Yilin Mo
Bruno Sinopoli
30
0
0
04 Apr 2025
Tracking Most Significant Shifts in Infinite-Armed Bandits
Joe Suk
Jung-hun Kim
55
0
0
31 Jan 2025
Lower Bounds for Time-Varying Kernelized Bandits
Xu Cai
Jonathan Scarlett
36
0
0
22 Oct 2024
Right Now, Wrong Then: Non-Stationary Direct Preference Optimization under Preference Drift
Seongho Son
William Bankes
Sayak Ray Chowdhury
Brooks Paige
Ilija Bogunovic
42
4
0
26 Jul 2024
Variance-Dependent Regret Bounds for Non-stationary Linear Bandits
Zhiyong Wang
Jize Xie
Yi Chen
J. C. Lui
Dongruo Zhou
28
0
0
15 Mar 2024
Non-Stationary Latent Auto-Regressive Bandits
Anna L. Trella
Walter Dempsey
Asim H. Gazi
Ziping Xu
Finale Doshi-Velez
Susan A. Murphy
26
1
0
05 Feb 2024
Non-Stationary Contextual Bandit Learning via Neural Predictive Ensemble Sampling
Zheqing Zhu
Yueyang Liu
Xu Kuang
Benjamin Van Roy
AI4TS
29
0
0
11 Oct 2023
Tempo Adaptation in Non-stationary Reinforcement Learning
Hyunin Lee
Yuhao Ding
Jongmin Lee
Ming Jin
Javad Lavaei
Somayeh Sojoudi
9
3
0
26 Sep 2023
Provably Efficient Algorithm for Nonstationary Low-Rank MDPs
Yuan Cheng
J. Yang
Yitao Liang
OOD
38
1
0
10 Aug 2023
Online Learning with Costly Features in Non-stationary Environments
Saeed Ghoorchian
E. Kortukov
S. Maghsudi
OffRL
33
1
0
18 Jul 2023
Continual Learning as Computationally Constrained Reinforcement Learning
Saurabh Kumar
Henrik Marklund
Anand Srinivasa Rao
Yifan Zhu
Hong Jun Jeon
Yueyang Liu
Benjamin Van Roy
CLL
27
22
0
10 Jul 2023
BOF-UCB: A Bayesian-Optimistic Frequentist Algorithm for Non-Stationary Contextual Bandits
Nicklas Werge
Abdullah Akgul
M. Kandemir
38
0
0
07 Jul 2023
Tailoring Machine Learning for Process Mining
Paolo Ceravolo
Sylvio Barbon Junior
Ernesto Damiani
Wil M.P. van der Aalst
AI4TS
19
5
0
17 Jun 2023
Non-stationary Reinforcement Learning under General Function Approximation
Songtao Feng
Ming Yin
Ruiquan Huang
Yu-Xiang Wang
J. Yang
Yitao Liang
18
8
0
01 Jun 2023
Robust Lipschitz Bandits to Adversarial Corruptions
Yue Kang
Cho-Jui Hsieh
T. C. Lee
AAML
30
8
0
29 May 2023
Learning to Seek: Multi-Agent Online Source Seeking Against Non-Stochastic Disturbances
Bin Du
Kun Qian
Christian G. Claudel
Dengfeng Sun
16
0
0
29 Apr 2023
High-Dimensional Dynamic Pricing under Non-Stationarity: Learning and Earning with Change-Point Detection
Zifeng Zhao
Feiyu Jiang
Yi Yu
Xi Chen
43
2
0
14 Mar 2023
Revisiting Weighted Strategy for Non-stationary Parametric Bandits
Jing Wang
Peng Zhao
Zhihong Zhou
30
5
0
05 Mar 2023
MNL-Bandit in non-stationary environments
Ayoub Foussoul
Vineet Goyal
Varun Gupta
34
2
0
04 Mar 2023
A Definition of Non-Stationary Bandits
Yueyang Liu
Kuang Xu
Benjamin Van Roy
24
11
0
23 Feb 2023
Online Continuous Hyperparameter Optimization for Generalized Linear Contextual Bandits
Yue Kang
Cho-Jui Hsieh
T. C. Lee
24
1
0
18 Feb 2023
Linear Bandits with Memory: from Rotting to Rising
Giulia Clerici
Pierre Laforgue
Nicolò Cesa-Bianchi
25
3
0
16 Feb 2023
An Information-Theoretic Analysis of Nonstationary Bandit Learning
Seungki Min
Daniel Russo
28
6
0
09 Feb 2023
Online Resource Allocation: Bandits feedback and Advice on Time-varying Demands
Lixing Lyu
Wang Chi Cheung
18
0
0
08 Feb 2023
Multi-channel Autobidding with Budget and ROI Constraints
Yuan Deng
Negin Golrezaei
Patrick Jaillet
Jason Cheuk Nam Liang
Vahab Mirrokni
24
24
0
03 Feb 2023
Smooth Non-Stationary Bandits
S. Jia
Qian Xie
Nathan Kallus
P. Frazier
98
9
0
29 Jan 2023
Rectified Pessimistic-Optimistic Learning for Stochastic Continuum-armed Bandit with Constraints
Heng Guo
Qi Zhu
Xin Liu
29
11
0
27 Nov 2022
Learning to Price Supply Chain Contracts against a Learning Retailer
Xuejun Zhao
Ruihao Zhu
W. Haskell
OffRL
10
0
0
02 Nov 2022
Competing Bandits in Time Varying Matching Markets
Deepan Muthirayan
C. Maheshwari
Pramod P. Khargonekar
S. Shankar Sastry
28
1
0
21 Oct 2022
Adversarial Bandits against Arbitrary Strategies
Jung-hun Kim
Se-Young Yun
49
0
0
30 May 2022
Non-stationary Bandits with Knapsacks
Shang Liu
Jiashuo Jiang
Xiaocheng Li
15
20
0
25 May 2022
Non-Stationary Bandit Learning via Predictive Sampling
Yueyang Liu
Kuang Xu
Benjamin Van Roy
24
19
0
04 May 2022
Stochastic Multi-armed Bandits with Non-stationary Rewards Generated by a Linear Dynamical System
J. Gornet
M. Hosseinzadeh
Bruno Sinopoli
22
7
0
06 Apr 2022
Adaptive Experimentation in the Presence of Exogenous Nonstationary Variation
Chao Qin
Daniel Russo
52
6
0
18 Feb 2022
Corralling a Larger Band of Bandits: A Case Study on Switching Regret for Linear Bandits
Haipeng Luo
Mengxiao Zhang
Peng Zhao
Zhi-Hua Zhou
31
17
0
12 Feb 2022
Bayesian Non-stationary Linear Bandits for Large-Scale Recommender Systems
Saeed Ghoorchian
E. Kortukov
S. Maghsudi
OffRL
22
0
0
07 Feb 2022
Rotting Infinitely Many-armed Bandits
Jung-hun Kim
Milan Vojnović
Se-Young Yun
24
4
0
31 Jan 2022
Bridging Adversarial and Nonstationary Multi-armed Bandit
Ningyuan Chen
Shuoguang Yang
Hailun Zhang
AAML
11
4
0
05 Jan 2022
Using Non-Stationary Bandits for Learning in Repeated Cournot Games with Non-Stationary Demand
Kshitija Taywade
Brent Harrison
J. Goldsmith
26
3
0
03 Jan 2022
Tracking Most Significant Arm Switches in Bandits
Joe Suk
Samory Kpotufe
30
18
0
27 Dec 2021
Optimal and Efficient Dynamic Regret Algorithms for Non-Stationary Dueling Bandits
Aadirupa Saha
Shubham Gupta
30
10
0
06 Nov 2021
Dynamic Regret Minimization for Control of Non-stationary Linear Dynamical Systems
Yuwei Luo
Varun Gupta
Mladen Kolar
35
9
0
06 Nov 2021
The Pareto Frontier of model selection for general Contextual Bandits
T. V. Marinov
Julian Zimmert
13
22
0
25 Oct 2021
Linear Contextual Bandits with Adversarial Corruptions
Heyang Zhao
Dongruo Zhou
Quanquan Gu
AAML
20
24
0
25 Oct 2021
Optimistic Policy Optimization is Provably Efficient in Non-stationary MDPs
Han Zhong
Zhuoran Yang
Zhaoran Wang
Csaba Szepesvári
33
21
0
18 Oct 2021
Weighted Gaussian Process Bandits for Non-stationary Environments
Yuntian Deng
Xingyu Zhou
Baekjin Kim
Ambuj Tewari
Abhishek Gupta
Ness B. Shroff
4
23
0
06 Jul 2021
Fundamental Limits of Reinforcement Learning in Environment with Endogeneous and Exogeneous Uncertainty
Rongpeng Li
20
0
0
15 Jun 2021
Syndicated Bandits: A Framework for Auto Tuning Hyper-parameters in Contextual Bandit Algorithms
Qin Ding
Yue Kang
Yi-Wei Liu
Thomas C. M. Lee
Cho-Jui Hsieh
James Sharpnack
13
8
0
05 Jun 2021
Robust Stochastic Linear Contextual Bandits Under Adversarial Attacks
Qin Ding
Cho-Jui Hsieh
James Sharpnack
AAML
26
31
0
05 Jun 2021