ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1810.03024
  4. Cited By
Learning to Optimize under Non-Stationarity

Learning to Optimize under Non-Stationarity

6 October 2018
Wang Chi Cheung
D. Simchi-Levi
Ruihao Zhu
ArXivPDFHTML

Papers citing "Learning to Optimize under Non-Stationarity"

50 / 84 papers shown
Title
Natural Policy Gradient for Average Reward Non-Stationary RL
Natural Policy Gradient for Average Reward Non-Stationary RL
Neharika Jali
Eshika Pathak
Pranay Sharma
Guannan Qu
Gauri Joshi
32
0
0
23 Apr 2025
An Exploration-free Method for a Linear Stochastic Bandit Driven by a Linear Gaussian Dynamical System
An Exploration-free Method for a Linear Stochastic Bandit Driven by a Linear Gaussian Dynamical System
J. Gornet
Yilin Mo
Bruno Sinopoli
30
0
0
04 Apr 2025
Tracking Most Significant Shifts in Infinite-Armed Bandits
Joe Suk
Jung-hun Kim
55
0
0
31 Jan 2025
Lower Bounds for Time-Varying Kernelized Bandits
Lower Bounds for Time-Varying Kernelized Bandits
Xu Cai
Jonathan Scarlett
36
0
0
22 Oct 2024
Right Now, Wrong Then: Non-Stationary Direct Preference Optimization
  under Preference Drift
Right Now, Wrong Then: Non-Stationary Direct Preference Optimization under Preference Drift
Seongho Son
William Bankes
Sayak Ray Chowdhury
Brooks Paige
Ilija Bogunovic
42
4
0
26 Jul 2024
Variance-Dependent Regret Bounds for Non-stationary Linear Bandits
Variance-Dependent Regret Bounds for Non-stationary Linear Bandits
Zhiyong Wang
Jize Xie
Yi Chen
J. C. Lui
Dongruo Zhou
28
0
0
15 Mar 2024
Non-Stationary Latent Auto-Regressive Bandits
Non-Stationary Latent Auto-Regressive Bandits
Anna L. Trella
Walter Dempsey
Asim H. Gazi
Ziping Xu
Finale Doshi-Velez
Susan A. Murphy
26
1
0
05 Feb 2024
Non-Stationary Contextual Bandit Learning via Neural Predictive Ensemble
  Sampling
Non-Stationary Contextual Bandit Learning via Neural Predictive Ensemble Sampling
Zheqing Zhu
Yueyang Liu
Xu Kuang
Benjamin Van Roy
AI4TS
29
0
0
11 Oct 2023
Tempo Adaptation in Non-stationary Reinforcement Learning
Tempo Adaptation in Non-stationary Reinforcement Learning
Hyunin Lee
Yuhao Ding
Jongmin Lee
Ming Jin
Javad Lavaei
Somayeh Sojoudi
9
3
0
26 Sep 2023
Provably Efficient Algorithm for Nonstationary Low-Rank MDPs
Provably Efficient Algorithm for Nonstationary Low-Rank MDPs
Yuan Cheng
J. Yang
Yitao Liang
OOD
38
1
0
10 Aug 2023
Online Learning with Costly Features in Non-stationary Environments
Online Learning with Costly Features in Non-stationary Environments
Saeed Ghoorchian
E. Kortukov
S. Maghsudi
OffRL
33
1
0
18 Jul 2023
Continual Learning as Computationally Constrained Reinforcement Learning
Continual Learning as Computationally Constrained Reinforcement Learning
Saurabh Kumar
Henrik Marklund
Anand Srinivasa Rao
Yifan Zhu
Hong Jun Jeon
Yueyang Liu
Benjamin Van Roy
CLL
27
22
0
10 Jul 2023
BOF-UCB: A Bayesian-Optimistic Frequentist Algorithm for Non-Stationary
  Contextual Bandits
BOF-UCB: A Bayesian-Optimistic Frequentist Algorithm for Non-Stationary Contextual Bandits
Nicklas Werge
Abdullah Akgul
M. Kandemir
38
0
0
07 Jul 2023
Tailoring Machine Learning for Process Mining
Tailoring Machine Learning for Process Mining
Paolo Ceravolo
Sylvio Barbon Junior
Ernesto Damiani
Wil M.P. van der Aalst
AI4TS
19
5
0
17 Jun 2023
Non-stationary Reinforcement Learning under General Function
  Approximation
Non-stationary Reinforcement Learning under General Function Approximation
Songtao Feng
Ming Yin
Ruiquan Huang
Yu-Xiang Wang
J. Yang
Yitao Liang
18
8
0
01 Jun 2023
Robust Lipschitz Bandits to Adversarial Corruptions
Robust Lipschitz Bandits to Adversarial Corruptions
Yue Kang
Cho-Jui Hsieh
T. C. Lee
AAML
30
8
0
29 May 2023
Learning to Seek: Multi-Agent Online Source Seeking Against
  Non-Stochastic Disturbances
Learning to Seek: Multi-Agent Online Source Seeking Against Non-Stochastic Disturbances
Bin Du
Kun Qian
Christian G. Claudel
Dengfeng Sun
16
0
0
29 Apr 2023
High-Dimensional Dynamic Pricing under Non-Stationarity: Learning and
  Earning with Change-Point Detection
High-Dimensional Dynamic Pricing under Non-Stationarity: Learning and Earning with Change-Point Detection
Zifeng Zhao
Feiyu Jiang
Yi Yu
Xi Chen
43
2
0
14 Mar 2023
Revisiting Weighted Strategy for Non-stationary Parametric Bandits
Revisiting Weighted Strategy for Non-stationary Parametric Bandits
Jing Wang
Peng Zhao
Zhihong Zhou
30
5
0
05 Mar 2023
MNL-Bandit in non-stationary environments
MNL-Bandit in non-stationary environments
Ayoub Foussoul
Vineet Goyal
Varun Gupta
34
2
0
04 Mar 2023
A Definition of Non-Stationary Bandits
A Definition of Non-Stationary Bandits
Yueyang Liu
Kuang Xu
Benjamin Van Roy
24
11
0
23 Feb 2023
Online Continuous Hyperparameter Optimization for Generalized Linear
  Contextual Bandits
Online Continuous Hyperparameter Optimization for Generalized Linear Contextual Bandits
Yue Kang
Cho-Jui Hsieh
T. C. Lee
24
1
0
18 Feb 2023
Linear Bandits with Memory: from Rotting to Rising
Linear Bandits with Memory: from Rotting to Rising
Giulia Clerici
Pierre Laforgue
Nicolò Cesa-Bianchi
25
3
0
16 Feb 2023
An Information-Theoretic Analysis of Nonstationary Bandit Learning
An Information-Theoretic Analysis of Nonstationary Bandit Learning
Seungki Min
Daniel Russo
28
6
0
09 Feb 2023
Online Resource Allocation: Bandits feedback and Advice on Time-varying
  Demands
Online Resource Allocation: Bandits feedback and Advice on Time-varying Demands
Lixing Lyu
Wang Chi Cheung
18
0
0
08 Feb 2023
Multi-channel Autobidding with Budget and ROI Constraints
Multi-channel Autobidding with Budget and ROI Constraints
Yuan Deng
Negin Golrezaei
Patrick Jaillet
Jason Cheuk Nam Liang
Vahab Mirrokni
24
24
0
03 Feb 2023
Smooth Non-Stationary Bandits
Smooth Non-Stationary Bandits
S. Jia
Qian Xie
Nathan Kallus
P. Frazier
98
9
0
29 Jan 2023
Rectified Pessimistic-Optimistic Learning for Stochastic Continuum-armed
  Bandit with Constraints
Rectified Pessimistic-Optimistic Learning for Stochastic Continuum-armed Bandit with Constraints
Heng Guo
Qi Zhu
Xin Liu
29
11
0
27 Nov 2022
Learning to Price Supply Chain Contracts against a Learning Retailer
Learning to Price Supply Chain Contracts against a Learning Retailer
Xuejun Zhao
Ruihao Zhu
W. Haskell
OffRL
10
0
0
02 Nov 2022
Competing Bandits in Time Varying Matching Markets
Competing Bandits in Time Varying Matching Markets
Deepan Muthirayan
C. Maheshwari
Pramod P. Khargonekar
S. Shankar Sastry
28
1
0
21 Oct 2022
Adversarial Bandits against Arbitrary Strategies
Adversarial Bandits against Arbitrary Strategies
Jung-hun Kim
Se-Young Yun
49
0
0
30 May 2022
Non-stationary Bandits with Knapsacks
Non-stationary Bandits with Knapsacks
Shang Liu
Jiashuo Jiang
Xiaocheng Li
15
20
0
25 May 2022
Non-Stationary Bandit Learning via Predictive Sampling
Non-Stationary Bandit Learning via Predictive Sampling
Yueyang Liu
Kuang Xu
Benjamin Van Roy
24
19
0
04 May 2022
Stochastic Multi-armed Bandits with Non-stationary Rewards Generated by
  a Linear Dynamical System
Stochastic Multi-armed Bandits with Non-stationary Rewards Generated by a Linear Dynamical System
J. Gornet
M. Hosseinzadeh
Bruno Sinopoli
22
7
0
06 Apr 2022
Adaptive Experimentation in the Presence of Exogenous Nonstationary
  Variation
Adaptive Experimentation in the Presence of Exogenous Nonstationary Variation
Chao Qin
Daniel Russo
52
6
0
18 Feb 2022
Corralling a Larger Band of Bandits: A Case Study on Switching Regret
  for Linear Bandits
Corralling a Larger Band of Bandits: A Case Study on Switching Regret for Linear Bandits
Haipeng Luo
Mengxiao Zhang
Peng Zhao
Zhi-Hua Zhou
31
17
0
12 Feb 2022
Bayesian Non-stationary Linear Bandits for Large-Scale Recommender
  Systems
Bayesian Non-stationary Linear Bandits for Large-Scale Recommender Systems
Saeed Ghoorchian
E. Kortukov
S. Maghsudi
OffRL
22
0
0
07 Feb 2022
Rotting Infinitely Many-armed Bandits
Rotting Infinitely Many-armed Bandits
Jung-hun Kim
Milan Vojnović
Se-Young Yun
24
4
0
31 Jan 2022
Bridging Adversarial and Nonstationary Multi-armed Bandit
Bridging Adversarial and Nonstationary Multi-armed Bandit
Ningyuan Chen
Shuoguang Yang
Hailun Zhang
AAML
11
4
0
05 Jan 2022
Using Non-Stationary Bandits for Learning in Repeated Cournot Games with
  Non-Stationary Demand
Using Non-Stationary Bandits for Learning in Repeated Cournot Games with Non-Stationary Demand
Kshitija Taywade
Brent Harrison
J. Goldsmith
26
3
0
03 Jan 2022
Tracking Most Significant Arm Switches in Bandits
Tracking Most Significant Arm Switches in Bandits
Joe Suk
Samory Kpotufe
30
18
0
27 Dec 2021
Optimal and Efficient Dynamic Regret Algorithms for Non-Stationary
  Dueling Bandits
Optimal and Efficient Dynamic Regret Algorithms for Non-Stationary Dueling Bandits
Aadirupa Saha
Shubham Gupta
30
10
0
06 Nov 2021
Dynamic Regret Minimization for Control of Non-stationary Linear
  Dynamical Systems
Dynamic Regret Minimization for Control of Non-stationary Linear Dynamical Systems
Yuwei Luo
Varun Gupta
Mladen Kolar
35
9
0
06 Nov 2021
The Pareto Frontier of model selection for general Contextual Bandits
The Pareto Frontier of model selection for general Contextual Bandits
T. V. Marinov
Julian Zimmert
13
22
0
25 Oct 2021
Linear Contextual Bandits with Adversarial Corruptions
Linear Contextual Bandits with Adversarial Corruptions
Heyang Zhao
Dongruo Zhou
Quanquan Gu
AAML
20
24
0
25 Oct 2021
Optimistic Policy Optimization is Provably Efficient in Non-stationary
  MDPs
Optimistic Policy Optimization is Provably Efficient in Non-stationary MDPs
Han Zhong
Zhuoran Yang
Zhaoran Wang
Csaba Szepesvári
33
21
0
18 Oct 2021
Weighted Gaussian Process Bandits for Non-stationary Environments
Weighted Gaussian Process Bandits for Non-stationary Environments
Yuntian Deng
Xingyu Zhou
Baekjin Kim
Ambuj Tewari
Abhishek Gupta
Ness B. Shroff
4
23
0
06 Jul 2021
Fundamental Limits of Reinforcement Learning in Environment with
  Endogeneous and Exogeneous Uncertainty
Fundamental Limits of Reinforcement Learning in Environment with Endogeneous and Exogeneous Uncertainty
Rongpeng Li
20
0
0
15 Jun 2021
Syndicated Bandits: A Framework for Auto Tuning Hyper-parameters in
  Contextual Bandit Algorithms
Syndicated Bandits: A Framework for Auto Tuning Hyper-parameters in Contextual Bandit Algorithms
Qin Ding
Yue Kang
Yi-Wei Liu
Thomas C. M. Lee
Cho-Jui Hsieh
James Sharpnack
13
8
0
05 Jun 2021
Robust Stochastic Linear Contextual Bandits Under Adversarial Attacks
Robust Stochastic Linear Contextual Bandits Under Adversarial Attacks
Qin Ding
Cho-Jui Hsieh
James Sharpnack
AAML
26
31
0
05 Jun 2021
12
Next