Learning to Optimize under Non-Stationarity

6 October 2018

Papers citing "Learning to Optimize under Non-Stationarity"

50 / 84 papers shown

Title
Natural Policy Gradient for Average Reward Non-Stationary RL Neharika Jali Eshika Pathak Pranay Sharma Guannan Qu Gauri Joshi 32 0 0 23 Apr 2025
An Exploration-free Method for a Linear Stochastic Bandit Driven by a Linear Gaussian Dynamical System J. Gornet Yilin Mo Bruno Sinopoli 30 0 0 04 Apr 2025
Tracking Most Significant Shifts in Infinite-Armed Bandits Joe Suk Jung-hun Kim 55 0 0 31 Jan 2025
Lower Bounds for Time-Varying Kernelized Bandits Xu Cai Jonathan Scarlett 36 0 0 22 Oct 2024
Right Now, Wrong Then: Non-Stationary Direct Preference Optimization under Preference Drift Seongho Son William Bankes Sayak Ray Chowdhury Brooks Paige Ilija Bogunovic 42 4 0 26 Jul 2024
Variance-Dependent Regret Bounds for Non-stationary Linear Bandits Zhiyong Wang Jize Xie Yi Chen J. C. Lui Dongruo Zhou 28 0 0 15 Mar 2024
Non-Stationary Latent Auto-Regressive Bandits Anna L. Trella Walter Dempsey Asim H. Gazi Ziping Xu Finale Doshi-Velez Susan A. Murphy 26 1 0 05 Feb 2024
Non-Stationary Contextual Bandit Learning via Neural Predictive Ensemble Sampling Zheqing Zhu Yueyang Liu Xu Kuang Benjamin Van Roy AI4TS 29 0 0 11 Oct 2023
Tempo Adaptation in Non-stationary Reinforcement Learning Hyunin Lee Yuhao Ding Jongmin Lee Ming Jin Javad Lavaei Somayeh Sojoudi 9 3 0 26 Sep 2023
Provably Efficient Algorithm for Nonstationary Low-Rank MDPs Yuan Cheng J. Yang Yitao Liang OOD 38 1 0 10 Aug 2023
Online Learning with Costly Features in Non-stationary Environments Saeed Ghoorchian E. Kortukov S. Maghsudi OffRL 33 1 0 18 Jul 2023
Continual Learning as Computationally Constrained Reinforcement Learning Saurabh Kumar Henrik Marklund Anand Srinivasa Rao Yifan Zhu Hong Jun Jeon Yueyang Liu Benjamin Van Roy CLL 27 22 0 10 Jul 2023
BOF-UCB: A Bayesian-Optimistic Frequentist Algorithm for Non-Stationary Contextual Bandits Nicklas Werge Abdullah Akgul M. Kandemir 38 0 0 07 Jul 2023
Tailoring Machine Learning for Process Mining Paolo Ceravolo Sylvio Barbon Junior Ernesto Damiani Wil M.P. van der Aalst AI4TS 19 5 0 17 Jun 2023
Non-stationary Reinforcement Learning under General Function Approximation Songtao Feng Ming Yin Ruiquan Huang Yu-Xiang Wang J. Yang Yitao Liang 18 8 0 01 Jun 2023
Robust Lipschitz Bandits to Adversarial Corruptions Yue Kang Cho-Jui Hsieh T. C. Lee AAML 30 8 0 29 May 2023
Learning to Seek: Multi-Agent Online Source Seeking Against Non-Stochastic Disturbances Bin Du Kun Qian Christian G. Claudel Dengfeng Sun 16 0 0 29 Apr 2023
High-Dimensional Dynamic Pricing under Non-Stationarity: Learning and Earning with Change-Point Detection Zifeng Zhao Feiyu Jiang Yi Yu Xi Chen 43 2 0 14 Mar 2023
Revisiting Weighted Strategy for Non-stationary Parametric Bandits Jing Wang Peng Zhao Zhihong Zhou 30 5 0 05 Mar 2023
MNL-Bandit in non-stationary environments Ayoub Foussoul Vineet Goyal Varun Gupta 34 2 0 04 Mar 2023
A Definition of Non-Stationary Bandits Yueyang Liu Kuang Xu Benjamin Van Roy 24 11 0 23 Feb 2023
Online Continuous Hyperparameter Optimization for Generalized Linear Contextual Bandits Yue Kang Cho-Jui Hsieh T. C. Lee 24 1 0 18 Feb 2023
Linear Bandits with Memory: from Rotting to Rising Giulia Clerici Pierre Laforgue Nicolò Cesa-Bianchi 25 3 0 16 Feb 2023
An Information-Theoretic Analysis of Nonstationary Bandit Learning Seungki Min Daniel Russo 28 6 0 09 Feb 2023
Online Resource Allocation: Bandits feedback and Advice on Time-varying Demands Lixing Lyu Wang Chi Cheung 18 0 0 08 Feb 2023
Multi-channel Autobidding with Budget and ROI Constraints Yuan Deng Negin Golrezaei Patrick Jaillet Jason Cheuk Nam Liang Vahab Mirrokni 24 24 0 03 Feb 2023
Smooth Non-Stationary Bandits S. Jia Qian Xie Nathan Kallus P. Frazier 98 9 0 29 Jan 2023
Rectified Pessimistic-Optimistic Learning for Stochastic Continuum-armed Bandit with Constraints Heng Guo Qi Zhu Xin Liu 29 11 0 27 Nov 2022
Learning to Price Supply Chain Contracts against a Learning Retailer Xuejun Zhao Ruihao Zhu W. Haskell OffRL 10 0 0 02 Nov 2022
Competing Bandits in Time Varying Matching Markets Deepan Muthirayan C. Maheshwari Pramod P. Khargonekar S. Shankar Sastry 28 1 0 21 Oct 2022
Adversarial Bandits against Arbitrary Strategies Jung-hun Kim Se-Young Yun 49 0 0 30 May 2022
Non-stationary Bandits with Knapsacks Shang Liu Jiashuo Jiang Xiaocheng Li 15 20 0 25 May 2022
Non-Stationary Bandit Learning via Predictive Sampling Yueyang Liu Kuang Xu Benjamin Van Roy 24 19 0 04 May 2022
Stochastic Multi-armed Bandits with Non-stationary Rewards Generated by a Linear Dynamical System J. Gornet M. Hosseinzadeh Bruno Sinopoli 22 7 0 06 Apr 2022
Adaptive Experimentation in the Presence of Exogenous Nonstationary Variation Chao Qin Daniel Russo 52 6 0 18 Feb 2022
Corralling a Larger Band of Bandits: A Case Study on Switching Regret for Linear Bandits Haipeng Luo Mengxiao Zhang Peng Zhao Zhi-Hua Zhou 31 17 0 12 Feb 2022
Bayesian Non-stationary Linear Bandits for Large-Scale Recommender Systems Saeed Ghoorchian E. Kortukov S. Maghsudi OffRL 22 0 0 07 Feb 2022
Rotting Infinitely Many-armed Bandits Jung-hun Kim Milan Vojnović Se-Young Yun 24 4 0 31 Jan 2022
Bridging Adversarial and Nonstationary Multi-armed Bandit Ningyuan Chen Shuoguang Yang Hailun Zhang AAML 11 4 0 05 Jan 2022
Using Non-Stationary Bandits for Learning in Repeated Cournot Games with Non-Stationary Demand Kshitija Taywade Brent Harrison J. Goldsmith 26 3 0 03 Jan 2022
Tracking Most Significant Arm Switches in Bandits Joe Suk Samory Kpotufe 30 18 0 27 Dec 2021
Optimal and Efficient Dynamic Regret Algorithms for Non-Stationary Dueling Bandits Aadirupa Saha Shubham Gupta 30 10 0 06 Nov 2021
Dynamic Regret Minimization for Control of Non-stationary Linear Dynamical Systems Yuwei Luo Varun Gupta Mladen Kolar 35 9 0 06 Nov 2021
The Pareto Frontier of model selection for general Contextual Bandits T. V. Marinov Julian Zimmert 13 22 0 25 Oct 2021
Linear Contextual Bandits with Adversarial Corruptions Heyang Zhao Dongruo Zhou Quanquan Gu AAML 20 24 0 25 Oct 2021
Optimistic Policy Optimization is Provably Efficient in Non-stationary MDPs Han Zhong Zhuoran Yang Zhaoran Wang Csaba Szepesvári 33 21 0 18 Oct 2021
Weighted Gaussian Process Bandits for Non-stationary Environments Yuntian Deng Xingyu Zhou Baekjin Kim Ambuj Tewari Abhishek Gupta Ness B. Shroff 4 23 0 06 Jul 2021
Fundamental Limits of Reinforcement Learning in Environment with Endogeneous and Exogeneous Uncertainty Rongpeng Li 20 0 0 15 Jun 2021
Syndicated Bandits: A Framework for Auto Tuning Hyper-parameters in Contextual Bandit Algorithms Qin Ding Yue Kang Yi-Wei Liu Thomas C. M. Lee Cho-Jui Hsieh James Sharpnack 13 8 0 05 Jun 2021
Robust Stochastic Linear Contextual Bandits Under Adversarial Attacks Qin Ding Cho-Jui Hsieh James Sharpnack AAML 26 31 0 05 Jun 2021