Efficient Contextual Bandits in Non-stationary Worlds

5 August 2017

Papers citing "Efficient Contextual Bandits in Non-stationary Worlds"

50 / 72 papers shown

Title
Beyond IID: data-driven decision-making in heterogeneous environments Omar Besbes Will Ma Omar Mouchtaki 42 7 0 03 Jan 2025
Improved Regret Bounds for Bandits with Expert Advice Nicolò Cesa-Bianchi Khaled Eldowa Emmanuel Esposito Julia Olkhovskaya 35 0 0 24 Jun 2024
A Contextual Online Learning Theory of Brokerage F. Bachoc Tommaso Cesari Roberto Colomboni 28 2 0 22 May 2024
Mitigating Biases in Collective Decision-Making: Enhancing Performance in the Face of Fake News Axel Abels Elias Fernández Domingos Ann Nowé Tom Lenaerts 19 1 0 11 Mar 2024
Near-optimal Per-Action Regret Bounds for Sleeping Bandits Quan Nguyen Nishant A. Mehta 19 1 0 02 Mar 2024
Adaptive Interventions with User-Defined Goals for Health Behavior Change Aishwarya Mandyam Matthew Joerke William Denton Barbara E. Engelhardt Emma Brunskill 32 1 0 16 Nov 2023
An Improved Relaxation for Oracle-Efficient Adversarial Contextual Bandits Kiarash Banihashem Mohammadtaghi Hajiaghayi Suho Shin Max Springer 16 1 0 29 Oct 2023
A Stability Principle for Learning under Non-Stationarity Chengpiao Huang Kaizheng Wang 39 2 0 27 Oct 2023
Non-Stationary Contextual Bandit Learning via Neural Predictive Ensemble Sampling Zheqing Zhu Yueyang Liu Xu Kuang Benjamin Van Roy AI4TS 29 0 0 11 Oct 2023
Bypassing the Simulator: Near-Optimal Adversarial Linear Contextual Bandits Haolin Liu Chen-Yu Wei Julian Zimmert 30 9 0 02 Sep 2023
Online Learning with Costly Features in Non-stationary Environments Saeed Ghoorchian E. Kortukov S. Maghsudi OffRL 33 1 0 18 Jul 2023
Tracking Most Significant Shifts in Nonparametric Contextual Bandits Joe Suk Samory Kpotufe 38 5 0 11 Jul 2023
Meta-Learning Adversarial Bandit Algorithms M. Khodak Ilya Osadchiy Keegan Harris Maria-Florina Balcan Kfir Y. Levy Ron Meir Zhiwei Steven Wu FedML 28 2 0 05 Jul 2023
Non-stationary Reinforcement Learning under General Function Approximation Songtao Feng Ming Yin Ruiquan Huang Yu-Xiang Wang J. Yang Yitao Liang 18 8 0 01 Jun 2023
Energy Regularized RNNs for Solving Non-Stationary Bandit Problems Michael Rotman Lior Wolf 16 1 0 12 Mar 2023
MNL-Bandit in non-stationary environments Ayoub Foussoul Vineet Goyal Varun Gupta 34 2 0 04 Mar 2023
A Definition of Non-Stationary Bandits Yueyang Liu Kuang Xu Benjamin Van Roy 24 11 0 23 Feb 2023
Linear Bandits with Memory: from Rotting to Rising Giulia Clerici Pierre Laforgue Nicolò Cesa-Bianchi 25 3 0 16 Feb 2023
Multi-channel Autobidding with Budget and ROI Constraints Yuan Deng Negin Golrezaei Patrick Jaillet Jason Cheuk Nam Liang Vahab Mirrokni 24 24 0 03 Feb 2023
Quantum contextual bandits and recommender systems for quantum data Shrigyan Brahmachari Josep Lumbreras Marco Tomamichel 32 3 0 31 Jan 2023
Bounded (O(1)) Regret Recommendation Learning via Synthetic Controls Oracle Hyunwook Kang P. R. Kumar OffRL 33 1 0 29 Jan 2023
Smooth Non-Stationary Bandits S. Jia Qian Xie Nathan Kallus P. Frazier 98 9 0 29 Jan 2023
Contextual Bandits and Optimistically Universal Learning Moise Blanchard Steve Hanneke Patrick Jaillet OffRL 19 1 0 31 Dec 2022
Learning to Price Supply Chain Contracts against a Learning Retailer Xuejun Zhao Ruihao Zhu W. Haskell OffRL 10 0 0 02 Nov 2022
ANACONDA: An Improved Dynamic Regret Algorithm for Adaptive Non-Stationary Dueling Bandits Thomas Kleine Buening Aadirupa Saha 46 6 0 25 Oct 2022
Extending Open Bandit Pipeline to Simulate Industry Challenges Bram van den Akker N. Weber Felipe Moraes Dmitri Goldenberg OffRL 16 1 0 09 Sep 2022
Decentralized Competing Bandits in Non-Stationary Matching Markets Avishek Ghosh Abishek Sankararaman Kannan Ramchandran T. Javidi A. Mazumdar 28 4 0 31 May 2022
Non-Stationary Bandit Learning via Predictive Sampling Yueyang Liu Kuang Xu Benjamin Van Roy 24 19 0 04 May 2022
Corralling a Larger Band of Bandits: A Case Study on Switching Regret for Linear Bandits Haipeng Luo Mengxiao Zhang Peng Zhao Zhi-Hua Zhou 31 17 0 12 Feb 2022
Bridging Adversarial and Nonstationary Multi-armed Bandit Ningyuan Chen Shuoguang Yang Hailun Zhang AAML 11 4 0 05 Jan 2022
Efficient and Optimal Algorithms for Contextual Dueling Bandits under Realizability Aadirupa Saha A. Krishnamurthy 31 35 0 24 Nov 2021
Optimal and Efficient Dynamic Regret Algorithms for Non-Stationary Dueling Bandits Aadirupa Saha Shubham Gupta 33 10 0 06 Nov 2021
The Pareto Frontier of model selection for general Contextual Bandits T. V. Marinov Julian Zimmert 13 22 0 25 Oct 2021
On Slowly-varying Non-stationary Bandits Ramakrishnan Krishnamurthy Médéric Fourmy 21 8 0 25 Oct 2021
Towards the D-Optimal Online Experiment Design for Recommender Selection Madina Abdrakhmanova Saniya Abushakimova Evren Körpeoglu H. A. Varol Kannan Achan 14 3 0 23 Oct 2021
Adapting to Misspecification in Contextual Bandits Dylan J. Foster Claudio Gentile M. Mohri Julian Zimmert 11 84 0 12 Jul 2021
Periodic-GP: Learning Periodic World with Gaussian Process Bandits Hengrui Cai Zhihao Cen Ling Leng Rui Song AI4TS 25 5 0 30 May 2021
When and Whom to Collaborate with in a Changing Environment: A Collaborative Dynamic Bandit Solution Chuanhao Li Qingyun Wu Hongning Wang 39 5 0 14 Apr 2021
Dynamic Pricing and Learning under the Bass Model Shipra Agrawal Steven Yin A. Zeevi 21 11 0 09 Mar 2021
Non-stationary Reinforcement Learning without Prior Knowledge: An Optimal Black-box Approach Chen-Yu Wei Haipeng Luo OffRL 24 102 0 10 Feb 2021
Learning User Preferences in Non-Stationary Environments Wasim Huleihel S. Pal O. Shayevitz 14 12 0 29 Jan 2021
Adversarial Linear Contextual Bandits with Graph-Structured Side Observations Lingda Wang Bingcong Li Huozhi Zhou G. Giannakis L. Varshney Zhizhen Zhao 13 7 0 10 Dec 2020
Non-Stationary Latent Bandits Joey Hong B. Kveton Manzil Zaheer Yinlam Chow Amr Ahmed Mohammad Ghavamzadeh Craig Boutilier OffRL 25 13 0 01 Dec 2020
Adversarial Dueling Bandits Aadirupa Saha Tomer Koren Yishay Mansour 13 25 0 27 Oct 2020
Effects of Model Misspecification on Bayesian Bandits: Case Studies in UX Optimization Mack Sweeney M. Adelsberg Kathryn B. Laskey C. Domeniconi 16 1 0 07 Oct 2020
Model-Free Non-Stationary RL: Near-Optimal Regret and Applications in Multi-Agent RL and Inventory Control Weichao Mao Kaipeng Zhang Ruihao Zhu D. Simchi-Levi Tamer Bacsar 22 13 0 07 Oct 2020
Learning Product Rankings Robust to Fake Users Negin Golrezaei Vahideh H. Manshadi Jon Schneider S. Sekar 13 26 0 10 Sep 2020
Unifying Clustered and Non-stationary Bandits Chuanhao Li Qingyun Wu Hongning Wang 27 12 0 05 Sep 2020
Self-Tuning Bandits over Unknown Covariate-Shifts Joe Suk Samory Kpotufe 14 9 0 16 Jul 2020
Dynamic Regret of Policy Optimization in Non-stationary Environments Yingjie Fei Zhuoran Yang Zhaoran Wang Qiaomin Xie 16 54 0 30 Jun 2020