Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1708.01799
Cited By
Efficient Contextual Bandits in Non-stationary Worlds
5 August 2017
Haipeng Luo
Chen-Yu Wei
Alekh Agarwal
John Langford
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Efficient Contextual Bandits in Non-stationary Worlds"
50 / 72 papers shown
Title
Beyond IID: data-driven decision-making in heterogeneous environments
Omar Besbes
Will Ma
Omar Mouchtaki
42
7
0
03 Jan 2025
Improved Regret Bounds for Bandits with Expert Advice
Nicolò Cesa-Bianchi
Khaled Eldowa
Emmanuel Esposito
Julia Olkhovskaya
35
0
0
24 Jun 2024
A Contextual Online Learning Theory of Brokerage
F. Bachoc
Tommaso Cesari
Roberto Colomboni
28
2
0
22 May 2024
Mitigating Biases in Collective Decision-Making: Enhancing Performance in the Face of Fake News
Axel Abels
Elias Fernández Domingos
Ann Nowé
Tom Lenaerts
19
1
0
11 Mar 2024
Near-optimal Per-Action Regret Bounds for Sleeping Bandits
Quan Nguyen
Nishant A. Mehta
19
1
0
02 Mar 2024
Adaptive Interventions with User-Defined Goals for Health Behavior Change
Aishwarya Mandyam
Matthew Joerke
William Denton
Barbara E. Engelhardt
Emma Brunskill
32
1
0
16 Nov 2023
An Improved Relaxation for Oracle-Efficient Adversarial Contextual Bandits
Kiarash Banihashem
Mohammadtaghi Hajiaghayi
Suho Shin
Max Springer
16
1
0
29 Oct 2023
A Stability Principle for Learning under Non-Stationarity
Chengpiao Huang
Kaizheng Wang
39
2
0
27 Oct 2023
Non-Stationary Contextual Bandit Learning via Neural Predictive Ensemble Sampling
Zheqing Zhu
Yueyang Liu
Xu Kuang
Benjamin Van Roy
AI4TS
29
0
0
11 Oct 2023
Bypassing the Simulator: Near-Optimal Adversarial Linear Contextual Bandits
Haolin Liu
Chen-Yu Wei
Julian Zimmert
30
9
0
02 Sep 2023
Online Learning with Costly Features in Non-stationary Environments
Saeed Ghoorchian
E. Kortukov
S. Maghsudi
OffRL
33
1
0
18 Jul 2023
Tracking Most Significant Shifts in Nonparametric Contextual Bandits
Joe Suk
Samory Kpotufe
38
5
0
11 Jul 2023
Meta-Learning Adversarial Bandit Algorithms
M. Khodak
Ilya Osadchiy
Keegan Harris
Maria-Florina Balcan
Kfir Y. Levy
Ron Meir
Zhiwei Steven Wu
FedML
28
2
0
05 Jul 2023
Non-stationary Reinforcement Learning under General Function Approximation
Songtao Feng
Ming Yin
Ruiquan Huang
Yu-Xiang Wang
J. Yang
Yitao Liang
18
8
0
01 Jun 2023
Energy Regularized RNNs for Solving Non-Stationary Bandit Problems
Michael Rotman
Lior Wolf
16
1
0
12 Mar 2023
MNL-Bandit in non-stationary environments
Ayoub Foussoul
Vineet Goyal
Varun Gupta
34
2
0
04 Mar 2023
A Definition of Non-Stationary Bandits
Yueyang Liu
Kuang Xu
Benjamin Van Roy
24
11
0
23 Feb 2023
Linear Bandits with Memory: from Rotting to Rising
Giulia Clerici
Pierre Laforgue
Nicolò Cesa-Bianchi
25
3
0
16 Feb 2023
Multi-channel Autobidding with Budget and ROI Constraints
Yuan Deng
Negin Golrezaei
Patrick Jaillet
Jason Cheuk Nam Liang
Vahab Mirrokni
24
24
0
03 Feb 2023
Quantum contextual bandits and recommender systems for quantum data
Shrigyan Brahmachari
Josep Lumbreras
Marco Tomamichel
32
3
0
31 Jan 2023
Bounded (O(1)) Regret Recommendation Learning via Synthetic Controls Oracle
Hyunwook Kang
P. R. Kumar
OffRL
33
1
0
29 Jan 2023
Smooth Non-Stationary Bandits
S. Jia
Qian Xie
Nathan Kallus
P. Frazier
98
9
0
29 Jan 2023
Contextual Bandits and Optimistically Universal Learning
Moise Blanchard
Steve Hanneke
Patrick Jaillet
OffRL
19
1
0
31 Dec 2022
Learning to Price Supply Chain Contracts against a Learning Retailer
Xuejun Zhao
Ruihao Zhu
W. Haskell
OffRL
10
0
0
02 Nov 2022
ANACONDA: An Improved Dynamic Regret Algorithm for Adaptive Non-Stationary Dueling Bandits
Thomas Kleine Buening
Aadirupa Saha
46
6
0
25 Oct 2022
Extending Open Bandit Pipeline to Simulate Industry Challenges
Bram van den Akker
N. Weber
Felipe Moraes
Dmitri Goldenberg
OffRL
16
1
0
09 Sep 2022
Decentralized Competing Bandits in Non-Stationary Matching Markets
Avishek Ghosh
Abishek Sankararaman
Kannan Ramchandran
T. Javidi
A. Mazumdar
28
4
0
31 May 2022
Non-Stationary Bandit Learning via Predictive Sampling
Yueyang Liu
Kuang Xu
Benjamin Van Roy
24
19
0
04 May 2022
Corralling a Larger Band of Bandits: A Case Study on Switching Regret for Linear Bandits
Haipeng Luo
Mengxiao Zhang
Peng Zhao
Zhi-Hua Zhou
31
17
0
12 Feb 2022
Bridging Adversarial and Nonstationary Multi-armed Bandit
Ningyuan Chen
Shuoguang Yang
Hailun Zhang
AAML
11
4
0
05 Jan 2022
Efficient and Optimal Algorithms for Contextual Dueling Bandits under Realizability
Aadirupa Saha
A. Krishnamurthy
31
35
0
24 Nov 2021
Optimal and Efficient Dynamic Regret Algorithms for Non-Stationary Dueling Bandits
Aadirupa Saha
Shubham Gupta
33
10
0
06 Nov 2021
The Pareto Frontier of model selection for general Contextual Bandits
T. V. Marinov
Julian Zimmert
13
22
0
25 Oct 2021
On Slowly-varying Non-stationary Bandits
Ramakrishnan Krishnamurthy
Médéric Fourmy
21
8
0
25 Oct 2021
Towards the D-Optimal Online Experiment Design for Recommender Selection
Madina Abdrakhmanova
Saniya Abushakimova
Evren Körpeoglu
H. A. Varol
Kannan Achan
14
3
0
23 Oct 2021
Adapting to Misspecification in Contextual Bandits
Dylan J. Foster
Claudio Gentile
M. Mohri
Julian Zimmert
11
84
0
12 Jul 2021
Periodic-GP: Learning Periodic World with Gaussian Process Bandits
Hengrui Cai
Zhihao Cen
Ling Leng
Rui Song
AI4TS
25
5
0
30 May 2021
When and Whom to Collaborate with in a Changing Environment: A Collaborative Dynamic Bandit Solution
Chuanhao Li
Qingyun Wu
Hongning Wang
39
5
0
14 Apr 2021
Dynamic Pricing and Learning under the Bass Model
Shipra Agrawal
Steven Yin
A. Zeevi
21
11
0
09 Mar 2021
Non-stationary Reinforcement Learning without Prior Knowledge: An Optimal Black-box Approach
Chen-Yu Wei
Haipeng Luo
OffRL
24
102
0
10 Feb 2021
Learning User Preferences in Non-Stationary Environments
Wasim Huleihel
S. Pal
O. Shayevitz
14
12
0
29 Jan 2021
Adversarial Linear Contextual Bandits with Graph-Structured Side Observations
Lingda Wang
Bingcong Li
Huozhi Zhou
G. Giannakis
L. Varshney
Zhizhen Zhao
13
7
0
10 Dec 2020
Non-Stationary Latent Bandits
Joey Hong
B. Kveton
Manzil Zaheer
Yinlam Chow
Amr Ahmed
Mohammad Ghavamzadeh
Craig Boutilier
OffRL
25
13
0
01 Dec 2020
Adversarial Dueling Bandits
Aadirupa Saha
Tomer Koren
Yishay Mansour
13
25
0
27 Oct 2020
Effects of Model Misspecification on Bayesian Bandits: Case Studies in UX Optimization
Mack Sweeney
M. Adelsberg
Kathryn B. Laskey
C. Domeniconi
16
1
0
07 Oct 2020
Model-Free Non-Stationary RL: Near-Optimal Regret and Applications in Multi-Agent RL and Inventory Control
Weichao Mao
Kaipeng Zhang
Ruihao Zhu
D. Simchi-Levi
Tamer Bacsar
22
13
0
07 Oct 2020
Learning Product Rankings Robust to Fake Users
Negin Golrezaei
Vahideh H. Manshadi
Jon Schneider
S. Sekar
13
26
0
10 Sep 2020
Unifying Clustered and Non-stationary Bandits
Chuanhao Li
Qingyun Wu
Hongning Wang
27
12
0
05 Sep 2020
Self-Tuning Bandits over Unknown Covariate-Shifts
Joe Suk
Samory Kpotufe
14
9
0
16 Jul 2020
Dynamic Regret of Policy Optimization in Non-stationary Environments
Yingjie Fei
Zhuoran Yang
Zhaoran Wang
Qiaomin Xie
16
54
0
30 Jun 2020
1
2
Next