Efficient Contextual Bandits in Non-stationary Worlds


5 August 2017
Haipeng Luo
Chen-Yu Wei
Alekh Agarwal
John Langford
arXiv:1708.01799

Papers citing "Efficient Contextual Bandits in Non-stationary Worlds"

50 / 72 papers shown
Beyond IID: data-driven decision-making in heterogeneous environments
Omar Besbes
Will Ma
Omar Mouchtaki
42
7
0
03 Jan 2025
Improved Regret Bounds for Bandits with Expert Advice
Nicolò Cesa-Bianchi
Khaled Eldowa
Emmanuel Esposito
Julia Olkhovskaya
35
0
0
24 Jun 2024
A Contextual Online Learning Theory of Brokerage
F. Bachoc
Tommaso Cesari
Roberto Colomboni
28
2
0
22 May 2024
Mitigating Biases in Collective Decision-Making: Enhancing Performance in the Face of Fake News
Axel Abels
Elias Fernández Domingos
Ann Nowé
Tom Lenaerts
19
1
0
11 Mar 2024
Near-optimal Per-Action Regret Bounds for Sleeping Bandits
Quan Nguyen
Nishant A. Mehta
19
1
0
02 Mar 2024
Adaptive Interventions with User-Defined Goals for Health Behavior Change
Aishwarya Mandyam
Matthew Joerke
William Denton
Barbara E. Engelhardt
Emma Brunskill
32
1
0
16 Nov 2023
An Improved Relaxation for Oracle-Efficient Adversarial Contextual Bandits
Kiarash Banihashem
Mohammadtaghi Hajiaghayi
Suho Shin
Max Springer
16
1
0
29 Oct 2023
A Stability Principle for Learning under Non-Stationarity
Chengpiao Huang
Kaizheng Wang
39
2
0
27 Oct 2023
Non-Stationary Contextual Bandit Learning via Neural Predictive Ensemble Sampling
Zheqing Zhu
Yueyang Liu
Xu Kuang
Benjamin Van Roy
AI4TS
29
0
0
11 Oct 2023
Bypassing the Simulator: Near-Optimal Adversarial Linear Contextual Bandits
Haolin Liu
Chen-Yu Wei
Julian Zimmert
30
9
0
02 Sep 2023
Online Learning with Costly Features in Non-stationary Environments
Saeed Ghoorchian
E. Kortukov
S. Maghsudi
OffRL
33
1
0
18 Jul 2023
Tracking Most Significant Shifts in Nonparametric Contextual Bandits
Joe Suk
Samory Kpotufe
38
5
0
11 Jul 2023
Meta-Learning Adversarial Bandit Algorithms
M. Khodak
Ilya Osadchiy
Keegan Harris
Maria-Florina Balcan
Kfir Y. Levy
Ron Meir
Zhiwei Steven Wu
FedML
28
2
0
05 Jul 2023
Non-stationary Reinforcement Learning under General Function Approximation
Songtao Feng
Ming Yin
Ruiquan Huang
Yu-Xiang Wang
J. Yang
Yitao Liang
18
8
0
01 Jun 2023
Energy Regularized RNNs for Solving Non-Stationary Bandit Problems
Michael Rotman
Lior Wolf
16
1
0
12 Mar 2023
MNL-Bandit in non-stationary environments
Ayoub Foussoul
Vineet Goyal
Varun Gupta
34
2
0
04 Mar 2023
A Definition of Non-Stationary Bandits
Yueyang Liu
Kuang Xu
Benjamin Van Roy
24
11
0
23 Feb 2023
Linear Bandits with Memory: from Rotting to Rising
Giulia Clerici
Pierre Laforgue
Nicolò Cesa-Bianchi
25
3
0
16 Feb 2023
Multi-channel Autobidding with Budget and ROI Constraints
Yuan Deng
Negin Golrezaei
Patrick Jaillet
Jason Cheuk Nam Liang
Vahab Mirrokni
24
24
0
03 Feb 2023
Quantum contextual bandits and recommender systems for quantum data
Shrigyan Brahmachari
Josep Lumbreras
Marco Tomamichel
32
3
0
31 Jan 2023
Bounded (O(1)) Regret Recommendation Learning via Synthetic Controls Oracle
Hyunwook Kang
P. R. Kumar
OffRL
33
1
0
29 Jan 2023
Smooth Non-Stationary Bandits
S. Jia
Qian Xie
Nathan Kallus
P. Frazier
98
9
0
29 Jan 2023
Contextual Bandits and Optimistically Universal Learning
Moise Blanchard
Steve Hanneke
Patrick Jaillet
OffRL
19
1
0
31 Dec 2022
Learning to Price Supply Chain Contracts against a Learning Retailer
Xuejun Zhao
Ruihao Zhu
W. Haskell
OffRL
10
0
0
02 Nov 2022
ANACONDA: An Improved Dynamic Regret Algorithm for Adaptive Non-Stationary Dueling Bandits
Thomas Kleine Buening
Aadirupa Saha
46
6
0
25 Oct 2022
Extending Open Bandit Pipeline to Simulate Industry Challenges
Bram van den Akker
N. Weber
Felipe Moraes
Dmitri Goldenberg
OffRL
16
1
0
09 Sep 2022
Decentralized Competing Bandits in Non-Stationary Matching Markets
Avishek Ghosh
Abishek Sankararaman
Kannan Ramchandran
T. Javidi
A. Mazumdar
28
4
0
31 May 2022
Non-Stationary Bandit Learning via Predictive Sampling
Yueyang Liu
Kuang Xu
Benjamin Van Roy
24
19
0
04 May 2022
Corralling a Larger Band of Bandits: A Case Study on Switching Regret for Linear Bandits
Haipeng Luo
Mengxiao Zhang
Peng Zhao
Zhi-Hua Zhou
31
17
0
12 Feb 2022
Bridging Adversarial and Nonstationary Multi-armed Bandit
Ningyuan Chen
Shuoguang Yang
Hailun Zhang
AAML
11
4
0
05 Jan 2022
Efficient and Optimal Algorithms for Contextual Dueling Bandits under Realizability
Aadirupa Saha
A. Krishnamurthy
31
35
0
24 Nov 2021
Optimal and Efficient Dynamic Regret Algorithms for Non-Stationary Dueling Bandits
Aadirupa Saha
Shubham Gupta
33
10
0
06 Nov 2021
The Pareto Frontier of model selection for general Contextual Bandits
T. V. Marinov
Julian Zimmert
13
22
0
25 Oct 2021
On Slowly-varying Non-stationary Bandits
Ramakrishnan Krishnamurthy
Médéric Fourmy
21
8
0
25 Oct 2021
Towards the D-Optimal Online Experiment Design for Recommender Selection
Madina Abdrakhmanova
Saniya Abushakimova
Evren Körpeoglu
H. A. Varol
Kannan Achan
14
3
0
23 Oct 2021
Adapting to Misspecification in Contextual Bandits
Dylan J. Foster
Claudio Gentile
M. Mohri
Julian Zimmert
11
84
0
12 Jul 2021
Periodic-GP: Learning Periodic World with Gaussian Process Bandits
Hengrui Cai
Zhihao Cen
Ling Leng
Rui Song
AI4TS
25
5
0
30 May 2021
When and Whom to Collaborate with in a Changing Environment: A Collaborative Dynamic Bandit Solution
Chuanhao Li
Qingyun Wu
Hongning Wang
39
5
0
14 Apr 2021
Dynamic Pricing and Learning under the Bass Model
Shipra Agrawal
Steven Yin
A. Zeevi
21
11
0
09 Mar 2021
Non-stationary Reinforcement Learning without Prior Knowledge: An Optimal Black-box Approach
Chen-Yu Wei
Haipeng Luo
OffRL
24
102
0
10 Feb 2021
Learning User Preferences in Non-Stationary Environments
Wasim Huleihel
S. Pal
O. Shayevitz
14
12
0
29 Jan 2021
Adversarial Linear Contextual Bandits with Graph-Structured Side Observations
Lingda Wang
Bingcong Li
Huozhi Zhou
G. Giannakis
L. Varshney
Zhizhen Zhao
13
7
0
10 Dec 2020
Non-Stationary Latent Bandits
Joey Hong
B. Kveton
Manzil Zaheer
Yinlam Chow
Amr Ahmed
Mohammad Ghavamzadeh
Craig Boutilier
OffRL
25
13
0
01 Dec 2020
Adversarial Dueling Bandits
Aadirupa Saha
Tomer Koren
Yishay Mansour
13
25
0
27 Oct 2020
Effects of Model Misspecification on Bayesian Bandits: Case Studies in UX Optimization
Mack Sweeney
M. Adelsberg
Kathryn B. Laskey
C. Domeniconi
16
1
0
07 Oct 2020
Model-Free Non-Stationary RL: Near-Optimal Regret and Applications in Multi-Agent RL and Inventory Control
Weichao Mao
Kaipeng Zhang
Ruihao Zhu
D. Simchi-Levi
Tamer Başar
22
13
0
07 Oct 2020
Learning Product Rankings Robust to Fake Users
Negin Golrezaei
Vahideh H. Manshadi
Jon Schneider
S. Sekar
13
26
0
10 Sep 2020
Unifying Clustered and Non-stationary Bandits
Chuanhao Li
Qingyun Wu
Hongning Wang
27
12
0
05 Sep 2020
Self-Tuning Bandits over Unknown Covariate-Shifts
Joe Suk
Samory Kpotufe
14
9
0
16 Jul 2020
Dynamic Regret of Policy Optimization in Non-stationary Environments
Yingjie Fei
Zhuoran Yang
Zhaoran Wang
Qiaomin Xie
16
54
0
30 Jun 2020