Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1707.09727
Cited By
Taming Non-stationary Bandits: A Bayesian Approach
31 July 2017
Vishnu Raj
Sheetal Kalyani
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Taming Non-stationary Bandits: A Bayesian Approach"
40 / 40 papers shown
Title
Natural Policy Gradient for Average Reward Non-Stationary RL
Neharika Jali
Eshika Pathak
Pranay Sharma
Guannan Qu
Gauri Joshi
34
0
0
23 Apr 2025
Diminishing Exploration: A Minimalist Approach to Piecewise Stationary Multi-Armed Bandits
Kuan-Ta Li
Ping-Chun Hsieh
Yu-Chih Huang
34
1
0
08 Oct 2024
Improving Portfolio Optimization Results with Bandit Networks
Gustavo de Freitas Fonseca
Lucas Coelho e Silva
Paulo André Lima de Castro
16
0
0
05 Oct 2024
Bridging Rested and Restless Bandits with Graph-Triggering: Rising and Rotting
Gianmarco Genalti
Marco Mussi
Nicola Gatti
Marcello Restelli
Matteo Castiglioni
Alberto Maria Metelli
38
0
0
09 Sep 2024
Non-Stationary Latent Auto-Regressive Bandits
Anna L. Trella
Walter Dempsey
Asim H. Gazi
Ziping Xu
Finale Doshi-Velez
Susan A. Murphy
26
1
0
05 Feb 2024
Forced Exploration in Bandit Problems
Han Qi
Fei-Yu Guo
Li Zhu
18
0
0
12 Dec 2023
Non-Stationary Contextual Bandit Learning via Neural Predictive Ensemble Sampling
Zheqing Zhu
Yueyang Liu
Xu Kuang
Benjamin Van Roy
AI4TS
31
0
0
11 Oct 2023
Module-wise Adaptive Distillation for Multimodality Foundation Models
Chen Liang
Jiahui Yu
Ming-Hsuan Yang
Matthew A. Brown
Huayu Chen
Tuo Zhao
Boqing Gong
Tianyi Zhou
19
10
0
06 Oct 2023
Continual Learning as Computationally Constrained Reinforcement Learning
Saurabh Kumar
Henrik Marklund
Anand Srinivasa Rao
Yifan Zhu
Hong Jun Jeon
Yueyang Liu
Benjamin Van Roy
CLL
34
22
0
10 Jul 2023
Discounted Thompson Sampling for Non-Stationary Bandit Problems
Han Qi
Yue Wang
Li Zhu
40
4
0
18 May 2023
A Definition of Non-Stationary Bandits
Yueyang Liu
Kuang Xu
Benjamin Van Roy
24
11
0
23 Feb 2023
Reinforcement Learning for Combining Search Methods in the Calibration of Economic ABMs
Aldo Glielmo
Marco Favorito
Debmallya Chanda
Domenico Delli Gatti
37
6
0
23 Feb 2023
An Information-Theoretic Analysis of Nonstationary Bandit Learning
Seungki Min
Daniel Russo
34
6
0
09 Feb 2023
SLOPT: Bandit Optimization Framework for Mutation-Based Fuzzing
Yuki Koike
H. Katsura
Hiromu Yakura
Yuma Kurogome
31
5
0
07 Nov 2022
Reactive Exploration to Cope with Non-Stationarity in Lifelong Reinforcement Learning
C. Steinparz
Thomas Schmied
Fabian Paischer
Marius-Constantin Dinu
Vihang Patil
Angela Bitto-Nemling
Hamid Eghbalzadeh
Sepp Hochreiter
CLL
29
11
0
12 Jul 2022
Dynamic Memory for Interpretable Sequential Optimisation
S. Chennu
Andrew Maher
Jamie Martin
Subash Prabanantham
11
0
0
28 Jun 2022
Safety Aware Changepoint Detection for Piecewise i.i.d. Bandits
Subhojyoti Mukherjee
21
1
0
27 May 2022
Fast Change Identification in Multi-Play Bandits and its Applications in Wireless Networks
Gourab Ghatak
45
1
0
20 May 2022
Non-Stationary Bandit Learning via Predictive Sampling
Yueyang Liu
Kuang Xu
Benjamin Van Roy
24
19
0
04 May 2022
Existence conditions for hidden feedback loops in online recommender systems
A. Khritankov
Anton A. Pilkevich
21
1
0
11 Sep 2021
On Limited-Memory Subsampling Strategies for Bandits
Dorian Baudry
Yoan Russac
Olivier Cappé
13
7
0
21 Jun 2021
Nonstationary Stochastic Multiarmed Bandits: UCB Policies and Minimax Regret
Lai Wei
Vaibhav Srivastava
9
11
0
22 Jan 2021
An empirical evaluation of active inference in multi-armed bandits
D. Marković
Hrvoje Stojić
Sarah Schwöbel
S. Kiebel
42
34
0
21 Jan 2021
Blending Search and Discovery: Tag-Based Query Refinement with Contextual Reinforcement Learning
Bingqing Yu
Jacopo Tagliabue
15
1
0
15 Oct 2020
Effects of Model Misspecification on Bayesian Bandits: Case Studies in UX Optimization
Mack Sweeney
M. Adelsberg
Kathryn B. Laskey
C. Domeniconi
21
1
0
07 Oct 2020
A Change-Detection Based Thompson Sampling Framework for Non-Stationary Bandits
Gourab Ghatak
25
17
0
06 Sep 2020
An Online Algorithm for Computation Offloading in Non-Stationary Environments
Aniq Ur Rahman
Gourab Ghatak
Antonio De Domenico
11
12
0
22 Jun 2020
Odds-Ratio Thompson Sampling to Control for Time-Varying Effect
Sulgi Kim
Kyungmin Kim
16
0
0
04 Mar 2020
Boltzmann Exploration Expectation-Maximisation
Mathias Edman
Neil Dhir
22
3
0
18 Dec 2019
Adapting Behaviour for Learning Progress
Tom Schaul
Diana Borsa
David Ding
David Szepesvari
Georg Ostrovski
Will Dabney
Simon Osindero
22
18
0
14 Dec 2019
Recovering Bandits
Ciara Pike-Burke
Steffen Grunewalder
11
40
0
31 Oct 2019
Weighted Linear Bandits for Non-Stationary Environments
Yoan Russac
Claire Vernade
Olivier Cappé
82
102
0
19 Sep 2019
Accelerated learning from recommender systems using multi-armed bandit
Meisam Hejazinia
Kyler M. Eastman
Shu Ye
A. Amirabadi
Ravi Divvela
21
3
0
16 Aug 2019
Distribution-dependent and Time-uniform Bounds for Piecewise i.i.d Bandits
Subhojyoti Mukherjee
Odalric-Ambrym Maillard
6
11
0
30 May 2019
AutoSeM: Automatic Task Selection and Mixing in Multi-Task Learning
Han Guo
Ramakanth Pasunuru
Joey Tianyi Zhou
25
47
0
08 Apr 2019
Efficient Change-Point Detection for Tackling Piecewise-Stationary Bandits
Lilian Besson
E. Kaufmann
Odalric-Ambrym Maillard
Julien Seznec
23
43
0
05 Feb 2019
Learning to Optimize under Non-Stationarity
Wang Chi Cheung
D. Simchi-Levi
Ruihao Zhu
44
133
0
06 Oct 2018
On Abruptly-Changing and Slowly-Varying Multiarmed Bandit Problems
Lai Wei
Vaibhav Srivastava
32
37
0
23 Feb 2018
On Adaptive Estimation for Dynamic Bernoulli Bandits
Xueguang Lu
N. Adams
N. Kantas
17
8
0
08 Dec 2017
Discrepancy-Based Algorithms for Non-Stationary Rested Bandits
Corinna Cortes
Giulia DeSalvo
Vitaly Kuznetsov
M. Mohri
Scott Yang
11
19
0
29 Oct 2017
1