ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1707.09727
  4. Cited By
Taming Non-stationary Bandits: A Bayesian Approach

Taming Non-stationary Bandits: A Bayesian Approach

31 July 2017
Vishnu Raj
Sheetal Kalyani
ArXivPDFHTML

Papers citing "Taming Non-stationary Bandits: A Bayesian Approach"

40 / 40 papers shown
Title
Natural Policy Gradient for Average Reward Non-Stationary RL
Natural Policy Gradient for Average Reward Non-Stationary RL
Neharika Jali
Eshika Pathak
Pranay Sharma
Guannan Qu
Gauri Joshi
37
0
0
23 Apr 2025
Diminishing Exploration: A Minimalist Approach to Piecewise Stationary
  Multi-Armed Bandits
Diminishing Exploration: A Minimalist Approach to Piecewise Stationary Multi-Armed Bandits
Kuan-Ta Li
Ping-Chun Hsieh
Yu-Chih Huang
34
1
0
08 Oct 2024
Improving Portfolio Optimization Results with Bandit Networks
Improving Portfolio Optimization Results with Bandit Networks
Gustavo de Freitas Fonseca
Lucas Coelho e Silva
Paulo André Lima de Castro
16
0
0
05 Oct 2024
Bridging Rested and Restless Bandits with Graph-Triggering: Rising and
  Rotting
Bridging Rested and Restless Bandits with Graph-Triggering: Rising and Rotting
Gianmarco Genalti
Marco Mussi
Nicola Gatti
Marcello Restelli
Matteo Castiglioni
Alberto Maria Metelli
38
0
0
09 Sep 2024
Non-Stationary Latent Auto-Regressive Bandits
Non-Stationary Latent Auto-Regressive Bandits
Anna L. Trella
Walter Dempsey
Asim H. Gazi
Ziping Xu
Finale Doshi-Velez
Susan A. Murphy
26
1
0
05 Feb 2024
Forced Exploration in Bandit Problems
Forced Exploration in Bandit Problems
Han Qi
Fei-Yu Guo
Li Zhu
20
0
0
12 Dec 2023
Non-Stationary Contextual Bandit Learning via Neural Predictive Ensemble
  Sampling
Non-Stationary Contextual Bandit Learning via Neural Predictive Ensemble Sampling
Zheqing Zhu
Yueyang Liu
Xu Kuang
Benjamin Van Roy
AI4TS
34
0
0
11 Oct 2023
Module-wise Adaptive Distillation for Multimodality Foundation Models
Module-wise Adaptive Distillation for Multimodality Foundation Models
Chen Liang
Jiahui Yu
Ming-Hsuan Yang
Matthew A. Brown
Huayu Chen
Tuo Zhao
Boqing Gong
Tianyi Zhou
19
10
0
06 Oct 2023
Continual Learning as Computationally Constrained Reinforcement Learning
Continual Learning as Computationally Constrained Reinforcement Learning
Saurabh Kumar
Henrik Marklund
Anand Srinivasa Rao
Yifan Zhu
Hong Jun Jeon
Yueyang Liu
Benjamin Van Roy
CLL
34
22
0
10 Jul 2023
Discounted Thompson Sampling for Non-Stationary Bandit Problems
Discounted Thompson Sampling for Non-Stationary Bandit Problems
Han Qi
Yue Wang
Li Zhu
40
4
0
18 May 2023
A Definition of Non-Stationary Bandits
A Definition of Non-Stationary Bandits
Yueyang Liu
Kuang Xu
Benjamin Van Roy
24
11
0
23 Feb 2023
Reinforcement Learning for Combining Search Methods in the Calibration
  of Economic ABMs
Reinforcement Learning for Combining Search Methods in the Calibration of Economic ABMs
Aldo Glielmo
Marco Favorito
Debmallya Chanda
Domenico Delli Gatti
37
6
0
23 Feb 2023
An Information-Theoretic Analysis of Nonstationary Bandit Learning
An Information-Theoretic Analysis of Nonstationary Bandit Learning
Seungki Min
Daniel Russo
34
6
0
09 Feb 2023
SLOPT: Bandit Optimization Framework for Mutation-Based Fuzzing
SLOPT: Bandit Optimization Framework for Mutation-Based Fuzzing
Yuki Koike
H. Katsura
Hiromu Yakura
Yuma Kurogome
31
5
0
07 Nov 2022
Reactive Exploration to Cope with Non-Stationarity in Lifelong
  Reinforcement Learning
Reactive Exploration to Cope with Non-Stationarity in Lifelong Reinforcement Learning
C. Steinparz
Thomas Schmied
Fabian Paischer
Marius-Constantin Dinu
Vihang Patil
Angela Bitto-Nemling
Hamid Eghbalzadeh
Sepp Hochreiter
CLL
29
11
0
12 Jul 2022
Dynamic Memory for Interpretable Sequential Optimisation
Dynamic Memory for Interpretable Sequential Optimisation
S. Chennu
Andrew Maher
Jamie Martin
Subash Prabanantham
13
0
0
28 Jun 2022
Safety Aware Changepoint Detection for Piecewise i.i.d. Bandits
Safety Aware Changepoint Detection for Piecewise i.i.d. Bandits
Subhojyoti Mukherjee
21
1
0
27 May 2022
Fast Change Identification in Multi-Play Bandits and its Applications in
  Wireless Networks
Fast Change Identification in Multi-Play Bandits and its Applications in Wireless Networks
Gourab Ghatak
45
1
0
20 May 2022
Non-Stationary Bandit Learning via Predictive Sampling
Non-Stationary Bandit Learning via Predictive Sampling
Yueyang Liu
Kuang Xu
Benjamin Van Roy
24
19
0
04 May 2022
Existence conditions for hidden feedback loops in online recommender
  systems
Existence conditions for hidden feedback loops in online recommender systems
A. Khritankov
Anton A. Pilkevich
21
1
0
11 Sep 2021
On Limited-Memory Subsampling Strategies for Bandits
On Limited-Memory Subsampling Strategies for Bandits
Dorian Baudry
Yoan Russac
Olivier Cappé
13
7
0
21 Jun 2021
Nonstationary Stochastic Multiarmed Bandits: UCB Policies and Minimax
  Regret
Nonstationary Stochastic Multiarmed Bandits: UCB Policies and Minimax Regret
Lai Wei
Vaibhav Srivastava
11
11
0
22 Jan 2021
An empirical evaluation of active inference in multi-armed bandits
An empirical evaluation of active inference in multi-armed bandits
D. Marković
Hrvoje Stojić
Sarah Schwöbel
S. Kiebel
42
34
0
21 Jan 2021
Blending Search and Discovery: Tag-Based Query Refinement with
  Contextual Reinforcement Learning
Blending Search and Discovery: Tag-Based Query Refinement with Contextual Reinforcement Learning
Bingqing Yu
Jacopo Tagliabue
15
1
0
15 Oct 2020
Effects of Model Misspecification on Bayesian Bandits: Case Studies in
  UX Optimization
Effects of Model Misspecification on Bayesian Bandits: Case Studies in UX Optimization
Mack Sweeney
M. Adelsberg
Kathryn B. Laskey
C. Domeniconi
21
1
0
07 Oct 2020
A Change-Detection Based Thompson Sampling Framework for Non-Stationary
  Bandits
A Change-Detection Based Thompson Sampling Framework for Non-Stationary Bandits
Gourab Ghatak
25
17
0
06 Sep 2020
An Online Algorithm for Computation Offloading in Non-Stationary
  Environments
An Online Algorithm for Computation Offloading in Non-Stationary Environments
Aniq Ur Rahman
Gourab Ghatak
Antonio De Domenico
11
12
0
22 Jun 2020
Odds-Ratio Thompson Sampling to Control for Time-Varying Effect
Odds-Ratio Thompson Sampling to Control for Time-Varying Effect
Sulgi Kim
Kyungmin Kim
19
0
0
04 Mar 2020
Boltzmann Exploration Expectation-Maximisation
Boltzmann Exploration Expectation-Maximisation
Mathias Edman
Neil Dhir
22
3
0
18 Dec 2019
Adapting Behaviour for Learning Progress
Adapting Behaviour for Learning Progress
Tom Schaul
Diana Borsa
David Ding
David Szepesvari
Georg Ostrovski
Will Dabney
Simon Osindero
22
18
0
14 Dec 2019
Recovering Bandits
Recovering Bandits
Ciara Pike-Burke
Steffen Grunewalder
13
40
0
31 Oct 2019
Weighted Linear Bandits for Non-Stationary Environments
Weighted Linear Bandits for Non-Stationary Environments
Yoan Russac
Claire Vernade
Olivier Cappé
82
102
0
19 Sep 2019
Accelerated learning from recommender systems using multi-armed bandit
Accelerated learning from recommender systems using multi-armed bandit
Meisam Hejazinia
Kyler M. Eastman
Shu Ye
A. Amirabadi
Ravi Divvela
21
3
0
16 Aug 2019
Distribution-dependent and Time-uniform Bounds for Piecewise i.i.d
  Bandits
Distribution-dependent and Time-uniform Bounds for Piecewise i.i.d Bandits
Subhojyoti Mukherjee
Odalric-Ambrym Maillard
8
11
0
30 May 2019
AutoSeM: Automatic Task Selection and Mixing in Multi-Task Learning
AutoSeM: Automatic Task Selection and Mixing in Multi-Task Learning
Han Guo
Ramakanth Pasunuru
Joey Tianyi Zhou
25
47
0
08 Apr 2019
Efficient Change-Point Detection for Tackling Piecewise-Stationary
  Bandits
Efficient Change-Point Detection for Tackling Piecewise-Stationary Bandits
Lilian Besson
E. Kaufmann
Odalric-Ambrym Maillard
Julien Seznec
23
43
0
05 Feb 2019
Learning to Optimize under Non-Stationarity
Learning to Optimize under Non-Stationarity
Wang Chi Cheung
D. Simchi-Levi
Ruihao Zhu
47
133
0
06 Oct 2018
On Abruptly-Changing and Slowly-Varying Multiarmed Bandit Problems
On Abruptly-Changing and Slowly-Varying Multiarmed Bandit Problems
Lai Wei
Vaibhav Srivastava
32
37
0
23 Feb 2018
On Adaptive Estimation for Dynamic Bernoulli Bandits
On Adaptive Estimation for Dynamic Bernoulli Bandits
Xueguang Lu
N. Adams
N. Kantas
20
8
0
08 Dec 2017
Discrepancy-Based Algorithms for Non-Stationary Rested Bandits
Corinna Cortes
Giulia DeSalvo
Vitaly Kuznetsov
M. Mohri
Scott Yang
13
19
0
29 Oct 2017
1