ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2302.12202
  4. Cited By
A Definition of Non-Stationary Bandits

A Definition of Non-Stationary Bandits

23 February 2023
Yueyang Liu
Kuang Xu
Benjamin Van Roy
ArXivPDFHTML

Papers citing "A Definition of Non-Stationary Bandits"

15 / 15 papers shown
Title
An Information-Theoretic Analysis of Nonstationary Bandit Learning
An Information-Theoretic Analysis of Nonstationary Bandit Learning
Seungki Min
Daniel Russo
46
7
0
09 Feb 2023
Lifting the Information Ratio: An Information-Theoretic Analysis of
  Thompson Sampling for Contextual Bandits
Lifting the Information Ratio: An Information-Theoretic Analysis of Thompson Sampling for Contextual Bandits
Gergely Neu
Julia Olkhovskaya
Matteo Papini
Ludovic Schwartz
65
16
0
27 May 2022
Non-Stationary Bandit Learning via Predictive Sampling
Non-Stationary Bandit Learning via Predictive Sampling
Yueyang Liu
Kuang Xu
Benjamin Van Roy
44
19
0
04 May 2022
A Simple Approach for Non-stationary Linear Bandits
A Simple Approach for Non-stationary Linear Bandits
Peng Zhao
Lijun Zhang
Yuan Jiang
Zhi Zhou
54
84
0
09 Mar 2021
Reinforcement Learning, Bit by Bit
Reinforcement Learning, Bit by Bit
Xiuyuan Lu
Benjamin Van Roy
Vikranth Dwaracherla
M. Ibrahimi
Ian Osband
Zheng Wen
44
70
0
06 Mar 2021
Weighted Linear Bandits for Non-Stationary Environments
Weighted Linear Bandits for Non-Stationary Environments
Yoan Russac
Claire Vernade
Olivier Cappé
120
107
0
19 Sep 2019
Regret Bounds for Thompson Sampling in Episodic Restless Bandit Problems
Regret Bounds for Thompson Sampling in Episodic Restless Bandit Problems
Young Hun Jung
Ambuj Tewari
50
44
0
29 May 2019
A New Algorithm for Non-stationary Contextual Bandits: Efficient,
  Optimal, and Parameter-free
A New Algorithm for Non-stationary Contextual Bandits: Efficient, Optimal, and Parameter-free
Yifang Chen
Chung-Wei Lee
Haipeng Luo
Chen-Yu Wei
111
133
0
03 Feb 2019
An Information-Theoretic Approach to Minimax Regret in Partial
  Monitoring
An Information-Theoretic Approach to Minimax Regret in Partial Monitoring
Tor Lattimore
Csaba Szepesvári
35
70
0
01 Feb 2019
Nearly Optimal Adaptive Procedure with Change Detection for
  Piecewise-Stationary Bandit
Nearly Optimal Adaptive Procedure with Change Detection for Piecewise-Stationary Bandit
Yang Cao
Zheng Wen
Branislav Kveton
Yao Xie
57
95
0
11 Feb 2018
A Change-Detection based Framework for Piecewise-stationary Multi-Armed
  Bandit Problem
A Change-Detection based Framework for Piecewise-stationary Multi-Armed Bandit Problem
Fang Liu
Joohyung Lee
Ness B. Shroff
54
116
0
08 Nov 2017
Efficient Contextual Bandits in Non-stationary Worlds
Efficient Contextual Bandits in Non-stationary Worlds
Haipeng Luo
Chen-Yu Wei
Alekh Agarwal
John Langford
57
131
0
05 Aug 2017
Taming Non-stationary Bandits: A Bayesian Approach
Taming Non-stationary Bandits: A Bayesian Approach
Vishnu Raj
Sheetal Kalyani
107
76
0
31 Jul 2017
Time-Varying Gaussian Process Bandit Optimization
Time-Varying Gaussian Process Bandit Optimization
Ilija Bogunovic
Jonathan Scarlett
Volkan Cevher
84
96
0
25 Jan 2016
Learning in A Changing World: Restless Multi-Armed Bandit with Unknown
  Dynamics
Learning in A Changing World: Restless Multi-Armed Bandit with Unknown Dynamics
Haoyang Liu
Keqin Liu
Qing Zhao
108
159
0
22 Nov 2010
1