Non-Stationary Bandits with Habituation and Recovery Dynamics

26 July 2017

Papers citing "Non-Stationary Bandits with Habituation and Recovery Dynamics"

21 / 21 papers shown

Title
Preferences Evolve And So Should Your Bandits: Bandits with Evolving States for Online Platforms Khashayar Khosravi R. Leme Chara Podimata Apostolis Tsorvantzis 61 0 0 21 Jul 2023
Rotting Bandits Nir Levine K. Crammer Shie Mannor 54 102 0 23 Feb 2017
Statistics with Set-Valued Functions: Applications to Inverse Approximate Optimization A. Aswani 54 17 0 02 Feb 2017
Dynamic Regret of Strongly Adaptive Methods Lijun Zhang Tianbao Yang Rong Jin Zhi Zhou ODL 36 10 0 26 Jan 2017
Always Valid Inference: Bringing Sequential Analysis to A/B Testing Ramesh Johari L. Pekelis David Walsh 45 94 0 15 Dec 2015
Multi-armed Bandit Problem with Known Trend Djallel Bouneffouf Raphael Feraud 37 83 0 28 Aug 2015
Bayesian optimization for materials design P. Frazier Jialei Wang AI4CE 53 228 0 03 Jun 2015
Strongly Adaptive Online Learning Amit Daniely Alon Gonen Shai Shalev-Shwartz ODL 165 178 0 25 Feb 2015
Optimal Exploration-Exploitation in a Multi-Armed-Bandit Problem with Non-stationary Rewards Omar Besbes Y. Gur A. Zeevi 66 127 0 13 May 2014
An Information-Theoretic Analysis of Thompson Sampling Daniel Russo Benjamin Van Roy 153 425 0 21 Mar 2014
From Predictive to Prescriptive Analytics Dimitris Bertsimas Nathan Kallus 126 541 0 22 Feb 2014
Concentration in unbounded metric spaces and algorithmic stability A. Kontorovich 78 55 0 04 Sep 2013
Non-stationary Stochastic Optimization Omar Besbes Y. Gur A. Zeevi 178 433 0 20 Jul 2013
Learning to Optimize Via Posterior Sampling Daniel Russo Benjamin Van Roy 195 701 0 11 Jan 2013
Thompson Sampling for Contextual Bandits with Linear Payoffs Shipra Agrawal Navin Goyal 195 1,000 0 15 Sep 2012
Point-Based POMDP Algorithms: Improved Analysis and Implementation Trey Smith R. Simmons 70 423 0 04 Jul 2012
The KL-UCB Algorithm for Bounded Stochastic Bandits and Beyond Aurélien Garivier Olivier Cappé 168 612 0 12 Feb 2011
A Dynamic Near-Optimal Algorithm for Online Linear Programming Shipra Agrawal Zizhuo Wang Yinyu Ye 107 310 0 16 Nov 2009
Distributed Learning in Multi-Armed Bandit with Multiple Players Keqin Liu Qing Zhao 97 438 0 12 Oct 2009
Multi-Armed Bandits in Metric Spaces Robert D. Kleinberg Aleksandrs Slivkins E. Upfal 397 468 0 29 Sep 2008
On Upper-Confidence Bound Policies for Non-Stationary Bandit Problems Aurélien Garivier Eric Moulines 88 295 0 22 May 2008