Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1707.08423
Cited By
Non-Stationary Bandits with Habituation and Recovery Dynamics
26 July 2017
Yonatan Dov Mintz
A. Aswani
Philip M. Kaminsky
E. Flowers
Yoshimi Fukuoka
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Non-Stationary Bandits with Habituation and Recovery Dynamics"
21 / 21 papers shown
Title
Preferences Evolve And So Should Your Bandits: Bandits with Evolving States for Online Platforms
Khashayar Khosravi
R. Leme
Chara Podimata
Apostolis Tsorvantzis
61
0
0
21 Jul 2023
Rotting Bandits
Nir Levine
K. Crammer
Shie Mannor
54
102
0
23 Feb 2017
Statistics with Set-Valued Functions: Applications to Inverse Approximate Optimization
A. Aswani
54
17
0
02 Feb 2017
Dynamic Regret of Strongly Adaptive Methods
Lijun Zhang
Tianbao Yang
Rong Jin
Zhi Zhou
ODL
36
10
0
26 Jan 2017
Always Valid Inference: Bringing Sequential Analysis to A/B Testing
Ramesh Johari
L. Pekelis
David Walsh
45
94
0
15 Dec 2015
Multi-armed Bandit Problem with Known Trend
Djallel Bouneffouf
Raphael Feraud
37
83
0
28 Aug 2015
Bayesian optimization for materials design
P. Frazier
Jialei Wang
AI4CE
53
228
0
03 Jun 2015
Strongly Adaptive Online Learning
Amit Daniely
Alon Gonen
Shai Shalev-Shwartz
ODL
165
178
0
25 Feb 2015
Optimal Exploration-Exploitation in a Multi-Armed-Bandit Problem with Non-stationary Rewards
Omar Besbes
Y. Gur
A. Zeevi
66
127
0
13 May 2014
An Information-Theoretic Analysis of Thompson Sampling
Daniel Russo
Benjamin Van Roy
153
425
0
21 Mar 2014
From Predictive to Prescriptive Analytics
Dimitris Bertsimas
Nathan Kallus
126
541
0
22 Feb 2014
Concentration in unbounded metric spaces and algorithmic stability
A. Kontorovich
78
55
0
04 Sep 2013
Non-stationary Stochastic Optimization
Omar Besbes
Y. Gur
A. Zeevi
178
433
0
20 Jul 2013
Learning to Optimize Via Posterior Sampling
Daniel Russo
Benjamin Van Roy
195
701
0
11 Jan 2013
Thompson Sampling for Contextual Bandits with Linear Payoffs
Shipra Agrawal
Navin Goyal
195
1,000
0
15 Sep 2012
Point-Based POMDP Algorithms: Improved Analysis and Implementation
Trey Smith
R. Simmons
70
423
0
04 Jul 2012
The KL-UCB Algorithm for Bounded Stochastic Bandits and Beyond
Aurélien Garivier
Olivier Cappé
168
612
0
12 Feb 2011
A Dynamic Near-Optimal Algorithm for Online Linear Programming
Shipra Agrawal
Zizhuo Wang
Yinyu Ye
107
310
0
16 Nov 2009
Distributed Learning in Multi-Armed Bandit with Multiple Players
Keqin Liu
Qing Zhao
97
438
0
12 Oct 2009
Multi-Armed Bandits in Metric Spaces
Robert D. Kleinberg
Aleksandrs Slivkins
E. Upfal
397
468
0
29 Sep 2008
On Upper-Confidence Bound Policies for Non-Stationary Bandit Problems
Aurélien Garivier
Eric Moulines
88
295
0
22 May 2008
1