1806.00973
Sequential Test for the Lowest Mean: From Thompson to Murphy Sampling
4 June 2018
E. Kaufmann
Wouter M. Koolen
Aurélien Garivier
Papers citing "Sequential Test for the Lowest Mean: From Thompson to Murphy Sampling"
16 / 16 papers shown
Title
Best-Arm Identification in Unimodal Bandits
Riccardo Poiani
Marc Jourdan
E. Kaufmann
Rémy Degenne
91
0
0
04 Nov 2024
Thresholding Bandit for Dose-ranging: The Impact of Monotonicity
Aurélien Garivier
Pierre Ménard
Laurent Rossi
37
27
0
13 Nov 2017
Structured Best Arm Identification with Fixed Confidence
Ruitong Huang
Mohammad M. Ajallooeian
Csaba Szepesvári
Martin Müller
41
25
0
16 Jun 2017
Monte-Carlo Tree Search by Best Arm Identification
E. Kaufmann
Wouter M. Koolen
39
37
0
09 Jun 2017
Nearly Optimal Sampling Algorithms for Combinatorial Pure Exploration
Lijie Chen
Anupam Gupta
Jiacheng Li
Mingda Qiao
Ruosong Wang
78
47
0
04 Jun 2017
The Simulator: Understanding Adaptive Sampling in the Moderate-Confidence Regime
Max Simchowitz
Kevin Jamieson
Benjamin Recht
67
66
0
16 Feb 2017
An optimal algorithm for the Thresholding Bandit Problem
A. Locatelli
Maurilio Gutzeit
Alexandra Carpentier
35
132
0
27 May 2016
Simple Bayesian Algorithms for Best Arm Identification
Daniel Russo
43
275
0
26 Feb 2016
Explore First, Exploit Next: The True Shape of Regret in Bandit Problems
Aurélien Garivier
Pierre Ménard
Gilles Stoltz
40
211
0
23 Feb 2016
Maximin Action Identification: A New Bandit Framework for Games
Aurélien Garivier
E. Kaufmann
Wouter M. Koolen
34
28
0
15 Feb 2016
Optimal Best Arm Identification with Fixed Confidence
Aurélien Garivier
E. Kaufmann
48
341
0
15 Feb 2016
On the Complexity of Best Arm Identification in Multi-Armed Bandit Models
E. Kaufmann
Olivier Cappé
Aurélien Garivier
98
1,021
0
16 Jul 2014
lil' UCB : An Optimal Exploration Algorithm for Multi-Armed Bandits
Kevin Jamieson
Matthew Malloy
Robert D. Nowak
Sébastien Bubeck
49
411
0
27 Dec 2013
Estimating the Maximum Expected Value: An Analysis of (Nested) Cross Validation and the Maximum Sample Average
H. V. Hasselt
36
28
0
28 Feb 2013
Kullback-Leibler upper confidence bounds for optimal sequential allocation
Olivier Cappé
Aurélien Garivier
Odalric-Ambrym Maillard
Rémi Munos
Gilles Stoltz
70
394
0
03 Oct 2012
Thompson Sampling: An Asymptotically Optimal Finite Time Analysis
E. Kaufmann
N. Korda
Rémi Munos
79
585
0
18 May 2012