Sequential Test for the Lowest Mean: From Thompson to Murphy Sampling

4 June 2018

Papers citing "Sequential Test for the Lowest Mean: From Thompson to Murphy Sampling"

Title
Best-Arm Identification in Unimodal Bandits Riccardo Poiani Marc Jourdan E. Kaufmann Rémy Degenne 91 0 0 04 Nov 2024
Thresholding Bandit for Dose-ranging: The Impact of Monotonicity Aurélien Garivier Pierre Ménard Laurent Rossi 37 27 0 13 Nov 2017
Structured Best Arm Identification with Fixed Confidence Ruitong Huang Mohammad M. Ajallooeian Csaba Szepesvári Martin Müller 41 25 0 16 Jun 2017
Monte-Carlo Tree Search by Best Arm Identification E. Kaufmann Wouter M. Koolen 39 37 0 09 Jun 2017
Nearly Optimal Sampling Algorithms for Combinatorial Pure Exploration Lijie Chen Anupam Gupta Jiacheng Li Mingda Qiao Ruosong Wang 78 47 0 04 Jun 2017
The Simulator: Understanding Adaptive Sampling in the Moderate-Confidence Regime Max Simchowitz Kevin Jamieson Benjamin Recht 67 66 0 16 Feb 2017
An optimal algorithm for the Thresholding Bandit Problem A. Locatelli Maurilio Gutzeit Alexandra Carpentier 35 132 0 27 May 2016
Simple Bayesian Algorithms for Best Arm Identification Daniel Russo 43 275 0 26 Feb 2016
Explore First, Exploit Next: The True Shape of Regret in Bandit Problems Aurélien Garivier Pierre Ménard Gilles Stoltz 40 211 0 23 Feb 2016
Maximin Action Identification: A New Bandit Framework for Games Aurélien Garivier E. Kaufmann Wouter M. Koolen 34 28 0 15 Feb 2016
Optimal Best Arm Identification with Fixed Confidence Aurélien Garivier E. Kaufmann 48 341 0 15 Feb 2016
On the Complexity of Best Arm Identification in Multi-Armed Bandit Models E. Kaufmann Olivier Cappé Aurélien Garivier 98 1,021 0 16 Jul 2014
lil' UCB : An Optimal Exploration Algorithm for Multi-Armed Bandits Kevin Jamieson Matthew Malloy Robert D. Nowak Sébastien Bubeck 49 411 0 27 Dec 2013
Estimating the Maximum Expected Value: An Analysis of (Nested) Cross Validation and the Maximum Sample Average H. V. Hasselt 36 28 0 28 Feb 2013
Kullback-Leibler upper confidence bounds for optimal sequential allocation Olivier Cappé Aurélien Garivier Odalric-Ambrym Maillard Rémi Munos Gilles Stoltz 70 394 0 03 Oct 2012
Thompson Sampling: An Asymptotically Optimal Finite Time Analysis E. Kaufmann N. Korda Rémi Munos 79 585 0 18 May 2012