On the Complexity of Best Arm Identification in Multi-Armed Bandit Models

16 July 2014

Papers citing "On the Complexity of Best Arm Identification in Multi-Armed Bandit Models"

47 / 147 papers shown

Title
Best arm identification in multi-armed bandits with delayed feedback Aditya Grover Todor Markov Patrick Attia Norman Jin Nicholas Perkins ... M. Chen Zi Yang Stephen J. Harris W. Chueh Stefano Ermon 27 74 0 29 Mar 2018
Accelerated Gradient Boosting Gérard Biau B. Cadre L. Rouviere 24 108 0 06 Mar 2018
Regional Multi-Armed Bandits Zhiyang Wang Ruida Zhou Cong Shen 21 18 0 22 Feb 2018
Adaptive Sampling for Coarse Ranking S. Katariya Lalit P. Jain Nandana Sengupta James A. Evans Robert D. Nowak 13 25 0 20 Feb 2018
Stochastic Multi-armed Bandits in Constant Space David Liau Eric Price Zhao Song Ger Yang 25 35 0 25 Dec 2017
Thresholding Bandit for Dose-ranging: The Impact of Monotonicity Aurélien Garivier Pierre Ménard Laurent Rossi Pierre Menard 22 27 0 13 Nov 2017
Building machines that adapt and compute like brains Brenden M. Lake J. Tenenbaum AI4CE FedML NAI AILaw 254 889 0 11 Nov 2017
Minimal Exploration in Structured Stochastic Bandits Richard Combes Stefan Magureanu Alexandre Proutiere 33 115 0 01 Nov 2017
Landmark Diffusion Maps (L-dMaps): Accelerated manifold learning out-of-sample extension Andrew W. Long Andrew L. Ferguson 11 38 0 28 Jun 2017
A hybrid supervised/unsupervised machine learning approach to solar flare prediction F. Benvenuto Michele Piana C. Campi A. Massone 9 63 0 21 Jun 2017
Monte-Carlo Tree Search by Best Arm Identification E. Kaufmann Wouter M. Koolen 16 37 0 09 Jun 2017
Nearly Optimal Sampling Algorithms for Combinatorial Pure Exploration Lijie Chen Anupam Gupta Jiacheng Li Mingda Qiao Ruosong Wang 22 47 0 04 Jun 2017
Improving the Expected Improvement Algorithm Chao Qin Diego Klabjan Daniel Russo 27 135 0 29 May 2017
Permutation Tests for Infection Graphs Justin Khim Po-Ling Loh 16 7 0 22 May 2017
Practical Algorithms for Best-K Identification in Multi-Armed Bandits Haotian Jiang Jian Li Mingda Qiao 11 14 0 19 May 2017
Frequentist Consistency of Variational Bayes Yixin Wang David M. Blei BDL 37 204 0 09 May 2017
Inverse Reinforcement Learning from Summary Data A. Kangasrääsiö Samuel Kaski OffRL 26 15 0 28 Mar 2017
Nearly Instance Optimal Sample Complexity Bounds for Top-k Arm Selection Lijie Chen Jian Li Mingda Qiao 13 58 0 13 Feb 2017
Learning the distribution with largest mean: two bandit frameworks E. Kaufmann Aurélien Garivier 24 19 0 31 Jan 2017
What makes a gesture a gesture? Neural signatures involved in gesture recognition M. E. Cabrera K. Novak D. Foti Richard M. Voyles J. Wachs 16 13 0 20 Jan 2017
Identifying Best Interventions through Online Importance Sampling Rajat Sen Karthikeyan Shanmugam A. Dimakis Sanjay Shakkottai 31 72 0 10 Jan 2017
ChaLearn Looking at People: A Review of Events and Resources Sergio Escalera Xavier Baro Hugo Jair Escalante Isabelle M Guyon 36 40 0 10 Jan 2017
Learning an Invariant Hilbert Space for Domain Adaptation Samitha Herath Mehrtash Harandi Fatih Porikli 16 107 0 25 Nov 2016
The Recycling Gibbs Sampler for Efficient Learning Luca Martino Victor Elvira Gustau Camps-Valls 36 30 0 21 Nov 2016
BET on Independence Kai Zhang 21 48 0 17 Oct 2016
Mixture model modal clustering José E. Chacón 9 31 0 15 Sep 2016
Mapping the Similarities of Spectra: Global and Locally-biased Approaches to SDSS Galaxy Data David Lawlor T. Budavári Michael W. Mahoney 14 12 0 13 Sep 2016
Linear Regression with an Unknown Permutation: Statistical and Computational Limits A. Pananjady Martin J. Wainwright T. Courtade 23 47 0 09 Aug 2016
Active Ranking from Pairwise Comparisons and when Parametric Assumptions Don't Help Reinhard Heckel Nihar B. Shah Kannan Ramchandran Martin J. Wainwright 19 10 0 28 Jun 2016
Identifying individual facial expressions by deconstructing a neural network F. Arbabzadah G. Montavon K. Müller Wojciech Samek CVBM FAtt 30 31 0 23 Jun 2016
Pure Exploration of Multi-armed Bandit Under Matroid Constraints Lijie Chen Anupam Gupta Jian Li 25 49 0 23 May 2016
A Decentralized Quasi-Newton Method for Dual Formulations of Consensus Optimization Mark Eisen Aryan Mokhtari Alejandro Ribeiro 17 14 0 23 Mar 2016
Simple Bayesian Algorithms for Best Arm Identification Daniel Russo 25 273 0 26 Feb 2016
Explore First, Exploit Next: The True Shape of Regret in Bandit Problems Aurélien Garivier Pierre Ménard Gilles Stoltz 19 210 0 23 Feb 2016
Maximin Action Identification: A New Bandit Framework for Games Aurélien Garivier E. Kaufmann Wouter M. Koolen 16 28 0 15 Feb 2016
Improved graph-based SFA: Information preservation complements the slowness principle Alberto N. Escalante Laurenz Wiskott 16 16 0 15 Jan 2016
MERLiN: Mixture Effect Recovery in Linear Networks S. Weichwald Moritz Grosse-Wentrup Arthur Gretton CML 25 7 0 03 Dec 2015
On the Optimal Sample Complexity for Best Arm Identification Lijie Chen Jian Li 16 59 0 12 Nov 2015
Online Learning with Gaussian Payoffs and Side Observations Yifan Wu András Gyorgy Csaba Szepesvári 10 44 0 27 Oct 2015
Estimating network edge probabilities by neighborhood smoothing Yuan Zhang Elizaveta Levina Ji Zhu 22 121 0 29 Sep 2015
Online Censoring for Large-Scale Regressions with Application to Streaming Big Data Dimitris Berberidis V. Kekatos G. Giannakis 29 65 0 27 Jul 2015
Non-stochastic Best Arm Identification and Hyperparameter Optimization Kevin G. Jamieson Ameet Talwalkar 68 568 0 27 Feb 2015
Sparse Dueling Bandits Kevin G. Jamieson S. Katariya Atul Deshpande Robert D. Nowak 21 64 0 31 Jan 2015
Matrix Completion under Interval Uncertainty Jakub Mareˇcek Peter Richtárik Martin Takáč 48 19 0 11 Aug 2014
On the Optimality of Averaging in Distributed Statistical Learning Jonathan D. Rosenblatt B. Nadler FedML 39 109 0 10 Jul 2014
Semi-Stochastic Gradient Descent Methods Jakub Konecný Peter Richtárik ODL 62 237 0 05 Dec 2013
Bounded regret in stochastic multi-armed bandits Sébastien Bubeck Vianney Perchet Philippe Rigollet 71 91 0 06 Feb 2013