ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1407.4443
  4. Cited By
On the Complexity of Best Arm Identification in Multi-Armed Bandit
  Models

On the Complexity of Best Arm Identification in Multi-Armed Bandit Models

16 July 2014
E. Kaufmann
Olivier Cappé
Aurélien Garivier
ArXivPDFHTML

Papers citing "On the Complexity of Best Arm Identification in Multi-Armed Bandit Models"

47 / 147 papers shown
Title
Best arm identification in multi-armed bandits with delayed feedback
Best arm identification in multi-armed bandits with delayed feedback
Aditya Grover
Todor Markov
Patrick Attia
Norman Jin
Nicholas Perkins
...
M. Chen
Zi Yang
Stephen J. Harris
W. Chueh
Stefano Ermon
27
74
0
29 Mar 2018
Accelerated Gradient Boosting
Accelerated Gradient Boosting
Gérard Biau
B. Cadre
L. Rouviere
24
108
0
06 Mar 2018
Regional Multi-Armed Bandits
Regional Multi-Armed Bandits
Zhiyang Wang
Ruida Zhou
Cong Shen
21
18
0
22 Feb 2018
Adaptive Sampling for Coarse Ranking
Adaptive Sampling for Coarse Ranking
S. Katariya
Lalit P. Jain
Nandana Sengupta
James A. Evans
Robert D. Nowak
13
25
0
20 Feb 2018
Stochastic Multi-armed Bandits in Constant Space
Stochastic Multi-armed Bandits in Constant Space
David Liau
Eric Price
Zhao Song
Ger Yang
25
35
0
25 Dec 2017
Thresholding Bandit for Dose-ranging: The Impact of Monotonicity
Thresholding Bandit for Dose-ranging: The Impact of Monotonicity
Aurélien Garivier
Pierre Ménard
Laurent Rossi
Pierre Menard
22
27
0
13 Nov 2017
Building machines that adapt and compute like brains
Building machines that adapt and compute like brains
Brenden M. Lake
J. Tenenbaum
AI4CE
FedML
NAI
AILaw
254
889
0
11 Nov 2017
Minimal Exploration in Structured Stochastic Bandits
Minimal Exploration in Structured Stochastic Bandits
Richard Combes
Stefan Magureanu
Alexandre Proutiere
33
115
0
01 Nov 2017
Landmark Diffusion Maps (L-dMaps): Accelerated manifold learning
  out-of-sample extension
Landmark Diffusion Maps (L-dMaps): Accelerated manifold learning out-of-sample extension
Andrew W. Long
Andrew L. Ferguson
11
38
0
28 Jun 2017
A hybrid supervised/unsupervised machine learning approach to solar
  flare prediction
A hybrid supervised/unsupervised machine learning approach to solar flare prediction
F. Benvenuto
Michele Piana
C. Campi
A. Massone
9
63
0
21 Jun 2017
Monte-Carlo Tree Search by Best Arm Identification
Monte-Carlo Tree Search by Best Arm Identification
E. Kaufmann
Wouter M. Koolen
16
37
0
09 Jun 2017
Nearly Optimal Sampling Algorithms for Combinatorial Pure Exploration
Nearly Optimal Sampling Algorithms for Combinatorial Pure Exploration
Lijie Chen
Anupam Gupta
Jiacheng Li
Mingda Qiao
Ruosong Wang
22
47
0
04 Jun 2017
Improving the Expected Improvement Algorithm
Improving the Expected Improvement Algorithm
Chao Qin
Diego Klabjan
Daniel Russo
27
135
0
29 May 2017
Permutation Tests for Infection Graphs
Permutation Tests for Infection Graphs
Justin Khim
Po-Ling Loh
16
7
0
22 May 2017
Practical Algorithms for Best-K Identification in Multi-Armed Bandits
Practical Algorithms for Best-K Identification in Multi-Armed Bandits
Haotian Jiang
Jian Li
Mingda Qiao
11
14
0
19 May 2017
Frequentist Consistency of Variational Bayes
Frequentist Consistency of Variational Bayes
Yixin Wang
David M. Blei
BDL
37
204
0
09 May 2017
Inverse Reinforcement Learning from Summary Data
Inverse Reinforcement Learning from Summary Data
A. Kangasrääsiö
Samuel Kaski
OffRL
26
15
0
28 Mar 2017
Nearly Instance Optimal Sample Complexity Bounds for Top-k Arm Selection
Nearly Instance Optimal Sample Complexity Bounds for Top-k Arm Selection
Lijie Chen
Jian Li
Mingda Qiao
13
58
0
13 Feb 2017
Learning the distribution with largest mean: two bandit frameworks
Learning the distribution with largest mean: two bandit frameworks
E. Kaufmann
Aurélien Garivier
24
19
0
31 Jan 2017
What makes a gesture a gesture? Neural signatures involved in gesture
  recognition
What makes a gesture a gesture? Neural signatures involved in gesture recognition
M. E. Cabrera
K. Novak
D. Foti
Richard M. Voyles
J. Wachs
16
13
0
20 Jan 2017
Identifying Best Interventions through Online Importance Sampling
Identifying Best Interventions through Online Importance Sampling
Rajat Sen
Karthikeyan Shanmugam
A. Dimakis
Sanjay Shakkottai
31
72
0
10 Jan 2017
ChaLearn Looking at People: A Review of Events and Resources
ChaLearn Looking at People: A Review of Events and Resources
Sergio Escalera
Xavier Baro
Hugo Jair Escalante
Isabelle M Guyon
36
40
0
10 Jan 2017
Learning an Invariant Hilbert Space for Domain Adaptation
Learning an Invariant Hilbert Space for Domain Adaptation
Samitha Herath
Mehrtash Harandi
Fatih Porikli
16
107
0
25 Nov 2016
The Recycling Gibbs Sampler for Efficient Learning
The Recycling Gibbs Sampler for Efficient Learning
Luca Martino
Victor Elvira
Gustau Camps-Valls
36
30
0
21 Nov 2016
BET on Independence
BET on Independence
Kai Zhang
21
48
0
17 Oct 2016
Mixture model modal clustering
Mixture model modal clustering
José E. Chacón
9
31
0
15 Sep 2016
Mapping the Similarities of Spectra: Global and Locally-biased
  Approaches to SDSS Galaxy Data
Mapping the Similarities of Spectra: Global and Locally-biased Approaches to SDSS Galaxy Data
David Lawlor
T. Budavári
Michael W. Mahoney
14
12
0
13 Sep 2016
Linear Regression with an Unknown Permutation: Statistical and
  Computational Limits
Linear Regression with an Unknown Permutation: Statistical and Computational Limits
A. Pananjady
Martin J. Wainwright
T. Courtade
23
47
0
09 Aug 2016
Active Ranking from Pairwise Comparisons and when Parametric Assumptions
  Don't Help
Active Ranking from Pairwise Comparisons and when Parametric Assumptions Don't Help
Reinhard Heckel
Nihar B. Shah
Kannan Ramchandran
Martin J. Wainwright
19
10
0
28 Jun 2016
Identifying individual facial expressions by deconstructing a neural
  network
Identifying individual facial expressions by deconstructing a neural network
F. Arbabzadah
G. Montavon
K. Müller
Wojciech Samek
CVBM
FAtt
30
31
0
23 Jun 2016
Pure Exploration of Multi-armed Bandit Under Matroid Constraints
Pure Exploration of Multi-armed Bandit Under Matroid Constraints
Lijie Chen
Anupam Gupta
Jian Li
25
49
0
23 May 2016
A Decentralized Quasi-Newton Method for Dual Formulations of Consensus
  Optimization
A Decentralized Quasi-Newton Method for Dual Formulations of Consensus Optimization
Mark Eisen
Aryan Mokhtari
Alejandro Ribeiro
17
14
0
23 Mar 2016
Simple Bayesian Algorithms for Best Arm Identification
Simple Bayesian Algorithms for Best Arm Identification
Daniel Russo
25
273
0
26 Feb 2016
Explore First, Exploit Next: The True Shape of Regret in Bandit Problems
Explore First, Exploit Next: The True Shape of Regret in Bandit Problems
Aurélien Garivier
Pierre Ménard
Gilles Stoltz
19
210
0
23 Feb 2016
Maximin Action Identification: A New Bandit Framework for Games
Maximin Action Identification: A New Bandit Framework for Games
Aurélien Garivier
E. Kaufmann
Wouter M. Koolen
16
28
0
15 Feb 2016
Improved graph-based SFA: Information preservation complements the
  slowness principle
Improved graph-based SFA: Information preservation complements the slowness principle
Alberto N. Escalante
Laurenz Wiskott
16
16
0
15 Jan 2016
MERLiN: Mixture Effect Recovery in Linear Networks
MERLiN: Mixture Effect Recovery in Linear Networks
S. Weichwald
Moritz Grosse-Wentrup
Arthur Gretton
CML
25
7
0
03 Dec 2015
On the Optimal Sample Complexity for Best Arm Identification
On the Optimal Sample Complexity for Best Arm Identification
Lijie Chen
Jian Li
16
59
0
12 Nov 2015
Online Learning with Gaussian Payoffs and Side Observations
Online Learning with Gaussian Payoffs and Side Observations
Yifan Wu
András Gyorgy
Csaba Szepesvári
10
44
0
27 Oct 2015
Estimating network edge probabilities by neighborhood smoothing
Estimating network edge probabilities by neighborhood smoothing
Yuan Zhang
Elizaveta Levina
Ji Zhu
22
121
0
29 Sep 2015
Online Censoring for Large-Scale Regressions with Application to
  Streaming Big Data
Online Censoring for Large-Scale Regressions with Application to Streaming Big Data
Dimitris Berberidis
V. Kekatos
G. Giannakis
29
65
0
27 Jul 2015
Non-stochastic Best Arm Identification and Hyperparameter Optimization
Non-stochastic Best Arm Identification and Hyperparameter Optimization
Kevin G. Jamieson
Ameet Talwalkar
68
568
0
27 Feb 2015
Sparse Dueling Bandits
Sparse Dueling Bandits
Kevin G. Jamieson
S. Katariya
Atul Deshpande
Robert D. Nowak
21
64
0
31 Jan 2015
Matrix Completion under Interval Uncertainty
Matrix Completion under Interval Uncertainty
Jakub Mareˇcek
Peter Richtárik
Martin Takáč
48
19
0
11 Aug 2014
On the Optimality of Averaging in Distributed Statistical Learning
On the Optimality of Averaging in Distributed Statistical Learning
Jonathan D. Rosenblatt
B. Nadler
FedML
39
109
0
10 Jul 2014
Semi-Stochastic Gradient Descent Methods
Semi-Stochastic Gradient Descent Methods
Jakub Konecný
Peter Richtárik
ODL
62
237
0
05 Dec 2013
Bounded regret in stochastic multi-armed bandits
Bounded regret in stochastic multi-armed bandits
Sébastien Bubeck
Vianney Perchet
Philippe Rigollet
71
91
0
06 Feb 2013
Previous
123