1407.4443
Cited By
On the Complexity of Best Arm Identification in Multi-Armed Bandit Models
16 July 2014
E. Kaufmann
Olivier Cappé
Aurélien Garivier
Papers citing
"On the Complexity of Best Arm Identification in Multi-Armed Bandit Models"
50 / 147 papers shown
Title
Nearly Optimal Algorithms for Level Set Estimation
Blake Mason
Romain Camilleri
Subhojyoti Mukherjee
Kevin G. Jamieson
Robert D. Nowak
Lalit P. Jain
30
22
0
02 Nov 2021
Dealing With Misspecification In Fixed-Confidence Linear Top-m Identification
Clémence Réda
Andrea Tirinzoni
Rémy Degenne
31
9
0
02 Nov 2021
Collaborative Pure Exploration in Kernel Bandit
Yihan Du
Wei Chen
Yuko Kuroki
Longbo Huang
40
10
0
29 Oct 2021
A/B/n Testing with Control in the Presence of Subpopulations
Yoan Russac
C. Katsimerou
Dennis Bohle
Olivier Cappé
Aurélien Garivier
Wouter M. Koolen
24
25
0
29 Oct 2021
Policy Choice and Best Arm Identification: Asymptotic Analysis of Exploration Sampling
Kaito Ariu
Masahiro Kato
Junpei Komiyama
K. McAlinn
Chao Qin
65
24
0
16 Sep 2021
Near Instance Optimal Model Selection for Pure Exploration Linear Bandits
Yinglun Zhu
Julian Katz-Samuels
Robert D. Nowak
38
6
0
10 Sep 2021
Learning from an Exploring Demonstrator: Optimal Reward Estimation for Bandits
Wenshuo Guo
Kumar Krishna Agrawal
Aditya Grover
Vidya Muthukumar
A. Pananjady
16
8
0
28 Jun 2021
The Role of Contextual Information in Best Arm Identification
Masahiro Kato
Kaito Ariu
40
18
0
26 Jun 2021
Navigating to the Best Policy in Markov Decision Processes
Aymen Al Marjani
Aurélien Garivier
Alexandre Proutiere
35
21
0
05 Jun 2021
Learning to Detect an Odd Restless Markov Arm with a Trembling Hand
P. Karthik
R. Sundaresan
13
5
0
08 May 2021
Pure Exploration with Structured Preference Feedback
Shubham Gupta
Aadirupa Saha
S. Katariya
35
0
0
12 Apr 2021
Task-Optimal Exploration in Linear Dynamical Systems
Andrew Wagenmaker
Max Simchowitz
Kevin G. Jamieson
27
18
0
10 Feb 2021
Dynamics of coordinate ascent variational inference: A case study in 2D Ising models
Sean Plummer
D. Pati
A. Bhattacharya
38
18
0
13 Jul 2020
A Provably Efficient Sample Collection Strategy for Reinforcement Learning
Jean Tarbouriech
Matteo Pirotta
Michal Valko
A. Lazaric
OffRL
25
16
0
13 Jul 2020
Optimal Best-arm Identification in Linear Bandits
Yassir Jedra
Alexandre Proutiere
11
75
0
29 Jun 2020
An Empirical Process Approach to the Union Bound: Practical Algorithms for Combinatorial and Linear Bandits
Julian Katz-Samuels
Lalit P. Jain
Zohar Karnin
Kevin G. Jamieson
14
65
0
21 Jun 2020
Explicit Best Arm Identification in Linear Bandits Using No-Regret Learners
Mohammadi Zaki
Avinash Mohan
Aditya Gopalan
12
9
0
13 Jun 2020
Query complexity of heavy hitter estimation
Sahasrajit Sarmasarkar
Kota Srinivas Reddy
Nikhil Karamchandani
16
1
0
29 May 2020
Treatment recommendation with distributional targets
Anders Bredahl Kock
David Preinerstorfer
Bezirgen Veliyev
OffRL
6
7
0
19 May 2020
A Robust Experimental Evaluation of Automated Multi-Label Classification Methods
A. G. C. D. Sá
C. Pimenta
G. Pappa
A. Freitas
19
7
0
16 May 2020
Stopping criterion for active learning based on deterministic generalization bounds
Hideaki Ishibashi
H. Hino
16
29
0
15 May 2020
Detecting an Odd Restless Markov Arm with a Trembling Hand
P. Karthik
R. Sundaresan
10
6
0
13 May 2020
AMIL: Adversarial Multi Instance Learning for Human Pose Estimation
Pourya Shamsolmoali
Masoumeh Zareapoor
Huiyu Zhou
Jie Yang
GAN
20
6
0
18 Mar 2020
A unified framework for 21cm tomography sample generation and parameter inference with Progressively Growing GANs
Florian List
G. Lewis
16
14
0
19 Feb 2020
Reward-Free Exploration for Reinforcement Learning
Chi Jin
A. Krishnamurthy
Max Simchowitz
Tiancheng Yu
OffRL
112
194
0
07 Feb 2020
Improper Learning for Non-Stochastic Control
Max Simchowitz
Karan Singh
Elad Hazan
16
153
0
25 Jan 2020
Best Arm Identification for Cascading Bandits in the Fixed Confidence Setting
Zixin Zhong
Wang Chi Cheung
Vincent Y. F. Tan
37
8
0
23 Jan 2020
Sequential Mode Estimation with Oracle Queries
Dhruti Shah
Tuhinangshu Choudhury
Nikhil Karamchandani
Aditya Gopalan
21
6
0
19 Nov 2019
Learning Probably Approximately Correct Maximin Strategies in Simulation-Based Games with Infinite Strategy Spaces
A. Marchesi
F. Trovò
N. Gatti
13
18
0
18 Nov 2019
Sequential Controlled Sensing for Composite Multihypothesis Testing
Aditya Deshmukh
S. Bhashyam
V. Veeravalli
14
13
0
24 Oct 2019
Explaining and Interpreting LSTMs
L. Arras
Jose A. Arjona-Medina
Michael Widrich
G. Montavon
Michael Gillhofer
K. Müller
Sepp Hochreiter
Wojciech Samek
FAtt
AI4TS
21
79
0
25 Sep 2019
Swapped Face Detection using Deep Learning and Subjective Assessment
Xinyi Ding
Zohreh Raziei
Eric C. Larson
E. Olinick
P. Krueger
Michael Hahsler
PICV
CVBM
34
65
0
10 Sep 2019
From self-tuning regulators to reinforcement learning and back again
Nikolai Matni
Alexandre Proutiere
Anders Rantzer
Stephen Tu
27
88
0
27 Jun 2019
Sequential estimation of quantiles with applications to A/B-testing and best-arm identification
Steven R. Howard
Aaditya Ramdas
14
60
0
24 Jun 2019
Machine Learning Testing: Survey, Landscapes and Horizons
Jie M. Zhang
Mark Harman
Lei Ma
Yang Liu
VLM
AILaw
39
741
0
19 Jun 2019
Likelihood-free approximate Gibbs sampling
G. S. Rodrigues
David J. Nott
Scott A. Sisson
30
24
0
11 Jun 2019
Software and application patterns for explanation methods
Maximilian Alber
38
11
0
09 Apr 2019
Polynomial-time Algorithms for Multiple-arm Identification with Full-bandit Feedback
Yuko Kuroki
Liyuan Xu
Atsushi Miyauchi
Junya Honda
Masashi Sugiyama
25
17
0
27 Feb 2019
Multi-task Learning for Target-dependent Sentiment Classification
Divam Gupta
Kushagra Singh
Soumen Chakrabarti
Tanmoy Chakraborty
22
8
0
08 Feb 2019
Virtual Training for a Real Application: Accurate Object-Robot Relative Localization without Calibration
Vianney Loing
Renaud Marlet
Mathieu Aubry
29
23
0
07 Feb 2019
Prepaid parameter estimation without likelihoods
M. Mestdagh
S. Verdonck
Kristof Meers
Tim Loossens
F. Tuerlinckx
16
16
0
24 Dec 2018
Mixture Martingales Revisited with Applications to Sequential Tests and Confidence Intervals
E. Kaufmann
Wouter M. Koolen
21
117
0
28 Nov 2018
Understanding the Acceleration Phenomenon via High-Resolution Differential Equations
Bin Shi
S. Du
Michael I. Jordan
Weijie J. Su
17
254
0
21 Oct 2018
Explaining the Unique Nature of Individual Gait Patterns with Deep Learning
Fabian Horst
Sebastian Lapuschkin
Wojciech Samek
K. Müller
W. Schöllhorn
AI4CE
31
207
0
13 Aug 2018
PAC Battling Bandits in the Plackett-Luce Model
Aadirupa Saha
Aditya Gopalan
23
33
0
12 Aug 2018
Instance-Optimality in the Noisy Value-and Comparison-Model --- Accept, Accept, Strong Accept: Which Papers get in?
Vincent Cohen-Addad
Frederik Mallmann-Trenn
Claire Mathieu
11
10
0
21 Jun 2018
Causal Bandits with Propagating Inference
Akihiro Yabe
Daisuke Hatano
Hanna Sumita
Shinji Ito
Naonori Kakimura
Takuro Fukunaga
Ken-ichi Kawarabayashi
CML
11
32
0
06 Jun 2018
Sequential Test for the Lowest Mean: From Thompson to Murphy Sampling
E. Kaufmann
Wouter M. Koolen
Aurélien Garivier
16
25
0
04 Jun 2018
Exploration in Structured Reinforcement Learning
Jungseul Ok
Alexandre Proutiere
Damianos Tranos
25
62
0
03 Jun 2018
Probabilistic Formulations of Regression with Mixed Guidance
Aubrey Gress
Ian Davidson
3DV
8
4
0
01 Apr 2018