ResearchTrend.AI
On the Complexity of Best Arm Identification in Multi-Armed Bandit Models

16 July 2014
E. Kaufmann
Olivier Cappé
Aurélien Garivier

Papers citing "On the Complexity of Best Arm Identification in Multi-Armed Bandit Models"

50 / 147 papers shown
Nearly Optimal Algorithms for Level Set Estimation
Blake Mason
Romain Camilleri
Subhojyoti Mukherjee
Kevin G. Jamieson
Robert D. Nowak
Lalit P. Jain
30
22
0
02 Nov 2021
Dealing With Misspecification In Fixed-Confidence Linear Top-m Identification
Clémence Réda
Andrea Tirinzoni
Rémy Degenne
31
9
0
02 Nov 2021
Collaborative Pure Exploration in Kernel Bandit
Yihan Du
Wei Chen
Yuko Kuroki
Longbo Huang
40
10
0
29 Oct 2021
A/B/n Testing with Control in the Presence of Subpopulations
Yoan Russac
C. Katsimerou
Dennis Bohle
Olivier Cappé
Aurélien Garivier
Wouter M. Koolen
24
25
0
29 Oct 2021
Policy Choice and Best Arm Identification: Asymptotic Analysis of Exploration Sampling
Kaito Ariu
Masahiro Kato
Junpei Komiyama
K. McAlinn
Chao Qin
65
24
0
16 Sep 2021
Near Instance Optimal Model Selection for Pure Exploration Linear Bandits
Yinglun Zhu
Julian Katz-Samuels
Robert D. Nowak
38
6
0
10 Sep 2021
Learning from an Exploring Demonstrator: Optimal Reward Estimation for Bandits
Wenshuo Guo
Kumar Krishna Agrawal
Aditya Grover
Vidya Muthukumar
A. Pananjady
16
8
0
28 Jun 2021
The Role of Contextual Information in Best Arm Identification
Masahiro Kato
Kaito Ariu
40
18
0
26 Jun 2021
Navigating to the Best Policy in Markov Decision Processes
Aymen Al Marjani
Aurélien Garivier
Alexandre Proutiere
35
21
0
05 Jun 2021
Learning to Detect an Odd Restless Markov Arm with a Trembling Hand
P. Karthik
R. Sundaresan
13
5
0
08 May 2021
Pure Exploration with Structured Preference Feedback
Shubham Gupta
Aadirupa Saha
S. Katariya
35
0
0
12 Apr 2021
Task-Optimal Exploration in Linear Dynamical Systems
Andrew Wagenmaker
Max Simchowitz
Kevin G. Jamieson
27
18
0
10 Feb 2021
Dynamics of coordinate ascent variational inference: A case study in 2D Ising models
Sean Plummer
D. Pati
A. Bhattacharya
38
18
0
13 Jul 2020
A Provably Efficient Sample Collection Strategy for Reinforcement Learning
Jean Tarbouriech
Matteo Pirotta
Michal Valko
A. Lazaric
OffRL
25
16
0
13 Jul 2020
Optimal Best-arm Identification in Linear Bandits
Yassir Jedra
Alexandre Proutiere
11
75
0
29 Jun 2020
An Empirical Process Approach to the Union Bound: Practical Algorithms for Combinatorial and Linear Bandits
Julian Katz-Samuels
Lalit P. Jain
Zohar Karnin
Kevin G. Jamieson
14
65
0
21 Jun 2020
Explicit Best Arm Identification in Linear Bandits Using No-Regret Learners
Mohammadi Zaki
Avinash Mohan
Aditya Gopalan
12
9
0
13 Jun 2020
Query complexity of heavy hitter estimation
Sahasrajit Sarmasarkar
Kota Srinivas Reddy
Nikhil Karamchandani
16
1
0
29 May 2020
Treatment recommendation with distributional targets
Anders Bredahl Kock
David Preinerstorfer
Bezirgen Veliyev
OffRL
6
7
0
19 May 2020
A Robust Experimental Evaluation of Automated Multi-Label Classification Methods
A. G. C. D. Sá
C. Pimenta
G. Pappa
A. Freitas
19
7
0
16 May 2020
Stopping criterion for active learning based on deterministic generalization bounds
Hideaki Ishibashi
H. Hino
16
29
0
15 May 2020
Detecting an Odd Restless Markov Arm with a Trembling Hand
P. Karthik
R. Sundaresan
10
6
0
13 May 2020
AMIL: Adversarial Multi Instance Learning for Human Pose Estimation
Pourya Shamsolmoali
Masoumeh Zareapoor
Huiyu Zhou
Jie Yang
GAN
20
6
0
18 Mar 2020
A unified framework for 21cm tomography sample generation and parameter inference with Progressively Growing GANs
Florian List
G. Lewis
16
14
0
19 Feb 2020
Reward-Free Exploration for Reinforcement Learning
Chi Jin
A. Krishnamurthy
Max Simchowitz
Tiancheng Yu
OffRL
112
194
0
07 Feb 2020
Improper Learning for Non-Stochastic Control
Max Simchowitz
Karan Singh
Elad Hazan
16
153
0
25 Jan 2020
Best Arm Identification for Cascading Bandits in the Fixed Confidence Setting
Zixin Zhong
Wang Chi Cheung
Vincent Y. F. Tan
37
8
0
23 Jan 2020
Sequential Mode Estimation with Oracle Queries
Dhruti Shah
Tuhinangshu Choudhury
Nikhil Karamchandani
Aditya Gopalan
21
6
0
19 Nov 2019
Learning Probably Approximately Correct Maximin Strategies in Simulation-Based Games with Infinite Strategy Spaces
A. Marchesi
F. Trovò
N. Gatti
13
18
0
18 Nov 2019
Sequential Controlled Sensing for Composite Multihypothesis Testing
Aditya Deshmukh
S. Bhashyam
V. Veeravalli
14
13
0
24 Oct 2019
Explaining and Interpreting LSTMs
L. Arras
Jose A. Arjona-Medina
Michael Widrich
G. Montavon
Michael Gillhofer
K. Müller
Sepp Hochreiter
Wojciech Samek
FAtt
AI4TS
21
79
0
25 Sep 2019
Swapped Face Detection using Deep Learning and Subjective Assessment
Xinyi Ding
Zohreh Raziei
Eric C. Larson
E. Olinick
P. Krueger
Michael Hahsler
PICV
CVBM
34
65
0
10 Sep 2019
From self-tuning regulators to reinforcement learning and back again
Nikolai Matni
Alexandre Proutiere
Anders Rantzer
Stephen Tu
27
88
0
27 Jun 2019
Sequential estimation of quantiles with applications to A/B-testing and best-arm identification
Steven R. Howard
Aaditya Ramdas
14
60
0
24 Jun 2019
Machine Learning Testing: Survey, Landscapes and Horizons
Jie M. Zhang
Mark Harman
Lei Ma
Yang Liu
VLM
AILaw
39
741
0
19 Jun 2019
Likelihood-free approximate Gibbs sampling
G. S. Rodrigues
David J. Nott
Scott A. Sisson
30
24
0
11 Jun 2019
Software and application patterns for explanation methods
Maximilian Alber
38
11
0
09 Apr 2019
Polynomial-time Algorithms for Multiple-arm Identification with Full-bandit Feedback
Yuko Kuroki
Liyuan Xu
Atsushi Miyauchi
Junya Honda
Masashi Sugiyama
25
17
0
27 Feb 2019
Multi-task Learning for Target-dependent Sentiment Classification
Divam Gupta
Kushagra Singh
Soumen Chakrabarti
Tanmoy Chakraborty
22
8
0
08 Feb 2019
Virtual Training for a Real Application: Accurate Object-Robot Relative Localization without Calibration
Vianney Loing
Renaud Marlet
Mathieu Aubry
29
23
0
07 Feb 2019
Prepaid parameter estimation without likelihoods
M. Mestdagh
S. Verdonck
Kristof Meers
Tim Loossens
F. Tuerlinckx
16
16
0
24 Dec 2018
Mixture Martingales Revisited with Applications to Sequential Tests and Confidence Intervals
E. Kaufmann
Wouter M. Koolen
21
117
0
28 Nov 2018
Understanding the Acceleration Phenomenon via High-Resolution Differential Equations
Bin Shi
S. Du
Michael I. Jordan
Weijie J. Su
17
254
0
21 Oct 2018
Explaining the Unique Nature of Individual Gait Patterns with Deep Learning
Fabian Horst
Sebastian Lapuschkin
Wojciech Samek
K. Müller
W. Schöllhorn
AI4CE
31
207
0
13 Aug 2018
PAC Battling Bandits in the Plackett-Luce Model
Aadirupa Saha
Aditya Gopalan
23
33
0
12 Aug 2018
Instance-Optimality in the Noisy Value-and Comparison-Model --- Accept, Accept, Strong Accept: Which Papers get in?
Vincent Cohen-Addad
Frederik Mallmann-Trenn
Claire Mathieu
11
10
0
21 Jun 2018
Causal Bandits with Propagating Inference
Akihiro Yabe
Daisuke Hatano
Hanna Sumita
Shinji Ito
Naonori Kakimura
Takuro Fukunaga
Ken-ichi Kawarabayashi
CML
11
32
0
06 Jun 2018
Sequential Test for the Lowest Mean: From Thompson to Murphy Sampling
E. Kaufmann
Wouter M. Koolen
Aurélien Garivier
16
25
0
04 Jun 2018
Exploration in Structured Reinforcement Learning
Jungseul Ok
Alexandre Proutiere
Damianos Tranos
25
62
0
03 Jun 2018
Probabilistic Formulations of Regression with Mixed Guidance
Aubrey Gress
Ian Davidson
3DV
8
4
0
01 Apr 2018