On the Complexity of Best Arm Identification in Multi-Armed Bandit Models

16 July 2014

Papers citing "On the Complexity of Best Arm Identification in Multi-Armed Bandit Models"

50 / 147 papers shown

Title
Nearly Optimal Algorithms for Level Set Estimation Blake Mason Romain Camilleri Subhojyoti Mukherjee Kevin G. Jamieson Robert D. Nowak Lalit P. Jain 30 22 0 02 Nov 2021
Dealing With Misspecification In Fixed-Confidence Linear Top-m Identification Clémence Réda Andrea Tirinzoni Rémy Degenne 31 9 0 02 Nov 2021
Collaborative Pure Exploration in Kernel Bandit Yihan Du Wei Chen Yuko Kuroki Longbo Huang 40 10 0 29 Oct 2021
A/B/n Testing with Control in the Presence of Subpopulations Yoan Russac C. Katsimerou Dennis Bohle Olivier Cappé Aurélien Garivier Wouter M. Koolen 24 25 0 29 Oct 2021
Policy Choice and Best Arm Identification: Asymptotic Analysis of Exploration Sampling Kaito Ariu Masahiro Kato Junpei Komiyama K. McAlinn Chao Qin 65 24 0 16 Sep 2021
Near Instance Optimal Model Selection for Pure Exploration Linear Bandits Yinglun Zhu Julian Katz-Samuels Robert D. Nowak 38 6 0 10 Sep 2021
Learning from an Exploring Demonstrator: Optimal Reward Estimation for Bandits Wenshuo Guo Kumar Krishna Agrawal Aditya Grover Vidya Muthukumar A. Pananjady 16 8 0 28 Jun 2021
The Role of Contextual Information in Best Arm Identification Masahiro Kato Kaito Ariu 40 18 0 26 Jun 2021
Navigating to the Best Policy in Markov Decision Processes Aymen Al Marjani Aurélien Garivier Alexandre Proutiere 35 21 0 05 Jun 2021
Learning to Detect an Odd Restless Markov Arm with a Trembling Hand P. Karthik R. Sundaresan 13 5 0 08 May 2021
Pure Exploration with Structured Preference Feedback Shubham Gupta Aadirupa Saha S. Katariya 35 0 0 12 Apr 2021
Task-Optimal Exploration in Linear Dynamical Systems Andrew Wagenmaker Max Simchowitz Kevin G. Jamieson 27 18 0 10 Feb 2021
Dynamics of coordinate ascent variational inference: A case study in 2D Ising models Sean Plummer D. Pati A. Bhattacharya 38 18 0 13 Jul 2020
A Provably Efficient Sample Collection Strategy for Reinforcement Learning Jean Tarbouriech Matteo Pirotta Michal Valko A. Lazaric OffRL 25 16 0 13 Jul 2020
Optimal Best-arm Identification in Linear Bandits Yassir Jedra Alexandre Proutiere 11 75 0 29 Jun 2020
An Empirical Process Approach to the Union Bound: Practical Algorithms for Combinatorial and Linear Bandits Julian Katz-Samuels Lalit P. Jain Zohar Karnin Kevin G. Jamieson 14 65 0 21 Jun 2020
Explicit Best Arm Identification in Linear Bandits Using No-Regret Learners Mohammadi Zaki Avinash Mohan Aditya Gopalan 12 9 0 13 Jun 2020
Query complexity of heavy hitter estimation Sahasrajit Sarmasarkar Kota Srinivas Reddy Nikhil Karamchandani 16 1 0 29 May 2020
Treatment recommendation with distributional targets Anders Bredahl Kock David Preinerstorfer Bezirgen Veliyev OffRL 6 7 0 19 May 2020
A Robust Experimental Evaluation of Automated Multi-Label Classification Methods A. G. C. D. Sá C. Pimenta G. Pappa A. Freitas 19 7 0 16 May 2020
Stopping criterion for active learning based on deterministic generalization bounds Hideaki Ishibashi H. Hino 16 29 0 15 May 2020
Detecting an Odd Restless Markov Arm with a Trembling Hand P. Karthik R. Sundaresan 10 6 0 13 May 2020
AMIL: Adversarial Multi Instance Learning for Human Pose Estimation Pourya Shamsolmoali Masoumeh Zareapoor Huiyu Zhou Jie Yang GAN 20 6 0 18 Mar 2020
A unified framework for 21cm tomography sample generation and parameter inference with Progressively Growing GANs Florian List G. Lewis 16 14 0 19 Feb 2020
Reward-Free Exploration for Reinforcement Learning Chi Jin A. Krishnamurthy Max Simchowitz Tiancheng Yu OffRL 112 194 0 07 Feb 2020
Improper Learning for Non-Stochastic Control Max Simchowitz Karan Singh Elad Hazan 16 153 0 25 Jan 2020
Best Arm Identification for Cascading Bandits in the Fixed Confidence Setting Zixin Zhong Wang Chi Cheung Vincent Y. F. Tan 37 8 0 23 Jan 2020
Sequential Mode Estimation with Oracle Queries Dhruti Shah Tuhinangshu Choudhury Nikhil Karamchandani Aditya Gopalan 21 6 0 19 Nov 2019
Learning Probably Approximately Correct Maximin Strategies in Simulation-Based Games with Infinite Strategy Spaces A. Marchesi F. Trovò N. Gatti 13 18 0 18 Nov 2019
Sequential Controlled Sensing for Composite Multihypothesis Testing Aditya Deshmukh S. Bhashyam V. Veeravalli 14 13 0 24 Oct 2019
Explaining and Interpreting LSTMs L. Arras Jose A. Arjona-Medina Michael Widrich G. Montavon Michael Gillhofer K. Müller Sepp Hochreiter Wojciech Samek FAtt AI4TS 21 79 0 25 Sep 2019
Swapped Face Detection using Deep Learning and Subjective Assessment Xinyi Ding Zohreh Raziei Eric C. Larson E. Olinick P. Krueger Michael Hahsler PICV CVBM 34 65 0 10 Sep 2019
From self-tuning regulators to reinforcement learning and back again Nikolai Matni Alexandre Proutiere Anders Rantzer Stephen Tu 27 88 0 27 Jun 2019
Sequential estimation of quantiles with applications to A/B-testing and best-arm identification Steven R. Howard Aaditya Ramdas 14 60 0 24 Jun 2019
Machine Learning Testing: Survey, Landscapes and Horizons Jie M. Zhang Mark Harman Lei Ma Yang Liu VLM AILaw 39 741 0 19 Jun 2019
Likelihood-free approximate Gibbs sampling G. S. Rodrigues David J. Nott Scott A. Sisson 30 24 0 11 Jun 2019
Software and application patterns for explanation methods Maximilian Alber 38 11 0 09 Apr 2019
Polynomial-time Algorithms for Multiple-arm Identification with Full-bandit Feedback Yuko Kuroki Liyuan Xu Atsushi Miyauchi Junya Honda Masashi Sugiyama 25 17 0 27 Feb 2019
Multi-task Learning for Target-dependent Sentiment Classification Divam Gupta Kushagra Singh Soumen Chakrabarti Tanmoy Chakraborty 22 8 0 08 Feb 2019
Virtual Training for a Real Application: Accurate Object-Robot Relative Localization without Calibration Vianney Loing Renaud Marlet Mathieu Aubry 29 23 0 07 Feb 2019
Prepaid parameter estimation without likelihoods M. Mestdagh S. Verdonck Kristof Meers Tim Loossens F. Tuerlinckx 16 16 0 24 Dec 2018
Mixture Martingales Revisited with Applications to Sequential Tests and Confidence Intervals E. Kaufmann Wouter M. Koolen 21 117 0 28 Nov 2018
Understanding the Acceleration Phenomenon via High-Resolution Differential Equations Bin Shi S. Du Michael I. Jordan Weijie J. Su 17 254 0 21 Oct 2018
Explaining the Unique Nature of Individual Gait Patterns with Deep Learning Fabian Horst Sebastian Lapuschkin Wojciech Samek K. Müller W. Schöllhorn AI4CE 31 207 0 13 Aug 2018
PAC Battling Bandits in the Plackett-Luce Model Aadirupa Saha Aditya Gopalan 23 33 0 12 Aug 2018
Instance-Optimality in the Noisy Value-and Comparison-Model --- Accept, Accept, Strong Accept: Which Papers get in? Vincent Cohen-Addad Frederik Mallmann-Trenn Claire Mathieu 11 10 0 21 Jun 2018
Causal Bandits with Propagating Inference Akihiro Yabe Daisuke Hatano Hanna Sumita Shinji Ito Naonori Kakimura Takuro Fukunaga Ken-ichi Kawarabayashi CML 11 32 0 06 Jun 2018
Sequential Test for the Lowest Mean: From Thompson to Murphy Sampling E. Kaufmann Wouter M. Koolen Aurélien Garivier 16 25 0 04 Jun 2018
Exploration in Structured Reinforcement Learning Jungseul Ok Alexandre Proutiere Damianos Tranos 25 62 0 03 Jun 2018
Probabilistic Formulations of Regression with Mixed Guidance Aubrey Gress Ian Davidson 3DV 8 4 0 01 Apr 2018