
Taming the Monster: A Fast and Simple Algorithm for Contextual Bandits

4 February 2014

Papers citing "Taming the Monster: A Fast and Simple Algorithm for Contextual Bandits"

50 / 202 papers shown

Title
Contextual Search via Intrinsic Volumes R. Leme Jon Schneider 125 43 0 09 Apr 2018
Semiparametric Contextual Bandits A. Krishnamurthy Zhiwei Steven Wu Vasilis Syrgkanis 148 45 0 12 Mar 2018
Multi-objective Contextual Bandit Problem with Similarity Information E. Turğay Doruk Öner Cem Tekin 66 37 0 11 Mar 2018
Enhancing Evolutionary Conversion Rate Optimization via Multi-armed Bandit Algorithms Xin Qiu Risto Miikkulainen 15 4 0 10 Mar 2018
Practical Contextual Bandits with Regression Oracles Dylan J. Foster Alekh Agarwal Miroslav Dudík Haipeng Luo Robert Schapire 406 127 0 03 Mar 2018
On Oracle-Efficient PAC RL with Rich Observations Christoph Dann Nan Jiang A. Krishnamurthy Alekh Agarwal John Langford Robert Schapire 60 98 0 01 Mar 2018
Deep Bayesian Bandits Showdown: An Empirical Comparison of Bayesian Deep Networks for Thompson Sampling C. Riquelme George Tucker Jasper Snoek BDL 93 366 0 26 Feb 2018
Active Learning with Logged Data Songbai Yan Kamalika Chaudhuri T. Javidi 159 27 0 25 Feb 2018
A Contextual Bandit Bake-off A. Bietti Alekh Agarwal John Langford 433 105 0 12 Feb 2018
Learning to Bid Without Knowing your Value Zhe Feng Chara Podimata Vasilis Syrgkanis 119 57 0 03 Nov 2017
Multi-objective Contextual Multi-armed Bandit with a Dominant Objective Cem Tekin E. Turğay 89 37 0 18 Aug 2017
Efficient Contextual Bandits in Non-stationary Worlds Haipeng Luo Chen-Yu Wei Alekh Agarwal John Langford 91 133 0 05 Aug 2017
Pyramid: Enhancing Selectivity in Big Data Protection with Count Featurization Mathias Lécuyer Riley Spahn Roxana Geambasu Tzu-Kuo Huang S. Sen 24 9 0 21 May 2017
Mapping Instructions and Visual Observations to Actions with Reinforcement Learning Dipendra Kumar Misra John Langford Yoav Artzi 86 247 0 28 Apr 2017
Unifying PAC and Regret: Uniform PAC Bounds for Episodic Reinforcement Learning Christoph Dann Tor Lattimore Emma Brunskill 109 311 0 22 Mar 2017
Effective Evaluation using Logged Bandit Feedback from Multiple Loggers Aman Agarwal Soumya Basu Tobias Schnabel Thorsten Joachims OffRL 148 69 0 17 Mar 2017
Active Learning for Cost-Sensitive Classification A. Krishnamurthy Alekh Agarwal Tzu-Kuo Huang Hal Daumé John Langford 278 80 0 03 Mar 2017
Provably Optimal Algorithms for Generalized Linear Contextual Bandits Lihong Li Yu Lu Dengyong Zhou 184 94 0 28 Feb 2017
Efficient Online Bandit Multiclass Learning with $\tilde{O}(\sqrt{T})$ Regret A. Beygelzimer Francesco Orabona Chicheng Zhang 48 20 0 25 Feb 2017
Policy Learning with Observational Data Susan Athey Stefan Wager CML OffRL 459 183 0 09 Feb 2017
Learning causal effects from many randomized experiments using regularized instrumental variables A. Peysakhovich Dean Eckles CML 118 23 0 04 Jan 2017
Corralling a Band of Bandit Algorithms Alekh Agarwal Haipeng Luo Behnam Neyshabur Robert Schapire 166 157 0 19 Dec 2016
Oracle-Efficient Online Learning and Auction Design Miroslav Dudík Nika Haghtalab Haipeng Luo Robert Schapire Vasilis Syrgkanis Jennifer Wortman Vaughan 105 62 0 05 Nov 2016
Multidimensional Binary Search for Contextual Decision-Making Ilan Lobel R. Leme Adrian Vladu 90 62 0 02 Nov 2016
Fair Algorithms for Infinite and Contextual Bandits Matthew Joseph Michael Kearns Jamie Morgenstern Seth Neel Aaron Roth FedML FaML 93 56 0 29 Oct 2016
Contextual Decision Processes with Low Bellman Rank are PAC-Learnable Nan Jiang A. Krishnamurthy Alekh Agarwal John Langford Robert Schapire 193 421 0 29 Oct 2016
Online Learning Schemes for Power Allocation in Energy Harvesting Communications Pranav Sakulkar Bhaskar Krishnamachari OffRL 49 19 0 08 Jul 2016
Concrete Problems in AI Safety Dario Amodei C. Olah Jacob Steinhardt Paul Christiano John Schulman Dandelion Mané 312 2,406 0 21 Jun 2016
Learning Optimal Interventions Jonas W. Mueller David N. Reshef George Du Tommi Jaakkola 40 9 0 16 Jun 2016
Making Contextual Decisions with Low Technical Debt Alekh Agarwal Sarah Bird Markus Cozowicz Luong Hoang John Langford ... Dan Melamed Gal Oshri Oswaldo Ribas S. Sen Alex Slivkins OffRL 161 34 0 13 Jun 2016
Causal Bandits: Learning Good Interventions via Causal Inference Finnian Lattimore Tor Lattimore Mark D. Reid CML 62 163 0 10 Jun 2016
Improved Regret Bounds for Oracle-Based Adversarial Contextual Bandits Vasilis Syrgkanis Haipeng Luo A. Krishnamurthy Robert Schapire 167 42 0 01 Jun 2016
Fairness in Learning: Classic and Contextual Bandits Matthew Joseph Michael Kearns Jamie Morgenstern Aaron Roth FaML 81 477 0 23 May 2016
Stochastic Contextual Bandits with Known Reward Functions Pranav Sakulkar Bhaskar Krishnamachari 51 14 0 30 Apr 2016
Latent Contextual Bandits and their Application to Personalized Recommendations for New Users Li Zhou Emma Brunskill 88 62 0 22 Apr 2016
Bayesian Exploration: Incentivizing Exploration in Bayesian Games Yishay Mansour Aleksandrs Slivkins Vasilis Syrgkanis Zhiwei Steven Wu 91 105 0 24 Feb 2016
Efficient Algorithms for Adversarial Contextual Learning Vasilis Syrgkanis A. Krishnamurthy Robert Schapire 179 80 0 08 Feb 2016
BISTRO: An Efficient Relaxation-Based Method for Contextual Bandits Alexander Rakhlin Karthik Sridharan OffRL 395 72 0 06 Feb 2016
A Survey of Online Experiment Design with the Stochastic Multi-Armed Bandit Giuseppe Burtini Jason L. Loeppky Ramon Lawrence 88 119 0 02 Oct 2015
A Survey on Contextual Multi-armed Bandits Li Zhou 98 127 0 13 Aug 2015
Linear Contextual Bandits with Knapsacks Shipra Agrawal Nikhil R. Devanur 192 144 0 24 Jul 2015
On the Prior Sensitivity of Thompson Sampling Che-Yu Liu Lihong Li 90 25 0 10 Jun 2015
An efficient algorithm for contextual bandits with knapsacks, and an extension to concave objectives Shipra Agrawal Nikhil R. Devanur Lihong Li 122 91 0 10 Jun 2015
Random Forest for the Contextual Bandit Problem - extended version Raphael Feraud Robin Allesiardo Tanguy Urvoy Fabrice Clérot 180 48 0 27 Apr 2015
Algorithms with Logarithmic or Sublinear Regret for Constrained Contextual Bandits Huasen Wu R. Srikant Xin Liu Chong Jiang 118 95 0 27 Apr 2015
The Computational Power of Optimization in Online Learning Elad Hazan Tomer Koren 211 69 0 08 Apr 2015
Contextual Dueling Bandits Miroslav Dudík Katja Hofmann Robert Schapire Aleksandrs Slivkins M. Zoghi 125 127 0 23 Feb 2015
Contextual Semibandits via Supervised Learning Oracles A. Krishnamurthy Alekh Agarwal Miroslav Dudík OffRL 189 21 0 20 Feb 2015
Learning Reductions that Really Work A. Beygelzimer Hal Daumé John Langford Paul Mineiro AI4CE 107 24 0 09 Feb 2015
Counterfactual Risk Minimization: Learning from Logged Bandit Feedback Adith Swaminathan Thorsten Joachims OffRL 158 167 0 09 Feb 2015