v1v2v3v4v5 (latest)

A Contextual Bandit Bake-off

12 February 2018

Papers citing "A Contextual Bandit Bake-off"

22 / 22 papers shown

Title
Greedy Algorithm for Structured Bandits: A Sharp Characterization of Asymptotic Success / Failure Aleksandrs Slivkins Yunzong Xu Shiliang Zuo 519 1 0 06 Mar 2025
Private Selection with Heterogeneous Sensitivities Daniela Antonova Allegra Laro Audra McMillan Lorenz Wolf 149 0 0 10 Jan 2025
On Bits and Bandits: Quantifying the Regret-Information Trade-off Itai Shufaro Nadav Merlis Nir Weinberger Shie Mannor 164 0 0 26 May 2024
Open Bandit Dataset and Pipeline: Towards Realistic and Reproducible Off-Policy Evaluation Yuta Saito Shunsuke Aihara Megumi Matsutani Yusuke Narita OffRL 171 75 0 17 Aug 2020
Practical Contextual Bandits with Regression Oracles Dylan J. Foster Alekh Agarwal Miroslav Dudík Haipeng Luo Robert Schapire 393 127 0 03 Mar 2018
Make the Minority Great Again: First-Order Regret Bound for Contextual Bandits Zeyuan Allen-Zhu Sébastien Bubeck Yuanzhi Li LRM 152 30 0 09 Feb 2018
Mostly Exploration-Free Algorithms for Contextual Bandits Hamsa Bastani Mohsen Bayati Khashayar Khosravi 373 158 0 28 Apr 2017
Active Learning for Cost-Sensitive Classification A. Krishnamurthy Alekh Agarwal Tzu-Kuo Huang Hal Daumé John Langford 217 79 0 03 Mar 2017
Making Contextual Decisions with Low Technical Debt Alekh Agarwal Sarah Bird Markus Cozowicz Luong Hoang John Langford ... Dan Melamed Gal Oshri Oswaldo Ribas S. Sen Alex Slivkins OffRL 134 34 0 13 Jun 2016
Recommendations as Treatments: Debiasing Learning and Evaluation Tobias Schnabel Adith Swaminathan Ashudeep Singh Navin Chandak Thorsten Joachims CML 162 686 0 17 Feb 2016
Bootstrapped Thompson Sampling and Deep Exploration Ian Osband Benjamin Van Roy 152 105 0 01 Jul 2015
Efficient and Parsimonious Agnostic Active Learning Tzu-Kuo Huang Alekh Agarwal Daniel J. Hsu John Langford Robert Schapire 132 47 0 29 Jun 2015
Thompson sampling with the online bootstrap Dean Eckles M. Kaptein 114 58 0 15 Oct 2014
Normalized Online Learning Stéphane Ross Paul Mineiro John Langford 146 69 0 09 Aug 2014
Taming the Monster: A Fast and Simple Algorithm for Contextual Bandits Alekh Agarwal Daniel J. Hsu Satyen Kale John Langford Lihong Li Robert Schapire OffRL 396 510 0 04 Feb 2014
Efficient Online Bootstrapping for Large Scale Learning Zhen Qin V. Petříček Nikos Karampatziakis Lihong Li John Langford OnRL 121 8 0 18 Dec 2013
Thompson Sampling for Contextual Bandits with Linear Payoffs Shipra Agrawal Navin Goyal 195 1,004 0 15 Sep 2012
Contextual Bandit Learning with Predictable Rewards Alekh Agarwal Miroslav Dudík Satyen Kale John Langford Robert Schapire OffRL 404 86 0 07 Feb 2012
Efficient Optimal Learning for Contextual Bandits Miroslav Dudík Daniel J. Hsu Satyen Kale Nikos Karampatziakis John Langford L. Reyzin Tong Zhang 192 302 0 13 Jun 2011
Doubly Robust Policy Evaluation and Learning Miroslav Dudík John Langford Lihong Li OffRL 343 697 0 23 Mar 2011
Online Importance Weight Aware Updates Nikos Karampatziakis John Langford 174 79 0 06 Nov 2010
A Contextual-Bandit Approach to Personalized News Article Recommendation Lihong Li Wei Chu John Langford Robert Schapire 471 2,954 0 28 Feb 2010