Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1802.04064
Cited By
v1
v2
v3
v4
v5 (latest)
A Contextual Bandit Bake-off
12 February 2018
A. Bietti
Alekh Agarwal
John Langford
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"A Contextual Bandit Bake-off"
22 / 22 papers shown
Title
Greedy Algorithm for Structured Bandits: A Sharp Characterization of Asymptotic Success / Failure
Aleksandrs Slivkins
Yunzong Xu
Shiliang Zuo
519
1
0
06 Mar 2025
Private Selection with Heterogeneous Sensitivities
Daniela Antonova
Allegra Laro
Audra McMillan
Lorenz Wolf
149
0
0
10 Jan 2025
On Bits and Bandits: Quantifying the Regret-Information Trade-off
Itai Shufaro
Nadav Merlis
Nir Weinberger
Shie Mannor
164
0
0
26 May 2024
Open Bandit Dataset and Pipeline: Towards Realistic and Reproducible Off-Policy Evaluation
Yuta Saito
Shunsuke Aihara
Megumi Matsutani
Yusuke Narita
OffRL
171
75
0
17 Aug 2020
Practical Contextual Bandits with Regression Oracles
Dylan J. Foster
Alekh Agarwal
Miroslav Dudík
Haipeng Luo
Robert Schapire
393
127
0
03 Mar 2018
Make the Minority Great Again: First-Order Regret Bound for Contextual Bandits
Zeyuan Allen-Zhu
Sébastien Bubeck
Yuanzhi Li
LRM
152
30
0
09 Feb 2018
Mostly Exploration-Free Algorithms for Contextual Bandits
Hamsa Bastani
Mohsen Bayati
Khashayar Khosravi
373
158
0
28 Apr 2017
Active Learning for Cost-Sensitive Classification
A. Krishnamurthy
Alekh Agarwal
Tzu-Kuo Huang
Hal Daumé
John Langford
217
79
0
03 Mar 2017
Making Contextual Decisions with Low Technical Debt
Alekh Agarwal
Sarah Bird
Markus Cozowicz
Luong Hoang
John Langford
...
Dan Melamed
Gal Oshri
Oswaldo Ribas
S. Sen
Alex Slivkins
OffRL
134
34
0
13 Jun 2016
Recommendations as Treatments: Debiasing Learning and Evaluation
Tobias Schnabel
Adith Swaminathan
Ashudeep Singh
Navin Chandak
Thorsten Joachims
CML
162
686
0
17 Feb 2016
Bootstrapped Thompson Sampling and Deep Exploration
Ian Osband
Benjamin Van Roy
152
105
0
01 Jul 2015
Efficient and Parsimonious Agnostic Active Learning
Tzu-Kuo Huang
Alekh Agarwal
Daniel J. Hsu
John Langford
Robert Schapire
132
47
0
29 Jun 2015
Thompson sampling with the online bootstrap
Dean Eckles
M. Kaptein
114
58
0
15 Oct 2014
Normalized Online Learning
Stéphane Ross
Paul Mineiro
John Langford
146
69
0
09 Aug 2014
Taming the Monster: A Fast and Simple Algorithm for Contextual Bandits
Alekh Agarwal
Daniel J. Hsu
Satyen Kale
John Langford
Lihong Li
Robert Schapire
OffRL
396
510
0
04 Feb 2014
Efficient Online Bootstrapping for Large Scale Learning
Zhen Qin
V. Petříček
Nikos Karampatziakis
Lihong Li
John Langford
OnRL
121
8
0
18 Dec 2013
Thompson Sampling for Contextual Bandits with Linear Payoffs
Shipra Agrawal
Navin Goyal
195
1,004
0
15 Sep 2012
Contextual Bandit Learning with Predictable Rewards
Alekh Agarwal
Miroslav Dudík
Satyen Kale
John Langford
Robert Schapire
OffRL
404
86
0
07 Feb 2012
Efficient Optimal Learning for Contextual Bandits
Miroslav Dudík
Daniel J. Hsu
Satyen Kale
Nikos Karampatziakis
John Langford
L. Reyzin
Tong Zhang
192
302
0
13 Jun 2011
Doubly Robust Policy Evaluation and Learning
Miroslav Dudík
John Langford
Lihong Li
OffRL
343
697
0
23 Mar 2011
Online Importance Weight Aware Updates
Nikos Karampatziakis
John Langford
174
79
0
06 Nov 2010
A Contextual-Bandit Approach to Personalized News Article Recommendation
Lihong Li
Wei Chu
John Langford
Robert Schapire
471
2,954
0
28 Feb 2010
1