1402.0555
Cited By
v1
v2 (latest)
Taming the Monster: A Fast and Simple Algorithm for Contextual Bandits
4 February 2014
Alekh Agarwal
Daniel J. Hsu
Satyen Kale
John Langford
Lihong Li
Robert Schapire
OffRL
Papers citing
"Taming the Monster: A Fast and Simple Algorithm for Contextual Bandits"
50 / 202 papers shown
Title
Contextual Search via Intrinsic Volumes
R. Leme
Jon Schneider
125
43
0
09 Apr 2018
Semiparametric Contextual Bandits
A. Krishnamurthy
Zhiwei Steven Wu
Vasilis Syrgkanis
148
45
0
12 Mar 2018
Multi-objective Contextual Bandit Problem with Similarity Information
E. Turğay
Doruk Öner
Cem Tekin
66
37
0
11 Mar 2018
Enhancing Evolutionary Conversion Rate Optimization via Multi-armed Bandit Algorithms
Xin Qiu
Risto Miikkulainen
15
4
0
10 Mar 2018
Practical Contextual Bandits with Regression Oracles
Dylan J. Foster
Alekh Agarwal
Miroslav Dudík
Haipeng Luo
Robert Schapire
406
127
0
03 Mar 2018
On Oracle-Efficient PAC RL with Rich Observations
Christoph Dann
Nan Jiang
A. Krishnamurthy
Alekh Agarwal
John Langford
Robert Schapire
60
98
0
01 Mar 2018
Deep Bayesian Bandits Showdown: An Empirical Comparison of Bayesian Deep Networks for Thompson Sampling
C. Riquelme
George Tucker
Jasper Snoek
BDL
93
366
0
26 Feb 2018
Active Learning with Logged Data
Songbai Yan
Kamalika Chaudhuri
T. Javidi
159
27
0
25 Feb 2018
A Contextual Bandit Bake-off
A. Bietti
Alekh Agarwal
John Langford
433
105
0
12 Feb 2018
Learning to Bid Without Knowing your Value
Zhe Feng
Chara Podimata
Vasilis Syrgkanis
119
57
0
03 Nov 2017
Multi-objective Contextual Multi-armed Bandit with a Dominant Objective
Cem Tekin
E. Turğay
89
37
0
18 Aug 2017
Efficient Contextual Bandits in Non-stationary Worlds
Haipeng Luo
Chen-Yu Wei
Alekh Agarwal
John Langford
91
133
0
05 Aug 2017
Pyramid: Enhancing Selectivity in Big Data Protection with Count Featurization
Mathias Lécuyer
Riley Spahn
Roxana Geambasu
Tzu-Kuo Huang
S. Sen
24
9
0
21 May 2017
Mapping Instructions and Visual Observations to Actions with Reinforcement Learning
Dipendra Kumar Misra
John Langford
Yoav Artzi
86
247
0
28 Apr 2017
Unifying PAC and Regret: Uniform PAC Bounds for Episodic Reinforcement Learning
Christoph Dann
Tor Lattimore
Emma Brunskill
109
311
0
22 Mar 2017
Effective Evaluation using Logged Bandit Feedback from Multiple Loggers
Aman Agarwal
Soumya Basu
Tobias Schnabel
Thorsten Joachims
OffRL
148
69
0
17 Mar 2017
Active Learning for Cost-Sensitive Classification
A. Krishnamurthy
Alekh Agarwal
Tzu-Kuo Huang
Hal Daumé
John Langford
278
80
0
03 Mar 2017
Provably Optimal Algorithms for Generalized Linear Contextual Bandits
Lihong Li
Yu Lu
Dengyong Zhou
184
94
0
28 Feb 2017
Efficient Online Bandit Multiclass Learning with $\tilde{O}(\sqrt{T})$ Regret
A. Beygelzimer
Francesco Orabona
Chicheng Zhang
48
20
0
25 Feb 2017
Policy Learning with Observational Data
Susan Athey
Stefan Wager
CML
OffRL
459
183
0
09 Feb 2017
Learning causal effects from many randomized experiments using regularized instrumental variables
A. Peysakhovich
Dean Eckles
CML
118
23
0
04 Jan 2017
Corralling a Band of Bandit Algorithms
Alekh Agarwal
Haipeng Luo
Behnam Neyshabur
Robert Schapire
166
157
0
19 Dec 2016
Oracle-Efficient Online Learning and Auction Design
Miroslav Dudík
Nika Haghtalab
Haipeng Luo
Robert Schapire
Vasilis Syrgkanis
Jennifer Wortman Vaughan
105
62
0
05 Nov 2016
Multidimensional Binary Search for Contextual Decision-Making
Ilan Lobel
R. Leme
Adrian Vladu
90
62
0
02 Nov 2016
Fair Algorithms for Infinite and Contextual Bandits
Matthew Joseph
Michael Kearns
Jamie Morgenstern
Seth Neel
Aaron Roth
FedML
FaML
93
56
0
29 Oct 2016
Contextual Decision Processes with Low Bellman Rank are PAC-Learnable
Nan Jiang
A. Krishnamurthy
Alekh Agarwal
John Langford
Robert Schapire
193
421
0
29 Oct 2016
Online Learning Schemes for Power Allocation in Energy Harvesting Communications
Pranav Sakulkar
Bhaskar Krishnamachari
OffRL
49
19
0
08 Jul 2016
Concrete Problems in AI Safety
Dario Amodei
C. Olah
Jacob Steinhardt
Paul Christiano
John Schulman
Dandelion Mané
312
2,406
0
21 Jun 2016
Learning Optimal Interventions
Jonas W. Mueller
David N. Reshef
George Du
Tommi Jaakkola
40
9
0
16 Jun 2016
Making Contextual Decisions with Low Technical Debt
Alekh Agarwal
Sarah Bird
Markus Cozowicz
Luong Hoang
John Langford
...
Dan Melamed
Gal Oshri
Oswaldo Ribas
S. Sen
Alex Slivkins
OffRL
161
34
0
13 Jun 2016
Causal Bandits: Learning Good Interventions via Causal Inference
Finnian Lattimore
Tor Lattimore
Mark D. Reid
CML
62
163
0
10 Jun 2016
Improved Regret Bounds for Oracle-Based Adversarial Contextual Bandits
Vasilis Syrgkanis
Haipeng Luo
A. Krishnamurthy
Robert Schapire
167
42
0
01 Jun 2016
Fairness in Learning: Classic and Contextual Bandits
Matthew Joseph
Michael Kearns
Jamie Morgenstern
Aaron Roth
FaML
81
477
0
23 May 2016
Stochastic Contextual Bandits with Known Reward Functions
Pranav Sakulkar
Bhaskar Krishnamachari
48
14
0
30 Apr 2016
Latent Contextual Bandits and their Application to Personalized Recommendations for New Users
Li Zhou
Emma Brunskill
88
62
0
22 Apr 2016
Bayesian Exploration: Incentivizing Exploration in Bayesian Games
Yishay Mansour
Aleksandrs Slivkins
Vasilis Syrgkanis
Zhiwei Steven Wu
91
105
0
24 Feb 2016
Efficient Algorithms for Adversarial Contextual Learning
Vasilis Syrgkanis
A. Krishnamurthy
Robert Schapire
179
80
0
08 Feb 2016
BISTRO: An Efficient Relaxation-Based Method for Contextual Bandits
Alexander Rakhlin
Karthik Sridharan
OffRL
395
72
0
06 Feb 2016
A Survey of Online Experiment Design with the Stochastic Multi-Armed Bandit
Giuseppe Burtini
Jason L. Loeppky
Ramon Lawrence
88
119
0
02 Oct 2015
A Survey on Contextual Multi-armed Bandits
Li Zhou
98
127
0
13 Aug 2015
Linear Contextual Bandits with Knapsacks
Shipra Agrawal
Nikhil R. Devanur
192
144
0
24 Jul 2015
On the Prior Sensitivity of Thompson Sampling
Che-Yu Liu
Lihong Li
90
25
0
10 Jun 2015
An efficient algorithm for contextual bandits with knapsacks, and an extension to concave objectives
Shipra Agrawal
Nikhil R. Devanur
Lihong Li
122
91
0
10 Jun 2015
Random Forest for the Contextual Bandit Problem - extended version
Raphael Feraud
Robin Allesiardo
Tanguy Urvoy
Fabrice Clérot
180
48
0
27 Apr 2015
Algorithms with Logarithmic or Sublinear Regret for Constrained Contextual Bandits
Huasen Wu
R. Srikant
Xin Liu
Chong Jiang
118
95
0
27 Apr 2015
The Computational Power of Optimization in Online Learning
Elad Hazan
Tomer Koren
211
69
0
08 Apr 2015
Contextual Dueling Bandits
Miroslav Dudík
Katja Hofmann
Robert Schapire
Aleksandrs Slivkins
M. Zoghi
125
127
0
23 Feb 2015
Contextual Semibandits via Supervised Learning Oracles
A. Krishnamurthy
Alekh Agarwal
Miroslav Dudík
OffRL
189
21
0
20 Feb 2015
Learning Reductions that Really Work
A. Beygelzimer
Hal Daumé
John Langford
Paul Mineiro
AI4CE
107
24
0
09 Feb 2015
Counterfactual Risk Minimization: Learning from Logged Bandit Feedback
Adith Swaminathan
Thorsten Joachims
OffRL
158
167
0
09 Feb 2015