1902.00819
Randomized Allocation with Nonparametric Estimation for Contextual Multi-Armed Bandits with Delayed Rewards
3 February 2019
Sakshi Arya
Yuhong Yang
Papers citing "Randomized Allocation with Nonparametric Estimation for Contextual Multi-Armed Bandits with Delayed Rewards"
Contextual Linear Bandits with Delay as Payoff
Mengxiao Zhang
Yingfei Wang
Haipeng Luo
166
2
0
18 Feb 2025
Stochastic Bandit Models for Delayed Conversions
Claire Vernade
Olivier Cappé
Vianney Perchet
38
95
0
28 Jun 2017
Delay and Cooperation in Nonstochastic Bandits
Nicolò Cesa-Bianchi
Claudio Gentile
Yishay Mansour
Alberto Minora
49
145
0
15 Feb 2016
Online Learning under Delayed Feedback
Pooria Joulani
András György
Csaba Szepesvári
90
279
0
04 Jun 2013
The multi-armed bandit problem with covariates
Vianney Perchet
Philippe Rigollet
441
174
0
27 Oct 2011
Efficient Optimal Learning for Contextual Bandits
Miroslav Dudík
Daniel J. Hsu
Satyen Kale
Nikos Karampatziakis
John Langford
L. Reyzin
Tong Zhang
192
302
0
13 Jun 2011
A Contextual-Bandit Approach to Personalized News Article Recommendation
Lihong Li
Wei Chu
John Langford
Robert Schapire
471
2,954
0
28 Feb 2010
Contextual Bandits with Similarity Information
Aleksandrs Slivkins
461
450
0
23 Jul 2009