Randomized Allocation with Nonparametric Estimation for Contextual
Multi-Armed Bandits with Delayed Rewards

v1v2v3 (latest)

Randomized Allocation with Nonparametric Estimation for Contextual Multi-Armed Bandits with Delayed Rewards

3 February 2019

ArXiv (abs)PDF HTML

Papers citing "Randomized Allocation with Nonparametric Estimation for Contextual Multi-Armed Bandits with Delayed Rewards"

8 / 8 papers shown

Title
Contextual Linear Bandits with Delay as Payoff Mengxiao Zhang Yingfei Wang Haipeng Luo 166 2 0 18 Feb 2025
Stochastic Bandit Models for Delayed Conversions Claire Vernade Olivier Cappé Vianney Perchet 38 95 0 28 Jun 2017
Delay and Cooperation in Nonstochastic Bandits Nicolò Cesa-Bianchi Claudio Gentile Yishay Mansour Alberto Minora 49 145 0 15 Feb 2016
Online Learning under Delayed Feedback Pooria Joulani András Gyorgy Csaba Szepesvári 90 279 0 04 Jun 2013
The multi-armed bandit problem with covariates Vianney Perchet Philippe Rigollet 441 174 0 27 Oct 2011
Efficient Optimal Learning for Contextual Bandits Miroslav Dudík Daniel J. Hsu Satyen Kale Nikos Karampatziakis John Langford L. Reyzin Tong Zhang 192 302 0 13 Jun 2011
A Contextual-Bandit Approach to Personalized News Article Recommendation Lihong Li Wei Chu John Langford Robert Schapire 471 2,954 0 28 Feb 2010
Contextual Bandits with Similarity Information Aleksandrs Slivkins 461 450 0 23 Jul 2009