Randomized Allocation with Nonparametric Estimation for Contextual
  Multi-Armed Bandits with Delayed Rewards
v1v2v3 (latest)

Randomized Allocation with Nonparametric Estimation for Contextual Multi-Armed Bandits with Delayed Rewards

Papers citing "Randomized Allocation with Nonparametric Estimation for Contextual Multi-Armed Bandits with Delayed Rewards"