Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1902.00819
Cited By
Randomized Allocation with Nonparametric Estimation for Contextual Multi-Armed Bandits with Delayed Rewards
3 February 2019
Sakshi Arya
Yuhong Yang
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Randomized Allocation with Nonparametric Estimation for Contextual Multi-Armed Bandits with Delayed Rewards"
9 / 9 papers shown
Title
Contextual Linear Bandits with Delay as Payoff
Mengxiao Zhang
Yingfei Wang
Haipeng Luo
48
0
0
18 Feb 2025
Batched Nonparametric Contextual Bandits
Rong Jiang
Cong Ma
OffRL
39
1
0
27 Feb 2024
Nearest Neighbour with Bandit Feedback
Stephen Pasteris
Chris Hicks
V. Mavroudis
16
3
0
23 Jun 2023
Multi-Armed Bandits with Generalized Temporally-Partitioned Rewards
Ronald C. van den Broek
Rik Litjens
Tobias Sagis
Luc Siecker
Nina Verbeeke
Pratik Gajane
29
0
0
01 Mar 2023
Incorporating Multi-armed Bandit with Local Search for MaxSAT
Jiongzhi Zheng
Kun He
Jianrong Zhou
Yan Jin
ChuMin Li
F. Manyà
19
1
0
29 Nov 2022
BandMaxSAT: A Local Search MaxSAT Solver with Multi-armed Bandit
Jiongzhi Zheng
Kun He
Jianrong Zhou
Yan Jin
ChuMin Li
F. Manyà
23
13
0
14 Jan 2022
Metadata-based Multi-Task Bandits with Bayesian Hierarchical Models
Runzhe Wan
Linjuan Ge
Rui Song
38
28
0
13 Aug 2021
Stochastic bandits with arm-dependent delays
Anne Gael Manegueu
Claire Vernade
Alexandra Carpentier
Michal Valko
32
44
0
18 Jun 2020
Nonstochastic Multiarmed Bandits with Unrestricted Delays
Tobias Sommer Thune
Nicolò Cesa-Bianchi
Yevgeny Seldin
28
52
0
03 Jun 2019
1