Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1711.03591
Cited By
Efficient-UCBV: An Almost Optimal Algorithm using Variance Estimates
9 November 2017
Subhojyoti Mukherjee
K. P. Naveen
N. Sudarsanam
Balaraman Ravindran
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Efficient-UCBV: An Almost Optimal Algorithm using Variance Estimates"
4 / 4 papers shown
Title
Kullback-Leibler Maillard Sampling for Multi-armed Bandits with Bounded Rewards
Hao Qin
Kwang-Sung Jun
Chicheng Zhang
49
0
0
28 Apr 2023
SPEED: Experimental Design for Policy Evaluation in Linear Heteroscedastic Bandits
Subhojyoti Mukherjee
Qiaomin Xie
Josiah P. Hanna
R. Nowak
OffRL
58
5
0
29 Jan 2023
Safety Aware Changepoint Detection for Piecewise i.i.d. Bandits
Subhojyoti Mukherjee
26
1
0
27 May 2022
ReVar: Strengthening Policy Evaluation via Reduced Variance Sampling
Subhojyoti Mukherjee
Josiah P. Hanna
Robert D. Nowak
OffRL
34
12
0
09 Mar 2022
1