Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1906.05247
Cited By
Bootstrapping Upper Confidence Bound
12 June 2019
Botao Hao
Yasin Abbasi-Yadkori
Zheng Wen
Guang Cheng
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Bootstrapping Upper Confidence Bound"
14 / 14 papers shown
Title
Not All Documents Are What You Need for Extracting Instruction Tuning Data
Chi Zhang
Huaping Zhong
Hongtao Li
Chengliang Chai
Jiawei Hong
...
Jiantao Qiu
Ye Yuan
Guoren Wang
Zeang Sheng
Lei Cao
SyDa
29
0
0
18 May 2025
Gaussian Approximation and Multiplier Bootstrap for Polyak-Ruppert Averaged Linear Stochastic Approximation with Applications to TD Learning
S. Samsonov
Eric Moulines
Qi-Man Shao
Zhuo-Song Zhang
Alexey Naumov
38
5
0
26 May 2024
Zero-Inflated Bandits
Haoyu Wei
Runzhe Wan
Lei Shi
Rui Song
51
0
0
25 Dec 2023
Multi-Agent Probabilistic Ensembles with Trajectory Sampling for Connected Autonomous Vehicles
Ruoqi Wen
Jiahao Huang
Rongpeng Li
Guoru Ding
Zhifeng Zhao
42
1
0
21 Dec 2023
Did we personalize? Assessing personalization by an online reinforcement learning algorithm using resampling
Susobhan Ghosh
Raphael Kim
Prasidh Chhabria
Raaz Dwivedi
Predrag Klasjna
Peng Liao
Kelly Zhang
Susan Murphy
OffRL
43
8
0
11 Apr 2023
Multiplier Bootstrap-based Exploration
Runzhe Wan
Haoyu Wei
Branislav Kveton
R. Song
21
3
0
03 Feb 2023
Can Direct Latent Model Learning Solve Linear Quadratic Gaussian Control?
Yi Tian
Kai Zhang
Russ Tedrake
S. Sra
52
4
0
30 Dec 2022
A Critical Review of Traffic Signal Control and A Novel Unified View of Reinforcement Learning and Model Predictive Control Approaches for Adaptive Traffic Signal Control
Xiaoyu Wang
Scott Sanner
Baher Abdulhai
32
5
0
26 Nov 2022
Lower Bounds for the Convergence of Tensor Power Iteration on Random Overcomplete Models
Yuchen Wu
Kangjie Zhou
36
6
0
07 Nov 2022
Misspecified Phase Retrieval with Generative Priors
Zhaoqiang Liu
Xinshao Wang
Jiulong Liu
55
4
0
11 Oct 2022
Robust Tests in Online Decision-Making
Gi-Soo Kim
Hyun-Joon Yang
J. P. Kim
OffRL
26
0
0
21 Aug 2022
Optimal Regret Is Achievable with Bounded Approximate Inference Error: An Enhanced Bayesian Upper Confidence Bound Framework
Ziyi Huang
Henry Lam
A. Meisami
Haofeng Zhang
55
4
0
31 Jan 2022
Online Bootstrap Inference For Policy Evaluation in Reinforcement Learning
Pratik Ramprasad
Yuantong Li
Zhuoran Yang
Zhaoran Wang
W. Sun
Guang Cheng
OffRL
57
27
0
08 Aug 2021
A Unifying Framework for Reinforcement Learning and Planning
Thomas M. Moerland
Joost Broekens
Aske Plaat
Catholijn M. Jonker
OffRL
48
9
0
26 Jun 2020
1