2010.07283
Cited By
Statistical Inference for Online Decision-Making: In a Contextual Bandit Setting
14 October 2020
Haoyu Chen
Wenbin Lu
R. Song
OffRL
Papers citing
"Statistical Inference for Online Decision-Making: In a Contextual Bandit Setting"
17 / 17 papers shown
Title
Contextual Online Uncertainty-Aware Preference Learning for Human Feedback
Nan Lu
Ethan X. Fang
Junwei Lu
167
0
0
27 Apr 2025
Asymptotic Time-Uniform Inference for Parameters in Averaged Stochastic Approximation
Chuhan Xie
Kaicheng Jin
Jiadong Liang
Zhihua Zhang
26
0
0
19 Oct 2024
Linear Contextual Bandits with Interference
Yang Xu
Wenbin Lu
Rui Song
27
0
0
24 Sep 2024
Dynamic Online Recommendation for Two-Sided Market with Bayesian Incentive Compatibility
Yuantong Li
Guang Cheng
Xiaowu Dai
32
1
0
04 Jun 2024
When Mining Electric Locomotives Meet Reinforcement Learning
Ying Li
Z. Zhu
Xiaoqiang Li
Chunyu Yang
Hao Lu
13
1
0
14 Nov 2023
Online Estimation and Inference for Robust Policy Evaluation in Reinforcement Learning
Weidong Liu
Jiyuan Tu
Yichen Zhang
Xi Chen
OffRL
24
2
0
04 Oct 2023
Statistical Inference on Multi-armed Bandits with Delayed Feedback
Lei Shi
Jingshen Wang
Tianhao Wu
22
4
0
03 Jul 2023
Kernel ε-Greedy for Contextual Bandits
Sakshi Arya
Bharath K. Sriperumbudur
11
0
0
29 Jun 2023
Online Statistical Inference for Contextual Bandits via Stochastic Gradient Descent
Xinyu Chen
Zehua Lai
He Li
Yichen Zhang
26
4
0
30 Dec 2022
Anytime-valid off-policy inference for contextual bandits
Ian Waudby-Smith
Lili Wu
Aaditya Ramdas
Nikos Karampatziakis
Paul Mineiro
OffRL
43
25
0
19 Oct 2022
Robust Tests in Online Decision-Making
Gi-Soo Kim
Hyun-Joon Yang
J. P. Kim
OffRL
18
0
0
21 Aug 2022
Doubly Robust Interval Estimation for Optimal Policy Evaluation in Online Learning
Ye Shen
Hengrui Cai
Rui Song
OffRL
34
2
0
29 Oct 2021
Sequential Estimation under Multiple Resources: a Bandit Point of View
Alireza Masoumian
Shayan Kiyani
Mohammad Hossein Yassaee
28
1
0
29 Sep 2021
Statistical Inference with M-Estimators on Adaptively Collected Data
Kelly W. Zhang
Lucas Janson
S. Murphy
OffRL
13
40
0
29 Apr 2021
Statistical Inference for Online Decision Making via Stochastic Gradient Descent
Haoyu Chen
Wenbin Lu
R. Song
OffRL
8
26
0
14 Oct 2020
Nearly Dimension-Independent Sparse Linear Bandit over Small Action Spaces via Best Subset Selection
Yining Wang
Yi Chen
Ethan X. Fang
Zhaoran Wang
Runze Li
12
16
0
04 Sep 2020
Online Regularization towards Always-Valid High-Dimensional Dynamic Pricing
ChiHua Wang
Zhanyu Wang
W. Sun
Guang Cheng
16
7
0
05 Jul 2020