Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2406.00317
Cited By
Combining Experimental and Historical Data for Policy Evaluation
1 June 2024
Ting Li
Chengchun Shi
Qianglin Wen
Yang Sui
Yongli Qin
Chunbo Lai
Hongtu Zhu
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Combining Experimental and Historical Data for Policy Evaluation"
20 / 20 papers shown
Title
Off-policy Evaluation in Doubly Inhomogeneous Environments
Zeyu Bian
C. Shi
Zhengling Qi
Lan Wang
OffRL
47
7
0
14 Jun 2023
Policy learning "without" overlap: Pessimism and generalized empirical Bernstein's inequality
Ying Jin
Zhimei Ren
Zhuoran Yang
Zhaoran Wang
OffRL
91
26
0
19 Dec 2022
A Review of Off-Policy Evaluation in Reinforcement Learning
Masatoshi Uehara
C. Shi
Nathan Kallus
OffRL
85
74
0
13 Dec 2022
Testing Stationarity and Change Point Detection in Reinforcement Learning
Mengbing Li
C. Shi
Zhanghua Wu
Piotr Fryzlewicz
OffRL
73
9
0
03 Mar 2022
Pessimistic Q-Learning for Offline Reinforcement Learning: Towards Optimal Sample Complexity
Laixi Shi
Gen Li
Yuting Wei
Yuxin Chen
Yuejie Chi
OffRL
70
93
0
28 Feb 2022
Off-Policy Evaluation in Partially Observed Markov Decision Processes under Sequential Ignorability
Yupeng Tang
Seung-seob Lee
OffRL
81
26
0
24 Oct 2021
Bellman-consistent Pessimism for Offline Reinforcement Learning
Tengyang Xie
Ching-An Cheng
Nan Jiang
Paul Mineiro
Alekh Agarwal
OffRL
LRM
131
276
0
13 Jun 2021
A Deep Value-network Based Approach for Multi-Driver Order Dispatching
Xiaocheng Tang
Zhiwei Qin
Fan Zhang
Zhaodong Wang
Zhe Xu
Yintai Ma
Hongtu Zhu
Jieping Ye
OffRL
45
180
0
08 Jun 2021
Pattern Transfer Learning for Reinforcement Learning in Order Dispatching
Runzhe Wan
Sheng Zhang
C. Shi
Shuang Luo
R. Song
AI4TS
30
3
0
27 May 2021
Reinforcement Learning for Ridesharing: An Extended Survey
Zhiwei Qin
Hongtu Zhu
Jieping Ye
76
85
0
03 May 2021
Bridging Offline Reinforcement Learning and Imitation Learning: A Tale of Pessimism
Paria Rashidinejad
Banghua Zhu
Cong Ma
Jiantao Jiao
Stuart J. Russell
OffRL
198
286
0
22 Mar 2021
Bootstrapping Fitted Q-Evaluation for Off-Policy Inference
Botao Hao
X. Ji
Yaqi Duan
Hao Lu
Csaba Szepesvári
Mengdi Wang
OffRL
39
40
0
06 Feb 2021
Conservative Q-Learning for Offline Reinforcement Learning
Aviral Kumar
Aurick Zhou
George Tucker
Sergey Levine
OffRL
OnRL
131
1,806
0
08 Jun 2020
Double Reinforcement Learning for Efficient Off-Policy Evaluation in Markov Decision Processes
Nathan Kallus
Masatoshi Uehara
OffRL
68
186
0
22 Aug 2019
Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction
Aviral Kumar
Justin Fu
George Tucker
Sergey Levine
OffRL
OnRL
109
1,054
0
03 Jun 2019
Breaking the Curse of Horizon: Infinite-Horizon Off-Policy Estimation
Qiang Liu
Lihong Li
Ziyang Tang
Dengyong Zhou
OffRL
138
355
0
29 Oct 2018
Orthogonal Random Forest for Causal Inference
Miruna Oprescu
Vasilis Syrgkanis
Zhiwei Steven Wu
CML
50
111
0
09 Jun 2018
Time series experiments and causal estimands: exact randomization tests and trading
Iavor Bojinov
N. Shephard
30
113
0
23 Jun 2017
Data-Efficient Off-Policy Policy Evaluation for Reinforcement Learning
Philip S. Thomas
Emma Brunskill
OffRL
373
576
0
04 Apr 2016
Doubly Robust Policy Evaluation and Optimization
Miroslav Dudík
D. Erhan
John Langford
Lihong Li
OffRL
168
285
0
10 Mar 2015
1