Combining Experimental and Historical Data for Policy Evaluation


1 June 2024
Ting Li
Chengchun Shi
Qianglin Wen
Yang Sui
Yongli Qin
Chunbo Lai
Hongtu Zhu
    OffRL

Papers citing "Combining Experimental and Historical Data for Policy Evaluation"

20 / 20 papers shown
Off-policy Evaluation in Doubly Inhomogeneous Environments
Zeyu Bian
C. Shi
Zhengling Qi
Lan Wang
OffRL
47
7
0
14 Jun 2023
Policy learning "without" overlap: Pessimism and generalized empirical Bernstein's inequality
Ying Jin
Zhimei Ren
Zhuoran Yang
Zhaoran Wang
OffRL
91
26
0
19 Dec 2022
A Review of Off-Policy Evaluation in Reinforcement Learning
Masatoshi Uehara
C. Shi
Nathan Kallus
OffRL
85
74
0
13 Dec 2022
Testing Stationarity and Change Point Detection in Reinforcement Learning
Mengbing Li
C. Shi
Zhanghua Wu
Piotr Fryzlewicz
OffRL
73
9
0
03 Mar 2022
Pessimistic Q-Learning for Offline Reinforcement Learning: Towards Optimal Sample Complexity
Laixi Shi
Gen Li
Yuting Wei
Yuxin Chen
Yuejie Chi
OffRL
70
93
0
28 Feb 2022
Off-Policy Evaluation in Partially Observed Markov Decision Processes under Sequential Ignorability
Yupeng Tang
Seung-seob Lee
OffRL
81
26
0
24 Oct 2021
Bellman-consistent Pessimism for Offline Reinforcement Learning
Tengyang Xie
Ching-An Cheng
Nan Jiang
Paul Mineiro
Alekh Agarwal
OffRL
LRM
131
276
0
13 Jun 2021
A Deep Value-network Based Approach for Multi-Driver Order Dispatching
Xiaocheng Tang
Zhiwei Qin
Fan Zhang
Zhaodong Wang
Zhe Xu
Yintai Ma
Hongtu Zhu
Jieping Ye
OffRL
45
180
0
08 Jun 2021
Pattern Transfer Learning for Reinforcement Learning in Order Dispatching
Runzhe Wan
Sheng Zhang
C. Shi
Shuang Luo
R. Song
AI4TS
30
3
0
27 May 2021
Reinforcement Learning for Ridesharing: An Extended Survey
Zhiwei Qin
Hongtu Zhu
Jieping Ye
76
85
0
03 May 2021
Bridging Offline Reinforcement Learning and Imitation Learning: A Tale of Pessimism
Paria Rashidinejad
Banghua Zhu
Cong Ma
Jiantao Jiao
Stuart J. Russell
OffRL
198
286
0
22 Mar 2021
Bootstrapping Fitted Q-Evaluation for Off-Policy Inference
Botao Hao
X. Ji
Yaqi Duan
Hao Lu
Csaba Szepesvári
Mengdi Wang
OffRL
39
40
0
06 Feb 2021
Conservative Q-Learning for Offline Reinforcement Learning
Aviral Kumar
Aurick Zhou
George Tucker
Sergey Levine
OffRL
OnRL
131
1,806
0
08 Jun 2020
Double Reinforcement Learning for Efficient Off-Policy Evaluation in Markov Decision Processes
Nathan Kallus
Masatoshi Uehara
OffRL
68
186
0
22 Aug 2019
Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction
Aviral Kumar
Justin Fu
George Tucker
Sergey Levine
OffRL
OnRL
109
1,054
0
03 Jun 2019
Breaking the Curse of Horizon: Infinite-Horizon Off-Policy Estimation
Qiang Liu
Lihong Li
Ziyang Tang
Dengyong Zhou
OffRL
138
355
0
29 Oct 2018
Orthogonal Random Forest for Causal Inference
Miruna Oprescu
Vasilis Syrgkanis
Zhiwei Steven Wu
CML
50
111
0
09 Jun 2018
Time series experiments and causal estimands: exact randomization tests and trading
Iavor Bojinov
N. Shephard
30
113
0
23 Jun 2017
Data-Efficient Off-Policy Policy Evaluation for Reinforcement Learning
Philip S. Thomas
Emma Brunskill
OffRL
373
576
0
04 Apr 2016
Doubly Robust Policy Evaluation and Optimization
Miroslav Dudík
D. Erhan
John Langford
Lihong Li
OffRL
168
285
0
10 Mar 2015