2001.04515
Statistical Inference of the Value Function for Reinforcement Learning in Infinite Horizon Settings
13 January 2020
C. Shi
Shengyao Zhang
W. Lu
R. Song
OffRL
Papers citing "Statistical Inference of the Value Function for Reinforcement Learning in Infinite Horizon Settings"
20 / 20 papers shown
Title
IGN : Implicit Generative Networks
Haozheng Luo
Tianyi Wu
Feiyu Han
Zhijun Yan
OffRL
69
1
0
24 Feb 2025
Statistical Inference in Reinforcement Learning: A Selective Survey
Chengchun Shi
OffRL
195
2
0
22 Feb 2025
Counterfactually Fair Reinforcement Learning via Sequential Data Preprocessing
Jitao Wang
C. Shi
John D. Piette
Joshua R. Loftus
Donglin Zeng
Zhenke Wu
OffRL
112
0
0
10 Jan 2025
Online Estimation and Inference for Robust Policy Evaluation in Reinforcement Learning
Weidong Liu
Jiyuan Tu
Yichen Zhang
Xi Chen
OffRL
47
4
0
04 Oct 2023
Doubly Inhomogeneous Reinforcement Learning
Liyuan Hu
Mengbing Li
C. Shi
Zhanghua Wu
Piotr Fryzlewicz
OffRL
62
2
0
08 Nov 2022
Deeply-Debiased Off-Policy Interval Estimation
C. Shi
Runzhe Wan
Victor Chernozhukov
R. Song
OffRL
38
38
0
10 May 2021
Fast Rates for the Regret of Offline Reinforcement Learning
Yichun Hu
Nathan Kallus
Masatoshi Uehara
OffRL
51
30
0
31 Jan 2021
Minimax Weight and Q-Function Learning for Off-Policy Evaluation
Masatoshi Uehara
Jiawei Huang
Nan Jiang
OffRL
103
186
0
28 Oct 2019
Doubly Robust Bias Reduction in Infinite Horizon Off-Policy Estimation
Ziyang Tang
Yihao Feng
Lihong Li
Dengyong Zhou
Qiang Liu
OffRL
99
68
0
16 Oct 2019
Efficiently Breaking the Curse of Horizon in Off-Policy Evaluation with Double Reinforcement Learning
Nathan Kallus
Masatoshi Uehara
OffRL
49
91
0
12 Sep 2019
Double Reinforcement Learning for Efficient Off-Policy Evaluation in Markov Decision Processes
Nathan Kallus
Masatoshi Uehara
OffRL
68
185
0
22 Aug 2019
When to Trust Your Model: Model-Based Policy Optimization
Michael Janner
Justin Fu
Marvin Zhang
Sergey Levine
OffRL
73
939
0
19 Jun 2019
A Theoretical Analysis of Deep Q-Learning
Jianqing Fan
Zhuoran Yang
Yuchen Xie
Zhaoran Wang
119
601
0
01 Jan 2019
Real-Time Bidding with Multi-Agent Reinforcement Learning in Display Advertising
Junqi Jin
Cheng-Ning Song
Han Li
Kun Gai
Jun Wang
Weinan Zhang
46
179
0
27 Feb 2018
Estimating Dynamic Treatment Regimes in Mobile Health Using V-learning
Daniel J. Luckett
Eric B. Laber
A. Kahkoska
D. Maahs
E. Mayer‐Davis
Michael R. Kosorok
46
137
0
10 Nov 2016
Statistical inference for the mean outcome under a possibly non-unique optimal treatment strategy
Alexander Luedtke
M. J. van der Laan
105
220
0
24 Mar 2016
Doubly Robust Off-policy Value Evaluation for Reinforcement Learning
Nan Jiang
Lihong Li
OffRL
157
621
0
11 Nov 2015
Optimal Uniform Convergence Rates and Asymptotic Normality for Series Estimators Under Weak Dependence and Weak Conditions
Xiaohong Chen
T. Christensen
51
151
0
18 Dec 2014
Performance guarantees for individualized treatment rules
Min Qian
Susan Murphy
141
556
0
17 May 2011
Fast learning rates for plug-in classifiers
Jean-Yves Audibert
Alexandre B. Tsybakov
412
467
0
17 Aug 2007