ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2001.04515
  4. Cited By
Statistical Inference of the Value Function for Reinforcement Learning
  in Infinite Horizon Settings

Statistical Inference of the Value Function for Reinforcement Learning in Infinite Horizon Settings

13 January 2020
C. Shi
Shengyao Zhang
W. Lu
R. Song
    OffRL
ArXivPDFHTML

Papers citing "Statistical Inference of the Value Function for Reinforcement Learning in Infinite Horizon Settings"

20 / 20 papers shown
Title
IGN : Implicit Generative Networks
IGN : Implicit Generative Networks
Haozheng Luo
Tianyi Wu
Feiyu Han
Zhijun Yan
OffRL
69
1
0
24 Feb 2025
Statistical Inference in Reinforcement Learning: A Selective Survey
Statistical Inference in Reinforcement Learning: A Selective Survey
Chengchun Shi
OffRL
195
2
0
22 Feb 2025
Counterfactually Fair Reinforcement Learning via Sequential Data Preprocessing
Counterfactually Fair Reinforcement Learning via Sequential Data Preprocessing
Jitao Wang
C. Shi
John D. Piette
Joshua R. Loftus
Donglin Zeng
Zhenke Wu
OffRL
112
0
0
10 Jan 2025
Online Estimation and Inference for Robust Policy Evaluation in Reinforcement Learning
Online Estimation and Inference for Robust Policy Evaluation in Reinforcement Learning
Weidong Liu
Jiyuan Tu
Yichen Zhang
Xi Chen
OffRL
47
4
0
04 Oct 2023
Doubly Inhomogeneous Reinforcement Learning
Doubly Inhomogeneous Reinforcement Learning
Liyuan Hu
Mengbing Li
C. Shi
Zhanghua Wu
Piotr Fryzlewicz
OffRL
62
2
0
08 Nov 2022
Deeply-Debiased Off-Policy Interval Estimation
Deeply-Debiased Off-Policy Interval Estimation
C. Shi
Runzhe Wan
Victor Chernozhukov
R. Song
OffRL
38
38
0
10 May 2021
Fast Rates for the Regret of Offline Reinforcement Learning
Fast Rates for the Regret of Offline Reinforcement Learning
Yichun Hu
Nathan Kallus
Masatoshi Uehara
OffRL
51
30
0
31 Jan 2021
Minimax Weight and Q-Function Learning for Off-Policy Evaluation
Minimax Weight and Q-Function Learning for Off-Policy Evaluation
Masatoshi Uehara
Jiawei Huang
Nan Jiang
OffRL
103
186
0
28 Oct 2019
Doubly Robust Bias Reduction in Infinite Horizon Off-Policy Estimation
Doubly Robust Bias Reduction in Infinite Horizon Off-Policy Estimation
Ziyang Tang
Yihao Feng
Lihong Li
Dengyong Zhou
Qiang Liu
OffRL
99
68
0
16 Oct 2019
Efficiently Breaking the Curse of Horizon in Off-Policy Evaluation with
  Double Reinforcement Learning
Efficiently Breaking the Curse of Horizon in Off-Policy Evaluation with Double Reinforcement Learning
Nathan Kallus
Masatoshi Uehara
OffRL
49
91
0
12 Sep 2019
Double Reinforcement Learning for Efficient Off-Policy Evaluation in
  Markov Decision Processes
Double Reinforcement Learning for Efficient Off-Policy Evaluation in Markov Decision Processes
Nathan Kallus
Masatoshi Uehara
OffRL
68
185
0
22 Aug 2019
When to Trust Your Model: Model-Based Policy Optimization
When to Trust Your Model: Model-Based Policy Optimization
Michael Janner
Justin Fu
Marvin Zhang
Sergey Levine
OffRL
73
939
0
19 Jun 2019
A Theoretical Analysis of Deep Q-Learning
A Theoretical Analysis of Deep Q-Learning
Jianqing Fan
Zhuoran Yang
Yuchen Xie
Zhaoran Wang
119
601
0
01 Jan 2019
Real-Time Bidding with Multi-Agent Reinforcement Learning in Display
  Advertising
Real-Time Bidding with Multi-Agent Reinforcement Learning in Display Advertising
Junqi Jin
Cheng-Ning Song
Han Li
Kun Gai
Jun Wang
Weinan Zhang
46
179
0
27 Feb 2018
Estimating Dynamic Treatment Regimes in Mobile Health Using V-learning
Estimating Dynamic Treatment Regimes in Mobile Health Using V-learning
Daniel J. Luckett
Eric B. Laber
A. Kahkoska
D. Maahs
E. Mayer‐Davis
Michael R. Kosorok
46
137
0
10 Nov 2016
Statistical inference for the mean outcome under a possibly non-unique
  optimal treatment strategy
Statistical inference for the mean outcome under a possibly non-unique optimal treatment strategy
Alexander Luedtke
M. J. van der Laan
105
220
0
24 Mar 2016
Doubly Robust Off-policy Value Evaluation for Reinforcement Learning
Doubly Robust Off-policy Value Evaluation for Reinforcement Learning
Nan Jiang
Lihong Li
OffRL
157
621
0
11 Nov 2015
Optimal Uniform Convergence Rates and Asymptotic Normality for Series
  Estimators Under Weak Dependence and Weak Conditions
Optimal Uniform Convergence Rates and Asymptotic Normality for Series Estimators Under Weak Dependence and Weak Conditions
Xiaohong Chen
T. Christensen
51
151
0
18 Dec 2014
Performance guarantees for individualized treatment rules
Performance guarantees for individualized treatment rules
Min Qian
Susan Murphy
141
556
0
17 May 2011
Fast learning rates for plug-in classifiers
Fast learning rates for plug-in classifiers
Jean-Yves Audibert
Alexandre B. Tsybakov
412
467
0
17 Aug 2007
1