Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2210.15543
Cited By
Beyond the Return: Off-policy Function Estimation under User-specified Error-measuring Distributions
27 October 2022
Audrey Huang
Nan Jiang
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Beyond the Return: Off-policy Function Estimation under User-specified Error-measuring Distributions"
8 / 8 papers shown
Title
The Optimal Approximation Factors in Misspecified Off-Policy Value Function Estimation
P. Amortila
Nan Jiang
Csaba Szepesvári
OffRL
21
3
0
25 Jul 2023
Offline Reinforcement Learning with Additional Covering Distributions
Chenjie Mao
OffRL
23
0
0
22 May 2023
Minimax Instrumental Variable Regression and
L
2
L_2
L
2
Convergence Guarantees without Identification or Closedness
Andrew Bennett
Nathan Kallus
Xiaojie Mao
Whitney Newey
Vasilis Syrgkanis
Masatoshi Uehara
28
14
0
10 Feb 2023
Offline Minimax Soft-Q-learning Under Realizability and Partial Coverage
Masatoshi Uehara
Nathan Kallus
Jason D. Lee
Wen Sun
OffRL
37
5
0
05 Feb 2023
Reinforcement Learning in Low-Rank MDPs with Density Features
Audrey Huang
Jinglin Chen
Nan Jiang
OffRL
13
14
0
04 Feb 2023
A Review of Off-Policy Evaluation in Reinforcement Learning
Masatoshi Uehara
C. Shi
Nathan Kallus
OffRL
36
47
0
13 Dec 2022
Efficiently Breaking the Curse of Horizon in Off-Policy Evaluation with Double Reinforcement Learning
Nathan Kallus
Masatoshi Uehara
OffRL
16
87
0
12 Sep 2019
Double Reinforcement Learning for Efficient Off-Policy Evaluation in Markov Decision Processes
Nathan Kallus
Masatoshi Uehara
OffRL
38
181
0
22 Aug 2019
1