ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2210.15543
  4. Cited By
Beyond the Return: Off-policy Function Estimation under User-specified
  Error-measuring Distributions

Beyond the Return: Off-policy Function Estimation under User-specified Error-measuring Distributions

27 October 2022
Audrey Huang
Nan Jiang
    OffRL
ArXivPDFHTML

Papers citing "Beyond the Return: Off-policy Function Estimation under User-specified Error-measuring Distributions"

8 / 8 papers shown
Title
The Optimal Approximation Factors in Misspecified Off-Policy Value
  Function Estimation
The Optimal Approximation Factors in Misspecified Off-Policy Value Function Estimation
P. Amortila
Nan Jiang
Csaba Szepesvári
OffRL
21
3
0
25 Jul 2023
Offline Reinforcement Learning with Additional Covering Distributions
Offline Reinforcement Learning with Additional Covering Distributions
Chenjie Mao
OffRL
23
0
0
22 May 2023
Minimax Instrumental Variable Regression and $L_2$ Convergence
  Guarantees without Identification or Closedness
Minimax Instrumental Variable Regression and L2L_2L2​ Convergence Guarantees without Identification or Closedness
Andrew Bennett
Nathan Kallus
Xiaojie Mao
Whitney Newey
Vasilis Syrgkanis
Masatoshi Uehara
28
14
0
10 Feb 2023
Offline Minimax Soft-Q-learning Under Realizability and Partial Coverage
Offline Minimax Soft-Q-learning Under Realizability and Partial Coverage
Masatoshi Uehara
Nathan Kallus
Jason D. Lee
Wen Sun
OffRL
37
5
0
05 Feb 2023
Reinforcement Learning in Low-Rank MDPs with Density Features
Reinforcement Learning in Low-Rank MDPs with Density Features
Audrey Huang
Jinglin Chen
Nan Jiang
OffRL
13
14
0
04 Feb 2023
A Review of Off-Policy Evaluation in Reinforcement Learning
A Review of Off-Policy Evaluation in Reinforcement Learning
Masatoshi Uehara
C. Shi
Nathan Kallus
OffRL
36
47
0
13 Dec 2022
Efficiently Breaking the Curse of Horizon in Off-Policy Evaluation with
  Double Reinforcement Learning
Efficiently Breaking the Curse of Horizon in Off-Policy Evaluation with Double Reinforcement Learning
Nathan Kallus
Masatoshi Uehara
OffRL
16
87
0
12 Sep 2019
Double Reinforcement Learning for Efficient Off-Policy Evaluation in
  Markov Decision Processes
Double Reinforcement Learning for Efficient Off-Policy Evaluation in Markov Decision Processes
Nathan Kallus
Masatoshi Uehara
OffRL
38
181
0
22 Aug 2019
1