ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2104.12820
  4. Cited By
Universal Off-Policy Evaluation

Universal Off-Policy Evaluation

26 April 2021
Yash Chandak
S. Niekum
Bruno C. da Silva
Erik Learned-Miller
Emma Brunskill
Philip S. Thomas
    OffRL
    ELM
ArXivPDFHTML

Papers citing "Universal Off-Policy Evaluation"

13 / 13 papers shown
Title
Counterfactual Inference under Thompson Sampling
Counterfactual Inference under Thompson Sampling
Olivier Jeunen
OffRL
LRM
49
0
0
03 Apr 2025
Reinforcement Learning for Strategic Recommendations
Reinforcement Learning for Strategic Recommendations
Georgios Theocharous
Yash Chandak
Philip S. Thomas
F. D. Nijs
OffRL
33
11
0
15 Sep 2020
Reducing Sampling Error in Batch Temporal Difference Learning
Reducing Sampling Error in Batch Temporal Difference Learning
Brahma S. Pavse
Ishan Durugkar
Josiah P. Hanna
Peter Stone
OffRL
50
12
0
15 Aug 2020
Off-policy Evaluation in Infinite-Horizon Reinforcement Learning with
  Latent Confounders
Off-policy Evaluation in Infinite-Horizon Reinforcement Learning with Latent Confounders
Andrew Bennett
Nathan Kallus
Lihong Li
Ali Mousavi
OffRL
47
43
0
27 Jul 2020
Statistical Bootstrapping for Uncertainty Estimation in Off-Policy
  Evaluation
Statistical Bootstrapping for Uncertainty Estimation in Off-Policy Evaluation
Ilya Kostrikov
Ofir Nachum
OffRL
22
30
0
27 Jul 2020
Parameter-Based Value Functions
Parameter-Based Value Functions
Francesco Faccio
Louis Kirsch
Jürgen Schmidhuber
OffRL
40
25
0
16 Jun 2020
Optimizing for the Future in Non-Stationary MDPs
Optimizing for the Future in Non-Stationary MDPs
Yash Chandak
Georgios Theocharous
Shiv Shankar
Martha White
Sridhar Mahadevan
Philip S. Thomas
OffRL
28
65
0
17 May 2020
Off-policy Policy Evaluation For Sequential Decisions Under Unobserved
  Confounding
Off-policy Policy Evaluation For Sequential Decisions Under Unobserved Confounding
Hongseok Namkoong
Ramtin Keramati
Steve Yadlowsky
Emma Brunskill
OffRL
91
64
0
12 Mar 2020
Double Reinforcement Learning for Efficient Off-Policy Evaluation in
  Markov Decision Processes
Double Reinforcement Learning for Efficient Off-Policy Evaluation in Markov Decision Processes
Nathan Kallus
Masatoshi Uehara
OffRL
66
185
0
22 Aug 2019
Challenges of Real-World Reinforcement Learning
Challenges of Real-World Reinforcement Learning
Gabriel Dulac-Arnold
D. Mankowitz
Todd Hester
OffRL
57
545
0
29 Apr 2019
Statistics and Samples in Distributional Reinforcement Learning
Statistics and Samples in Distributional Reinforcement Learning
Mark Rowland
Robert Dadashi
Saurabh Kumar
Rémi Munos
Marc G. Bellemare
Will Dabney
OffRL
38
89
0
21 Feb 2019
Importance Sampling with Unequal Support
Importance Sampling with Unequal Support
Philip S. Thomas
Emma Brunskill
28
14
0
10 Nov 2016
Data-Efficient Off-Policy Policy Evaluation for Reinforcement Learning
Data-Efficient Off-Policy Policy Evaluation for Reinforcement Learning
Philip S. Thomas
Emma Brunskill
OffRL
188
573
0
04 Apr 2016
1