ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1502.03255
  4. Cited By
Off-policy evaluation for MDPs with unknown structure

Off-policy evaluation for MDPs with unknown structure

11 February 2015
Assaf Hallak
François Schnitzler
Timothy A. Mann
Shie Mannor
    OffRL
ArXivPDFHTML

Papers citing "Off-policy evaluation for MDPs with unknown structure"

7 / 7 papers shown
Title
Factored Adaptation for Non-Stationary Reinforcement Learning
Factored Adaptation for Non-Stationary Reinforcement Learning
Fan Feng
Erdun Gao
Kun Zhang
Sara Magliacane
CML
OffRL
49
32
0
30 Mar 2022
Autoregressive Dynamics Models for Offline Policy Evaluation and
  Optimization
Autoregressive Dynamics Models for Offline Policy Evaluation and Optimization
Michael Ruogu Zhang
T. Paine
Ofir Nachum
Cosmin Paduraru
George Tucker
Ziyun Wang
Mohammad Norouzi
OffRL
30
45
0
28 Apr 2021
Model-Invariant State Abstractions for Model-Based Reinforcement
  Learning
Model-Invariant State Abstractions for Model-Based Reinforcement Learning
Manan Tomar
Amy Zhang
Roberto Calandra
Matthew E. Taylor
Joelle Pineau
24
24
0
19 Feb 2021
Counterfactual Data Augmentation using Locally Factored Dynamics
Counterfactual Data Augmentation using Locally Factored Dynamics
Silviu Pitis
Elliot Creager
Animesh Garg
BDL
OffRL
28
87
0
06 Jul 2020
Estimating Counterfactual Treatment Outcomes over Time Through
  Adversarially Balanced Representations
Estimating Counterfactual Treatment Outcomes over Time Through Adversarially Balanced Representations
Ioana Bica
Ahmed Alaa
James Jordon
M. Schaar
BDL
CML
16
180
0
10 Feb 2020
Using Options and Covariance Testing for Long Horizon Off-Policy Policy
  Evaluation
Using Options and Covariance Testing for Long Horizon Off-Policy Policy Evaluation
Z. Guo
Philip S. Thomas
Emma Brunskill
OffRL
21
2
0
09 Mar 2017
Multi-step Off-policy Learning Without Importance Sampling Ratios
Multi-step Off-policy Learning Without Importance Sampling Ratios
A. R. Mahmood
Huizhen Yu
R. Sutton
OffRL
24
54
0
09 Feb 2017
1