ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2204.00956
  4. Cited By
Model-Free and Model-Based Policy Evaluation when Causality is Uncertain

Model-Free and Model-Based Policy Evaluation when Causality is Uncertain

2 April 2022
David Bruns-Smith
    CML
    ELM
    OffRL
ArXivPDFHTML

Papers citing "Model-Free and Model-Based Policy Evaluation when Causality is Uncertain"

9 / 9 papers shown
Title
Reinforcement Learning with Continuous Actions Under Unmeasured Confounding
Reinforcement Learning with Continuous Actions Under Unmeasured Confounding
Yuhan Li
Eugene Han
Yifan Hu
Wenzhuo Zhou
Zhengling Qi
Yifan Cui
Ruoqing Zhu
OffRL
292
0
0
01 May 2025
Efficient and Sharp Off-Policy Evaluation in Robust Markov Decision
  Processes
Efficient and Sharp Off-Policy Evaluation in Robust Markov Decision Processes
Andrew Bennett
Nathan Kallus
Miruna Oprescu
Wen Sun
Kaiwen Wang
AAML
OffRL
60
1
0
29 Mar 2024
Why Online Reinforcement Learning is Causal
Why Online Reinforcement Learning is Causal
Oliver Schulte
Pascal Poupart
CML
OffRL
56
1
0
07 Mar 2024
A Survey on Causal Reinforcement Learning
A Survey on Causal Reinforcement Learning
Yan Zeng
Ruichu Cai
Gang Hua
Libo Huang
Zhifeng Hao
CML
51
27
0
10 Feb 2023
Robust Fitted-Q-Evaluation and Iteration under Sequentially Exogenous
  Unobserved Confounders
Robust Fitted-Q-Evaluation and Iteration under Sequentially Exogenous Unobserved Confounders
David Bruns-Smith
Angela Zhou
OffRL
25
9
0
01 Feb 2023
Offline Policy Evaluation and Optimization under Confounding
Offline Policy Evaluation and Optimization under Confounding
Chinmaya Kausik
Yangyi Lu
Kevin Tan
Maggie Makar
Yixin Wang
Ambuj Tewari
OffRL
43
8
0
29 Nov 2022
Learning Mixtures of Markov Chains and MDPs
Learning Mixtures of Markov Chains and MDPs
Chinmaya Kausik
Kevin Tan
Ambuj Tewari
41
11
0
17 Nov 2022
Off-Policy Evaluation for Episodic Partially Observable Markov Decision
  Processes under Non-Parametric Models
Off-Policy Evaluation for Episodic Partially Observable Markov Decision Processes under Non-Parametric Models
Rui Miao
Zhengling Qi
Xiaoke Zhang
OffRL
44
10
0
21 Sep 2022
Double Reinforcement Learning for Efficient Off-Policy Evaluation in
  Markov Decision Processes
Double Reinforcement Learning for Efficient Off-Policy Evaluation in Markov Decision Processes
Nathan Kallus
Masatoshi Uehara
OffRL
55
183
0
22 Aug 2019
1