ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2102.09907
  4. Cited By
Instrumental Variable Value Iteration for Causal Offline Reinforcement
  Learning

Instrumental Variable Value Iteration for Causal Offline Reinforcement Learning

19 February 2021
Luofeng Liao
Zuyue Fu
Zhuoran Yang
Yixin Wang
Mladen Kolar
Zhaoran Wang
    OffRL
ArXivPDFHTML

Papers citing "Instrumental Variable Value Iteration for Causal Offline Reinforcement Learning"

7 / 7 papers shown
Title
Off-policy Evaluation in Infinite-Horizon Reinforcement Learning with
  Latent Confounders
Off-policy Evaluation in Infinite-Horizon Reinforcement Learning with Latent Confounders
Andrew Bennett
Nathan Kallus
Lihong Li
Ali Mousavi
OffRL
47
43
0
27 Jul 2020
Information Theoretic Regret Bounds for Online Nonlinear Control
Information Theoretic Regret Bounds for Online Nonlinear Control
Sham Kakade
A. Krishnamurthy
Kendall Lowrey
Motoya Ohnishi
Wen Sun
49
117
0
22 Jun 2020
Active Learning for Nonlinear System Identification with Guarantees
Active Learning for Nonlinear System Identification with Guarantees
Horia Mania
Michael I. Jordan
Benjamin Recht
65
102
0
18 Jun 2020
Off-policy Policy Evaluation For Sequential Decisions Under Unobserved
  Confounding
Off-policy Policy Evaluation For Sequential Decisions Under Unobserved Confounding
Hongseok Namkoong
Ramtin Keramati
Steve Yadlowsky
Emma Brunskill
OffRL
100
64
0
12 Mar 2020
DualDICE: Behavior-Agnostic Estimation of Discounted Stationary
  Distribution Corrections
DualDICE: Behavior-Agnostic Estimation of Discounted Stationary Distribution Corrections
Ofir Nachum
Yinlam Chow
Bo Dai
Lihong Li
OffRL
87
332
0
10 Jun 2019
Deep Generalized Method of Moments for Instrumental Variable Analysis
Deep Generalized Method of Moments for Instrumental Variable Analysis
Andrew Bennett
Nathan Kallus
Tobias Schnabel
48
125
0
29 May 2019
Counterfactual Off-Policy Evaluation with Gumbel-Max Structural Causal
  Models
Counterfactual Off-Policy Evaluation with Gumbel-Max Structural Causal Models
Michael Oberst
David Sontag
CML
OffRL
43
170
0
14 May 2019
1